I built a tool that shows you what GPT-2 is "thinking" in real-time as it generates 3D graph of concept activations per token [R]
Summary
A developer built AXON, a tool that visualizes GPT-2's internal concept activations as a live 3D force graph using Sparse Autoencoders, allowing users to see interpretable features firing before token generation.
Similar Articles
@AlphaSignalAI: This free interactive explainer just exposed how GPT actually works. Most people treat Transformers like magic. You typ…
A free interactive tool called Transformer Explainer runs a live GPT-2 model in the browser, visualizing the internal workings of Transformers with a Sankey diagram and live inference.
Extracting Concepts from GPT-4
OpenAI introduces sparse autoencoders as a method to extract and interpret concepts from large language models like GPT-4, addressing the fundamental challenge of understanding neural network behavior. They release a research paper, code, and feature visualization tools to help researchers train autoencoders at scale and improve AI safety through better interpretability.
Transformer Explainer: Interactive Learning of Text-Generative Models
Transformer Explainer is an interactive visualization tool that allows non-experts to understand the inner workings of the GPT-2 model through real-time experimentation and visualization in a web browser.
@DamiDefi: A developer just mapped every AI concept powering Claude, ChatGPT, and every agent stack you are building on. 20 concep…
A developer created a free 40-minute breakdown explaining 20 key AI concepts behind models like Claude and ChatGPT, covering tokenization, attention, RAG, agents, and more, aiming to provide practical mental models for builders.
You can now read Gemma 3's mind
Anthropic and Neuronpedia released research and tools on Natural Language Autoencoders (NLA), enabling users to view the internal 'thoughts' of Gemma 3 during token generation. The release includes model weights for the Auto Verbalizer and Activation Reconstructor, hosted on Hugging Face and Neuronpedia.