@GitHub_Daily: Want to understand how Large Language Models actually work? Existing resources are either too academic and hard to digest, or too superficial, focusing only on concepts, with nothing that clearly explains the entire process from start to finish. Similarly, I came across the 'how-llms-work' project, which turns the complete workflow of LLMs into a visual interactive webpage, based on Andrej Karpathy’s...

X AI KOLs Timeline 05/11/26, 04:00 AM Tools

large-language-models visualization education karpathy interactive-guide open-source

Summary

An interactive visual guide, 'how-llms-work', breaks down the entire lifecycle of Large Language Models based on Andrej Karpathy's lectures, covering data collection to post-training.

If you want to understand how Large Language Models actually work, the resources you find are often too academic and hard to digest, or too superficial, focusing only on concepts, with nothing that clearly explains the entire process from start to finish. Similarly, I came across the 'how-llms-work' project, which turns the complete workflow of LLMs into a visual interactive webpage, based on Andrej Karpathy’s classic lectures. From web data collection and tokenization to neural network training, and finally to inference generation and post-training alignment, each stage is accompanied by intuitive diagrams and explanations, allowing you to see the entire process at a glance. GitHub: http://github.com/ynarwal/how-llms-work It also covers advanced topics such as hallucinations, context windows, and Retrieval-Augmented Generation (RAG), helping us understand the capabilities and boundaries of models as well as their practical application methods. If you want to understand the entire process of LLMs from training to conversation without wading through academic papers, this tutorial is worth spending ten minutes to read.

Original Article

View Cached Full Text

Cached at: 05/12/26, 12:47 AM

If you’re curious about how large language models actually work, you’ll likely find that most resources are either too academic and dense, or too superficial, focusing only on high-level concepts. It’s rare to find content that clearly explains the entire process from start to finish.

Coincidentally, I came across the how-llms-work project, which presents the complete lifecycle of large models as a visual, interactive web page. The content is based on Andrej Karpathy’s classic lectures. From web data collection, tokenization, and neural network training, to inference, generation, and post-training alignment, each stage is illustrated with intuitive diagrams and explanations, allowing you to grasp the entire workflow in a single view.

GitHub: http://github.com/ynarwal/how-llms-work

It also covers advanced topics such as hallucinations, context windows, and Retrieval-Augmented Generation (RAG), helping us understand the model’s capability boundaries and practical application methods. If you want to understand the entire process of large models—from training to conversational interaction—without wading through academic papers, this tutorial is worth spending ten minutes on.

ynarwal/how-llms-work

Source: https://github.com/ynarwal/how-llms-work

How LLMs Actually Work

A visual, interactive guide to how large language models are built — from raw internet text to a conversational assistant.

Live site: https://ynarwal.github.io/how-llms-work/

Based on Andrej Karpathy’s Intro to Large Language Models (https://www.youtube.com/watch?v=zjkBMFhNj_g) lecture.

What’s inside

Data Collection — how the web is scraped and filtered into training data (Common Crawl, FineWeb)
Tokenization — how text is broken into subword tokens via Byte Pair Encoding (BPE)
Neural Network Training — the loss function, gradient descent, and what a forward pass looks like
Inference & Sampling — how the model generates text token by token, and how temperature works
The Base Model — what a model knows after pre-training and what it can’t do yet
Post-Training — RLHF, instruction tuning, and how a base model becomes an assistant
LLM Psychology — hallucinations, context windows, and how to think about what models “know”
RAG — retrieval-augmented generation: embeddings, vector search, and context injection
Full Pipeline Summary — end-to-end visual of every stage

Files

File	Description
`index.html`	Main site (v2 redesign)
`v1.html`	Original dark-theme version
`transcript.txt`	Full Karpathy lecture transcript
`council.py`	LLM council fact-checker (runs via `uv run council.py`)
`report.html`	Latest council fact-check report

HN discussion

Posted to Hacker News (https://news.ycombinator.com/item?id=47886517) and generated heated debate, mostly about it being LLM-generated. Fair point — but the content isn’t the AI’s. Every claim, figure, and framing is traced directly to Karpathy’s lecture, not hallucinated by a model.

Vibe check

The code and content in this repo is mostly LLM-generated (Claude via Claude Code). The ideas, direction, and editorial decisions are mine — the implementation was largely written by AI. The council fact-checker exists precisely because of this: automated content warrants automated verification.

ynarwal/how-llms-work

How LLMs Actually Work

What’s inside

Files

HN discussion

Vibe check

Similar Articles

@mdancho84: This 277-page PDF unlocks the secrets of Large Language Models. Here's what's inside:

@royvanrijn: For curious developers I built "The Anatomy of an LLM", an interactive explainer showing how text becomes tokens, vecto…

@phosphenq: This 2 hour video by Andrej Karpathy (co-founder of OpenAI) will teach you more about using LLMs than every AI tutorial…

Submit Feedback

Similar Articles

@GitHub_Daily: To dive deep into model research, you can't just stay at the application layer—you need to understand how the underlying system is trained and optimized. I stumbled upon LLMSys-PaperList, a carefully curated collection of papers related to large model systems. It is continuously updated from 2022 to the latest top conference papers in 2026, and organized by categories such as training, inference, multimodality...

@Xx15573208: I've read many articles about Transformers and understand the theory, but when I actually sit down to write code, I have no idea where to start. LLMs-from-scratch is specifically designed to solve this problem: it accompanies the book "Build a Large Language Model" and guides you through implementing GPT from scratch using PyTorch…

@mdancho84: This 277-page PDF unlocks the secrets of Large Language Models. Here's what's inside:

@royvanrijn: For curious developers I built "The Anatomy of an LLM", an interactive explainer showing how text becomes tokens, vecto…

@phosphenq: This 2 hour video by Andrej Karpathy (co-founder of OpenAI) will teach you more about using LLMs than every AI tutorial…