@rohit4verse: Every night you're not running an autonomous research agent, you're hand-running experiments someone else automated mon…
Summary
Andrej Karpathy open-sourced an autonomous research agent that runs its own ML experiments overnight using a single GPU, automatically iterating on improvements by editing code and keeping changes that lower validation loss.
View Cached Full Text
Cached at: 05/25/26, 12:53 PM
Every night you’re not running an autonomous research agent, you’re hand-running experiments someone else automated months ago.
Most people are still hunting for the “right” setup. Frameworks, orchestration, glue code.
You don’t need any of it. Andrej Karpathy open-sourced his own version that runs its own ML research. One GPU. ~100 experiments overnight. You never touch the Python.
Here’s the exact setup (takes 2 minutes):
- Clone it: (repo link in comments)
- uv sync, then uv run prepare[.]py
- uv run train[.]py once to confirm the baseline runs
- Point your coding agent at program.md and walk away
The agent edits one file, trains 5 minutes, keeps the change if val_bpb drops, reverts it if it doesn’t. Git is the memory. The metric is the judge.
You wake up to a staircase of validated improvements, not a backlog of ideas you never tested.
Similar Articles
@lftherios: 1/ Autoresearch from @karpathy has been one of the most interesting agentic patterns to emerge this year. The challenge…
Andrej Karpathy's autoresearch pattern highlights how current AI agents run experiments in isolation, wasting compute by duplicating work and rediscovering dead ends.
@sitinme: Saw Karpathy open-sourced a very interesting project autoresearch, which gives a real but small-scale LLM training task to an AI Agent, letting it do research, modify code, run experiments, look at results, and then decide whether to keep or discard the changes. The project is based on a single NVIDIA…
Karpathy open-sourced an experimental project, autoresearch, that lets an AI Agent automatically complete the research loop for small-scale LLM training: modify code, run experiments, evaluate results, and iterate. Humans only need to write the research plan and constraints.
@JeremyNguyenPhD: "I left 3 AI agents alone with a research problem overnight. They came back with 72 peer-reviewed papers" -- @ProfJieDi…
Professor Jie Ding open-sourced Autoresearch and WorldSeed, AI agent frameworks capable of autonomously reviewing 72 peer-reviewed papers overnight to address a research problem.
@seelffff: > reads papers on arXiv autonomously > finds and checks datasets on HF Hub > writes the training script itself > genera…
Hugging Face open-sourced ml-intern, an autonomous agent that performs the entire ML post-training loop—reading papers, finding datasets, writing scripts, generating data, monitoring training, and uploading weights—achieving significant GPQA improvement with a 1.7B model in 10 hours without human intervention.
@rohit4verse: karpathy hasn't typed a line of code since december. he calls the state ai psychosis. 16 hours a day expressing his wil…
Andrej Karpathy has reportedly stopped writing code since December, instead using AI agents for macro-level delegation, auto-research loops, and home automation, optimizing token throughput and removing himself from loops to run systems autonomously.