DeepCode: Open Agentic Coding

Papers with Code Trending 12/08/25, 04:07 PM Papers

agentic-coding autonomous-agents code-generation llm scientific-reproduction document-to-code benchmark

Summary

DeepCode is a fully autonomous framework for document-to-codebase synthesis that uses principled information-flow management to convert scientific papers into production-grade code, achieving state-of-the-art results on PaperBench and surpassing PhD-level human experts.

Recent advances in large language models (LLMs) have given rise to powerful coding agents, making it possible for code assistants to evolve into code engineers. However, existing methods still face significant challenges in achieving high-fidelity document-to-codebase synthesis--such as scientific papers to code--primarily due to a fundamental conflict between information overload and the context bottlenecks of LLMs. In this work, we introduce DeepCode, a fully autonomous framework that fundamentally addresses this challenge through principled information-flow management. By treating repository synthesis as a channel optimization problem, DeepCode seamlessly orchestrates four information operations to maximize task-relevant signals under finite context budgets: source compression via blueprint distillation, structured indexing using stateful code memory, conditional knowledge injection via retrieval-augmented generation, and closed-loop error correction. Extensive evaluations on the PaperBench benchmark demonstrate that DeepCode achieves state-of-the-art performance, decisively outperforming leading commercial agents such as Cursor and Claude Code, and crucially, surpassing PhD-level human experts from top institutes on key reproduction metrics. By systematically transforming paper specifications into production-grade implementations comparable to human expert quality, this work establishes new foundations for autonomous scientific reproduction that can accelerate research evaluation and discovery.

Original Article

View Cached Full Text

Cached at: 05/08/26, 12:27 PM

Paper page - DeepCode: Open Agentic Coding

Source: https://huggingface.co/papers/2512.07921 Published on Dec 8, 2025

Submitted byhttps://huggingface.co/taesiri

taesirion Dec 10, 2025

Abstract

DeepCode, a fully autonomous framework, addresses the challenges of document-to-codebase synthesis by optimizing information flow through source compression, structured indexing, knowledge injection, and error correction, achieving state-of-the-art performance and surpassing human experts.

Recent advances inlarge language models(LLMs) have given rise to powerfulcoding agents, making it possible for code assistants to evolve into code engineers. However, existing methods still face significant challenges in achieving high-fidelitydocument-to-codebase synthesis--such as scientific papers to code--primarily due to a fundamental conflict betweeninformation overloadand thecontext bottlenecksof LLMs. In this work, we introduceDeepCode, a fully autonomous framework that fundamentally addresses this challenge through principled information-flow management. By treating repository synthesis as achannel optimizationproblem,DeepCodeseamlessly orchestrates four information operations to maximize task-relevant signals under finite context budgets: source compression viablueprint distillation, structured indexing usingstateful code memory, conditional knowledge injection viaretrieval-augmented generation, andclosed-loop error correction. Extensive evaluations on thePaperBenchbenchmark demonstrate thatDeepCodeachieves state-of-the-art performance, decisively outperforming leading commercial agents such as Cursor and Claude Code, and crucially, surpassing PhD-level human experts from top institutes on key reproduction metrics. By systematically transforming paper specifications into production-grade implementations comparable to human expert quality, this work establishes new foundations forautonomous scientific reproductionthat can accelerate research evaluation and discovery.

View arXiv page View PDF GitHub15.4k Add to collection

Get this paper in your agent:

hf papers read 2512\.07921

Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2512.07921 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2512.07921 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2512.07921 in a Space README.md to link it from this page.

Collections including this paper18

Browse 18 collections that include this paper

DeepCode: Open Agentic Coding

Paper page - DeepCode: Open Agentic Coding

Abstract

Models citing this paper0

Datasets citing this paper0

Spaces citing this paper0

Collections including this paper18

Similar Articles

CodeAlchemy: Synthetic Code Rewriting at Scale

Building Decypher: An Execution Context Engine for Agents

@RealCodedAlpha: https://x.com/RealCodedAlpha/status/2064921935507837260

Harness engineering: leveraging Codex in an agent-first world

Codex-maxxing

Submit Feedback

Similar Articles

CodeAlchemy: Synthetic Code Rewriting at Scale

Building Decypher: An Execution Context Engine for Agents

@RealCodedAlpha: https://x.com/RealCodedAlpha/status/2064921935507837260

Harness engineering: leveraging Codex in an agent-first world