whole-repository-generation

Tag

Cards List
#whole-repository-generation

DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

Hugging Face Daily Papers · 5d ago Cached

DeNovoSWE is a large-scale dataset for training code agents to generate entire software repositories from documentation, using a sandboxed agentic workflow and difficulty-aware filtering. Fine-tuning Qwen3-30B-A3B on it boosts performance on the BeyondSWE-Doc2Repo benchmark from 5.8% to 47.2%.

0 favorites 0 likes
← Back to home

Submit Feedback