AutoDev: Automated AI-Driven Development

Papers with Code Trending 03/13/24, 07:12 AM Papers

ai-agents software-development code-generation automation docker open-source testing

Summary

AutoDev is an AI-driven software development framework that automates complex engineering tasks, such as code and test generation, within a secure Docker environment. It achieves high performance on the HumanEval dataset by enabling autonomous planning and execution of intricate software engineering tasks.

The landscape of software development has witnessed a paradigm shift with the advent of AI-powered assistants, exemplified by GitHub Copilot. However, existing solutions are not leveraging all the potential capabilities available in an IDE such as building, testing, executing code, git operations, etc. Therefore, they are constrained by their limited capabilities, primarily focusing on suggesting code snippets and file manipulation within a chat-based interface. To fill this gap, we present AutoDev, a fully automated AI-driven software development framework, designed for autonomous planning and execution of intricate software engineering tasks. AutoDev enables users to define complex software engineering objectives, which are assigned to AutoDev's autonomous AI Agents to achieve. These AI agents can perform diverse operations on a codebase, including file editing, retrieval, build processes, execution, testing, and git operations. They also have access to files, compiler output, build and testing logs, static analysis tools, and more. This enables the AI Agents to execute tasks in a fully automated manner with a comprehensive understanding of the contextual information required. Furthermore, AutoDev establishes a secure development environment by confining all operations within Docker containers. This framework incorporates guardrails to ensure user privacy and file security, allowing users to define specific permitted or restricted commands and operations within AutoDev. In our evaluation, we tested AutoDev on the HumanEval dataset, obtaining promising results with 91.5% and 87.8% of Pass@1 for code generation and test generation respectively, demonstrating its effectiveness in automating software engineering tasks while maintaining a secure and user-controlled development environment.
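The abstract says that users can define permitted or restricted commands and that all operations are confined to Docker containers, but it does not spell out the rule format. The following is a minimal sketch of how such a guardrail check might look, not AutoDev's actual implementation. The `Guardrails` class, the `docker_argv` helper, the default image name, and the choice to disable networking are all assumptions made for illustration.

```python
import shlex
from dataclasses import dataclass, field


@dataclass
class Guardrails:
    """Hypothetical rule set; not AutoDev's real configuration schema."""

    permitted: set[str] = field(default_factory=set)   # executables the agent may run
    restricted: set[str] = field(default_factory=set)  # executables that are always denied

    def allows(self, command: str) -> bool:
        """Return True only if the executable is permitted and not restricted."""
        try:
            tokens = shlex.split(command)
        except ValueError:
            return False  # e.g. unbalanced quotes: reject rather than guess
        if not tokens:
            return False
        executable = tokens[0]
        if executable in self.restricted:
            return False  # a restriction wins over a permission
        return executable in self.permitted


def docker_argv(command: str, rules: Guardrails, image: str = "python:3.11-slim") -> list[str]:
    """Build (but do not run) a `docker run` argument list for an approved command.

    The command is passed as an argument vector, so no shell inside the
    container interprets operators such as `;` or `&&`.
    """
    if not rules.allows(command):
        raise PermissionError(f"command rejected by guardrails: {command!r}")
    return ["docker", "run", "--rm", "--network", "none", image, *shlex.split(command)]
```

In this sketch a restricted entry overrides a permitted one, so listing `git` in both sets still blocks `git push`. The returned list could be handed to `subprocess.run` by a caller that actually wants to start the container.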


Cached at: 05/08/26, 08:45 AM

Paper page - AutoDev: Automated AI-Driven Development

Source: https://huggingface.co/papers/2403.08299 Published on Mar 13, 2024





Similar Articles

AutoScout24 scales engineering with AI-powered workflows

@tom_doerr: Turns AI coding chats into a repeatable engineering workflow https://github.com/codeaholicguy/ai-devkit…

My Homelab AI Dev Platform

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

@tom_doerr: Runs a virtual company with 14 expert AI agents https://github.com/MaxMiksa/Auto-Company…
