A research agenda for assessing the economic impacts of code generation models

OpenAI Blog Papers

Summary

OpenAI is laying out a research agenda to assess the economic impacts of code generation models like Codex, covering areas such as productivity, employment, skill development, and inequality, while inviting external researchers to collaborate.


Cached at: 04/20/26, 02:55 PM

# A research agenda for assessing the economic impacts of code generation models

Source: [https://openai.com/index/economic-impacts-research/](https://openai.com/index/economic-impacts-research/)

OpenAI is developing a research program to assess the economic impacts of code generation models and is inviting collaboration with external researchers.

Rapid advances in the capabilities of large language models (LLMs) trained on code have made it increasingly important to study their economic impacts on individuals, firms, and society. Codex – an LLM developed by OpenAI by fine-tuning GPT-3 on billions of lines of publicly available code from GitHub – has been shown to generate functionally correct code 28.8% of the time on a sample of evaluation problems (Chen et al. 2021). This may have important implications for the future of coding and the economics of the industries that depend on it.

In this document, we lay out a research agenda to assess the effects of Codex on economic factors of interest to policymakers, firms, and the public. We make a case for this research agenda by highlighting the potentially broad applicability of code generation models to software development, the potential for other LLMs to create significant social and economic impact as model capabilities advance, and the value of using Codex to generate evidence and establish methodologies that may be applicable to research on the economic impacts of future models.

We propose that academic and policy research focus on studying code generation models and other LLMs so that evidence on their economic impacts can be used to inform decision-making in three key areas: Deployment policy, AI system design, and public policy. To help guide this research, we outline six priority outcome areas within the realm of economic impacts that we intend to use Codex to study: Productivity, Employment, Skill Development, Inter-firm Competition, Consumer Prices, and Economic Inequality. For each area, we briefly discuss previous literature on the impacts of artificial intelligence on each of these outcomes, describe questions that we believe to be key inputs to the three decision-making areas mentioned above, and provide examples of research that could be conducted with Codex.

To catalyze work that builds off of this initial research agenda, we are announcing a [Call for Expressions of Interest](https://openai.com/index/economic-impacts/) from external researchers to collaborate with OpenAI researchers and customers to better measure the economic impacts of code generation models and other LLMs.

Similar Articles

Economic impacts research at OpenAI

OpenAI Blog

OpenAI launches a call for external researchers to study the economic impacts of large language models like GPT-3, ChatGPT, and DALL-E 2, releasing a research agenda and inviting PhD-level collaborators to examine labor market effects, inequality, and policy implications of AI deployment.

OpenAI’s new economic analysis

OpenAI Blog

OpenAI releases economic analysis showing ChatGPT's productivity impact on workers and announces a 12-month research collaboration with Harvard and AEI economists to assess AI's effects on jobs and workforce, establishing a new OpenAI Workshop in Washington, DC.

Powering next generation applications with OpenAI Codex

OpenAI Blog

OpenAI Codex, a natural language-to-code system based on GPT-3, is now powering 70+ applications across various use cases including GitHub Copilot. Azure OpenAI Service has expanded availability to limited preview, enabling enterprise access to Codex and other OpenAI models.

Harness engineering: leveraging Codex in an agent-first world

OpenAI Blog

OpenAI describes an internal experiment building a production software product with zero manually-written code using Codex agents, achieving approximately 10x faster development with 1.5M lines of code written by AI in five months. The team learned that effective agent-driven development requires engineers to focus on systems design, scaffolding, and feedback loops rather than writing code directly.