Scaling social science research

OpenAI Blog Tools

Summary

OpenAI releases GABRIEL, an open-source toolkit that uses GPT to convert unstructured qualitative data (text, images) into quantitative measurements for social scientists and economists. The tool enables researchers to analyze large-scale qualitative datasets more efficiently by automating repetitive labeling tasks while preserving the richness of human data.

GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists analyze research at scale.
Original Article
View Cached Full Text

Cached at: 04/20/26, 02:52 PM

# Scaling social science research Source: [https://openai.com/index/scaling-social-science-research/](https://openai.com/index/scaling-social-science-research/) OpenAIA new tool to help researchers turn qualitative data into numbers they can analyze\. A core part of our work at OpenAI is enabling scientists to move faster and solve harder problems\. Today, our Economic Research Team is releasing GABRIEL: an open\-source toolkit that uses GPT to turn unstructured text and images into quantitative measurements\. It is designed for economists, social scientists, and data scientists to study qualitative data at scale\. Qualitative data tells the richest stories about the world—what people say, write, teach, argue, and experience\. It spans everything from syllabi and interviews to social media and photographs\. There is a tremendous amount of it\. But transforming that type of data into rigorous evidence is incredibly time\-consuming\. Often it isn't feasible at all\. In too many cases, social scientists are forced to forego important avenues of research, not because the data doesn’t exist, but because it’s impossible to analyze\. GABRIEL is built to make qualitative data much more accessible\. It allows researchers to describe what they want to measure in everyday words—like “how family\-friendly is this job listing?”—and then applies that same question consistently across thousands \(or millions\) of documents, returning a score for each one\. This lets researchers spend less time on repetitive data labeling and more time on the work that actually requires expertise: choosing what to measure, validating results, and drawing careful conclusions\. For example, GABRIEL can analyze a large collection of scientific papers to see what specific methods are used and how they evolve over time\. It can look at course curricula to measure how much attention is given to different subjects or skills\. It can extract structured historical details for every small town across Europe, or examine a trove of customer reviews and discover patterns in what people value most\. In[our paper⁠\(opens in a new window\)](https://cdn.openai.com/pdf/7517a586-5bfa-4b87-bd3d-6ea0e9e844c7/GPT-as-a-measurement-tool.pdf), we benchmark GPT at labeling qualitative data across many use cases and find that it is highly accurate\. Beyond this type of measurement, GABRIEL also provides practical tools researchers often need\. These include merging datasets even when the columns don’t match, smart deduplication, passage coding, ideating new scientific theories, and deidentifying personal information from text to preserve privacy\. GABRIEL is available now as an[open\-source Python library⁠\(opens in a new window\)](https://github.com/openai/GABRIEL), with a[tutorial notebook⁠\(opens in a new window\)](https://colab.research.google.com/drive/1RMUeAWACpViqiUMlPMMwPTKyGU-OX756?usp=sharing)to get started\. It is designed to require minimal technical background\. We’ll keep improving GABRIEL over time based on feedback from the academic community\. We hope this tool will help more researchers bring the richness of qualitative data and human stories into their work\.

Similar Articles

ChatGPT for research

OpenAI Blog

OpenAI Academy introduces ChatGPT for research, featuring Search and Deep Research capabilities to help users move from questions to evidence-backed insights through source synthesis, citation generation, and structured report production.

Empowering teams to unlock insights faster at OpenAI

OpenAI Blog

OpenAI has developed an internal research assistant that combines dashboards with a conversational GPT-5 interface to help teams analyze millions of support tickets and generate insights in minutes instead of weeks. The tool democratizes data analysis across teams, allowing non-technical users to ask questions in plain language and get actionable reports on product feedback, customer sentiment, and trends.

Economic impacts research at OpenAI

OpenAI Blog

OpenAI launches a call for external researchers to study the economic impacts of large language models like GPT-3, ChatGPT, and DALL-E 2, releasing a research agenda and inviting PhD-level collaborators to examine labor market effects, inequality, and policy implications of AI deployment.

Research with ChatGPT

OpenAI Blog

OpenAI Academy introduces two research features for ChatGPT: Search for real-time web information and Deep Research for comprehensive multi-step analysis. These tools help users gather, synthesize, and cite information from across the web more efficiently than traditional browsing.

Analyzing data with ChatGPT

OpenAI Blog

OpenAI Academy publishes a guide on using ChatGPT for data analysis, enabling users to upload files and ask natural language questions to explore, clean, and visualize data without requiring formula or dashboard expertise.