Tag
Waymark fine-tunes GPT-3 to automatically generate marketing copy and video scripts, solving a key customer pain point where two-thirds of users struggled with scriptwriting for video ads.
An article on using GPT-3 to create advanced AI-powered characters, likely for gaming, interactive media, or virtual environments.
OpenAI researchers demonstrate that GPT-3 can learn to express calibrated uncertainty about its answers in natural language without using model logits, introducing the CalibratedMath benchmark suite to evaluate this capability. The approach shows robust generalization under distribution shift and represents the first evidence of models expressing well-calibrated verbal uncertainty about their own predictions.
OpenAI Codex, a natural language-to-code system based on GPT-3, is now powering 70+ applications across various use cases including GitHub Copilot. Azure OpenAI Service has expanded availability to limited preview, enabling enterprise access to Codex and other OpenAI models.
OpenAI announces new Edit and Insert capabilities for GPT-3 and Codex, enabling mid-file code completion and text editing. The Insert feature is being piloted in GitHub Copilot and is now available in beta via the completions API.
OpenAI introduces InstructGPT, a GPT-3 variant fine-tuned using reinforcement learning from human feedback (RLHF) to better follow instructions and reduce harmful outputs. Human evaluators prefer outputs from a 1.3B-parameter InstructGPT model over those from a 175B-parameter GPT-3 model, and InstructGPT models are now the default on OpenAI's API.
OpenAI fine-tuned GPT-3 to answer open-ended questions more accurately by enabling it to use a text-based web browser to search, retrieve, and cite sources. The model's answers are preferred over those of human demonstrators 56% of the time on questions from the ELI5 dataset, but the model shows limitations on out-of-distribution tasks such as TruthfulQA.
OpenAI has launched fine-tuning capabilities for GPT-3, allowing developers to customize the model on their own data via a single CLI command, resulting in improved accuracy, reduced costs, and lower latency for production use cases. Early customers like Keeper Tax, Viable, and Sana Labs report significant accuracy improvements after fine-tuning.
OpenAI removes the waitlist for its GPT-3 API, allowing developers in supported countries to immediately access the service. The announcement highlights new safety features, the Instruct Series models, content filtering tools, and the introduction of Codex for code generation.
OpenAI trained a system that uses verifiers to solve grade school math word problems, reaching 90% of child-level accuracy and nearly doubling the accuracy of fine-tuned GPT-3. The approach addresses language models' weakness in multistep reasoning by training verifiers to evaluate candidate solutions and select the best one.
OpenAI announces that, nine months after launch, over 300 applications are using GPT-3 through its API and together generate 4.5 billion words daily. Featured use cases include Viable for customer feedback analysis, Fable Studio for interactive storytelling, and Algolia for semantic search.
A comprehensive discussion summary from OpenAI and Stanford researchers examining GPT-3's technical capabilities, limitations, and broader societal implications across multiple disciplines including computer science, linguistics, philosophy, and policy.
OpenAI announces VP of Research Dario Amodei's departure after nearly five years to start a new AI research-focused project, while promoting Mira Murati to SVP of Research, Product, and Partnerships to strengthen focus on safety integration.
OpenAI has licensed GPT-3 technology to Microsoft as part of their multiyear partnership, allowing Microsoft to incorporate the model into its own products and services, while developers continue to access GPT-3 through OpenAI's API.
OpenAI introduces GPT-3, a 175-billion-parameter autoregressive language model that demonstrates strong few-shot learning across diverse NLP tasks without gradient updates or fine-tuning. It represents a paradigm shift in how language models can be applied to new tasks: through text interaction alone.