Tag
A personal reflection on the rapid evolution of AI over the past three years, from early ChatGPT and GPT-4 quotas to BabyAGI, DALL·E, and voice cloning.
OpenAI releases GABRIEL, an open-source toolkit that uses GPT to convert unstructured qualitative data (text, images) into quantitative measurements for social scientists and economists. The tool enables researchers to analyze large-scale qualitative datasets more efficiently by automating repetitive labeling tasks while preserving the richness of human data.
Higgsfield is a generative media platform that uses GPT-4.1, GPT-5, and Sora 2 to turn simple product links or ideas into cinematic short-form social videos, generating roughly 4 million videos per day with a 'cinematic logic layer' that translates user intent into structured video plans.
OpenAI announces a strengthened safety ecosystem through external third-party testing and evaluations of frontier AI models, including independent assessments, methodology reviews, and subject-matter expert probing. The company commits to transparency by publicly sharing third-party assessment results and supporting independent evaluations since GPT-4's launch.
Blue J demonstrates how to scale AI expertise in complex regulated domains by combining GPT-4.1 with retrieval-augmented generation over curated tax documents, achieving <0.14% error rates and 70% weekly user engagement through rigorous feedback loops and domain-specific optimization.
Intercom shares three lessons from rapidly adopting AI to transform their customer service platform: testing models early and deeply, building AI-first from the ground up rather than bolting it on, and using rigorous evaluation processes to quickly adopt new models like GPT-4.1.
Morgan Stanley has successfully deployed AI solutions powered by GPT-4 across its wealth management division, with over 98% of advisor teams using the internal AI Assistant chatbot. The deployment was enabled by a robust evaluation framework that tests AI performance on real-world use cases like document summarization and multilingual translation before production rollout.
Arco Educação, Brazil's largest educational operating system, is partnering with OpenAI to launch the Teacher Assistant, an AI tool powered by GPT-4 that helps teachers create personalized lesson plans for students with diverse learning needs, significantly reducing time spent on administrative tasks and lesson planning.
Ada uses GPT-4 and a multi-agent system powered by OpenAI's API to improve customer service quality, doubling resolution rates from 30% to 60-80% while maintaining high containment rates, establishing a new industry standard beyond traditional metrics.
Upwork integrates OpenAI's models across its platform to launch AI-powered features including Job Post Generator, Upwork Chat Pro, and Uma (Upwork's Mindful AI), improving user productivity and spending metrics.
Indeed deployed a fine-tuned smaller GPT model with OpenAI to deliver contextual job matching for millions of users, achieving 60% token efficiency improvement through fine-tuning and GPT-4-based data augmentation on dedicated instances.
OpenAI introduces Rule-Based Rewards (RBRs), a method to improve AI model safety by using explicit rules instead of human feedback in reinforcement learning. RBRs have been integrated into GPT-4 and subsequent models to maintain safety-helpfulness balance while reducing reliance on human feedback collection.
OpenAI introduced CriticGPT, a GPT-4-based model designed to catch errors in ChatGPT's code output. When human trainers use CriticGPT for code review, they outperform those without assistance 60% of the time, addressing a fundamental limitation of RLHF as models become increasingly capable.
Clay launched Claygent, an AI-powered web scraper using GPT-4 that enables intelligent data extraction and sales prospecting with 10x growth potential. The tool optimizes token usage through smart model selection and binary search approaches, with plans for vertical-specific fine-tuning and proactive trigger-based features.
OpenAI introduces sparse autoencoders as a method to extract and interpret concepts from large language models like GPT-4, addressing the fundamental challenge of understanding neural network behavior. They release a research paper, code, and feature visualization tools to help researchers train autoencoders at scale and improve AI safety through better interpretability.
MavenAGI launches an automated customer support platform powered by GPT-4 that processes customer data, integrates with enterprise systems, and uses self-evaluation to provide human-quality support responses. The system has been validated on over 1M customer interactions across Salesforce, Zendesk, and other platforms.
Canva's AI-powered Magic Studio has been used 5 billion times, leveraging OpenAI's GPT-4 to integrate generative AI tools for text, image, and video creation while maintaining an intuitive user experience for 175 million monthly users.
OpenAI announced GPT-4 API general availability and deprecated older completion models (GPT-3 base models and text-davinci-003), requiring developers to migrate to new models like gpt-3.5-turbo-instruct or newer by January 4, 2024. Fine-tuned models will need to be retrained on new base models with priority access offered for GPT-3.5 Turbo and GPT-4 fine-tuning.
OpenAI announced its first Asian office in Tokyo, Japan, and released a GPT-4 custom model optimized for Japanese language with 3x faster performance and 47% reduced token costs. The expansion includes appointing Tadao Nagasaki as President of OpenAI Japan and partnerships with major Japanese companies and government institutions.
Zelma, a GPT-4 powered research assistant developed by Dr. Emily Oster and her team at Brown University in partnership with Novy, makes standardized test data accessible to parents, teachers, administrators, and policymakers by allowing natural language queries about student performance across districts and demographics.