Tag
Travelers deployed an AI Claim Assistant powered by OpenAI's Realtime API nationwide, achieving 85-90% customer completion of claims through AI.
Perplexity shared engineering best practices for adding voice functionality to their AI browser Comet using the OpenAI Realtime API, including key techniques like chunked context feeding, role management, and unified audio pipeline.
OpenAI has launched three new real-time audio models to enable continuous, multitasking voice interactions that prioritize long-context reasoning, live translation, and seamless tool use.
OpenAI is making the Realtime API generally available with a new advanced speech-to-speech model called gpt-realtime, featuring improved instruction following, tool calling, and natural speech quality. New capabilities include MCP server support, image inputs, SIP phone calling, and two new voices (Cedar and Marin).
Genspark launched Super Agent, a no-code AI agent platform powered by GPT-4.1 and OpenAI's Realtime API, enabling users to automate complex real-world tasks like phone calls, slide creation, and video generation. The product reached $36M ARR in 45 days after Genspark pivoted from AI search to agentic AI in April 2025.
OpenAI introduces the Realtime API, enabling developers to build low-latency multimodal speech-to-speech conversational experiences with natural voice interactions powered by GPT-4o. The API supports six preset voices and simplifies development by eliminating the need to integrate multiple models.