Tag
vLLM version 0.20.1rc0 is released, adding a system_fingerprint field to OpenAI-compatible API responses for better request tracking.
NVIDIA offers free AI inference via DGX Cloud with OpenAI-compatible API for popular models like DeepSeek, MiniMax, Kimi, GLM, and Llama, claimable in 5 minutes.
SwiftLM is a Swift-native LLM inference server for Apple Silicon that runs large models without Python, using SSD streaming to load MoE weights and enabling 122B models on 64 GB Macs.
OpenAI Node.js SDK v6.33.0 release providing TypeScript/JavaScript access to OpenAI APIs with support for the new Responses API and workload identity authentication across Kubernetes, Azure, and GCP.
OpenAI Node.js SDK v6.31.0 release - TypeScript/JavaScript library for accessing OpenAI's REST API with support for Chat Completions and Responses APIs, featuring workload identity authentication for cloud environments.
TRUSTBANK has integrated AI agents powered by OpenAI's API into its Furusato Choice platform to help users personalize hometown tax donation gifts through conversational recommendations and intelligent search.
India-based fintech CRED partnered with OpenAI to build AI-powered tools—Cleo (customer-facing chatbot), Thea (agent support), and Stark (operations SOP management)—resulting in a 14-point CSAT improvement and 98% resolution accuracy. The company is expanding these AI capabilities across all business lines to deliver concierge-like experiences at scale.
Rogo, an enterprise AI finance platform, scales its AI-driven financial research using OpenAI's models (GPT-4o, o1, o1-mini) to serve 5,000+ bankers across investment banks and private equity firms. The platform has achieved 27x ARR growth by automating financial analysis tasks and saving analysts 10+ hours weekly on meeting prep, company profiling, and market research.
Rox, a startup leveraging OpenAI's models and APIs, launches an AI-powered sales platform that unifies fragmented data and uses agent swarms to boost sales rep productivity by 50% and increase customer engagement by 35%.
Ada uses GPT-4 and a multi-agent system powered by OpenAI's API to improve customer service quality, doubling resolution rates from 30% to 60-80% while maintaining high containment rates, establishing a new industry standard beyond traditional metrics.
Oscar Health has successfully deployed OpenAI's API to automate clinical documentation and claims processing, reducing documentation time by 40% and claims resolution time by 50%, while establishing an AI Pod to guide responsible AI adoption across the organization.
Zelma, a GPT-4 powered research assistant developed by Dr. Emily Oster and her team at Brown University in partnership with Novy, makes standardized test data accessible to parents, teachers, administrators, and policymakers by allowing natural language queries about student performance across districts and demographics.
Ironclad integrated OpenAI's GPT-4 into its AI Assist™ contract review tool, enabling legal teams to automate contract editing while maintaining human oversight and data privacy. The integration has been live since GPT-4's launch in April 2023.