Tag
Cactus-Compute released Needle, a 26M parameter open-source model distilled from Gemini for efficient on-device function calling using a novel Simple Attention Network architecture without MLPs.
OpenAI releases o1 model to API with production-ready features including function calling, structured outputs, vision capabilities, and 60% lower latency than o1-preview. Additional developer tools include Realtime API improvements, Preference Fine-Tuning, and new Go and Java SDKs.
OpenAI announced the Assistants API, enabling developers to build agent-like experiences with persistent threads, Code Interpreter, Retrieval, and function calling capabilities. The company also launched a Custom Models program for organizations to train domain-specific GPT-4 models with exclusive access.
OpenAI announces function calling capability for GPT-4 and GPT-3.5-turbo models, allowing developers to describe functions via JSON Schema and have models intelligently choose to output structured JSON for external tool integration. The update also extends support for older model versions until June 2024 and improves model evaluation methodology.