Tag
Google has expanded the Gemini API File Search tool to support multimodal data, enabling developers to build more efficient and verifiable retrieval-augmented generation (RAG) systems with features like custom metadata filtering and page citations.
Google DeepMind has upgraded Deep Research with arbitrary MCP (Model Context Protocol) support for secure analysis of third-party data, along with automatic generation of presentation-ready visuals.
Google introduces Flex and Priority inference tiers for the Gemini API, offering developers granular control over cost and reliability for synchronous requests. Flex offers 50% cost savings for latency-tolerant tasks, while Priority ensures high reliability for critical applications.
Google releases Veo 3.1 Lite, a cost-effective video generation model available on the Gemini API that costs 50% less than Veo 3.1 Fast while maintaining the same speed. The model supports text-to-video and image-to-video generation with flexible resolutions and aspect ratios.
Google has released Lyria 3, its newest music generation model, available to developers through the Gemini API and Google AI Studio. The model offers two variants: Lyria 3 Pro for full songs and Lyria 3 Clip for shorter clips, with controls for tempo, lyrics, and image-to-music multimodal input.