OpenAI is releasing GPT-5.3-Codex-Spark, a smaller, ultra-low-latency coding model optimized for real-time collaboration, delivering over 1000 tokens per second on Cerebras hardware. It is available as a research preview to ChatGPT Pro users and marks the first milestone in OpenAI's partnership with Cerebras.
DeepMind introduces D4RT, a unified AI model for dynamic 4D scene reconstruction and tracking that is up to 300x more efficient than previous methods. The model uses a query-based Transformer architecture to solve complex spatial and temporal tasks for robotics and AR applications.
Google has launched Gemini 3 Pro, a new AI model designed to outperform previous versions in coding, agentic workflows, and multimodal reasoning. The model is available via the Gemini API, Google AI Studio, and the new Google Antigravity development platform.
OpenAI has released Sora 2, an advanced video generation model for AI-powered content creation.
Google releases Gemini 2.5 Pro Preview (I/O edition) with significantly improved coding capabilities, ranking #1 on the WebDev Arena leaderboard for frontend development and enabling advanced features like video-to-code generation.
Google announces Gemini 2.5 Flash, a new hybrid reasoning model available in preview through the Gemini API. The model features toggleable thinking capabilities, fine-grained thinking budgets for quality-cost-latency tradeoffs, and maintains fast inference speeds while improving performance over 2.0 Flash.
Google has developed DolphinGemma, a large language model designed to learn and generate dolphin vocalizations. Built in collaboration with Georgia Tech and the Wild Dolphin Project, it aims to advance understanding of dolphin communication patterns and enable potential interspecies dialogue.
Google announced Gemini 2.5, which it describes as its most intelligent AI model. Gemini 2.5 Pro Experimental leads the LMArena leaderboard by a significant margin and, as a thinking model, demonstrates enhanced reasoning and coding capabilities.
OpenAI has launched Sora, its video generation technology, to the public with safety measures including C2PA metadata, watermarks, and safeguards against abuse. The system has known limitations in rendering physics and complex actions.
A video from Google DeepMind titled 'Project Genie | Silver Sphere' has no audio track, making it impossible to extract technical details. The title suggests an AI model or project announcement.
Google DeepMind introduced Lyria 3 Pro in a promotional video that features background music but gives no technical details.
Black Forest Labs has released Flux 2 Pro, a new image generation and editing model with improved text rendering, photorealism, and character consistency. The model is available via API on Replicate.
Pruna's p-image-edit is a premium AI model on Replicate that performs state-of-the-art image editing in under one second, combining speed, affordability, and high visual quality with precise prompt adherence and text rendering.