@dlouapre: Meet physics-intern, our agentic framework for theoretical physics. It takes Gemini 3.1 Pro from 17.7% to 31.4% on Crit…
Summary
Physics-intern is an agentic framework for theoretical physics that raises Gemini 3.1 Pro's score on the CritPt benchmark from 17.7% to 31.4% (a gain of 13.7 percentage points), setting a new state of the art.
Similar Articles
Agentic harness for theoretical physics research
Hugging Face releases 'physics-intern', an agentic framework for theoretical physics research that doubles the performance of Gemini models on the CritPt benchmark and sets a new state of the art, ahead of GPT-5.5 Pro.
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
DeepMind announces Gemini Deep Think's ability to solve professional research problems in mathematics, physics, and computer science, highlighted by a new agent 'Aletheia' that iteratively verifies and revises solutions.
AlphaEvolve: Gemini-powered coding agent scaling impact across fields
DeepMind highlights the expanded impact of AlphaEvolve, a Gemini-powered coding agent, demonstrating its ability to optimize algorithms for genomics, grid optimization, earth sciences, quantum physics, and mathematics.
Start building with Gemini 3
Google has launched Gemini 3 Pro, a new AI model designed to outperform previous versions in coding, agentic workflows, and multimodal reasoning. The model is available via the Gemini API, Google AI Studio, and the new Google Antigravity development platform.
Gemini 3 Deep Think: Advancing science, research and engineering
Google has released a major update to Gemini 3 Deep Think, a specialized reasoning mode designed to solve complex challenges in science, research, and engineering by blending deep scientific knowledge with practical utility.