@dlouapre: Meet physics-intern, our agentic framework for theoretical physics. It takes Gemini 3.1 Pro from 17.7% to 31.4% on Crit…
Summary
physics-intern is an agentic framework for theoretical physics that raises Gemini 3.1 Pro's score on the CritPt benchmark from 17.7% to 31.4%, setting a new state of the art.
Similar Articles
@lvwerra: We released physics-intern: a simple harness for science problems! It gets models like Gemini 3.1 Pro to go from 17.7 -…
Announcement of physics-intern, a simple harness that lifts reasoning models such as Gemini 3.1 Pro from 17.7% to 31.4% on science problems, outperforming GPT-5.5 Pro.
Agentic harness for theoretical physics research
Hugging Face releases 'physics-intern', an agentic framework for theoretical physics research that doubles the performance of Gemini models on the CritPt benchmark and sets a new state of the art, ahead of GPT-5.5 Pro.
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
DeepMind announces Gemini Deep Think's ability to solve professional research problems in mathematics, physics, and computer science, highlighted by a new agent 'Aletheia' that iteratively verifies and revises solutions.
Gemini 3.5: frontier intelligence with action
Google announces Gemini 3.5, a new family of AI models focused on agentic workflows and coding, starting with 3.5 Flash which delivers frontier performance at high speed.
AlphaEvolve: Gemini-powered coding agent scaling impact across fields
DeepMind highlights the expanded impact of AlphaEvolve, a Gemini-powered coding agent, demonstrating its ability to optimize algorithms for genomics, grid optimization, earth sciences, quantum physics, and mathematics.