@lvwerra: We released physics-intern: a simple harness for science problems! It gets models like Gemini 3.1 Pro to go from 17.7 -…

X AI KOLs Following 05/21/26, 03:01 PM Tools

physics-intern harness science-problems reasoning-models subagent model-boosting

Summary

physics-intern is a simple harness for science problems that wraps reasoning models and boosts their performance: Gemini 3.1 Pro goes from 17.7 to 31.4, outperforming GPT 5.5 Pro.

We released physics-intern: a simple harness for science problems! It gets models like Gemini 3.1 Pro to go from 17.7 -> 31.4, thus beating GPT 5.5 Pro.

The physics-intern harness can wrap any model and, via a dedicated subagent, boost the performance of vanilla reasoning models.

While I think more and more of these harness capability gains will be absorbed into the models (just as prompting tricks disappeared over time), there is a lot to be gained right now by building good scaffolds for those models and integrating tools well.

Interestingly, the one exception we found is that GPT 5.5 Pro actually didn't benefit from the physics-intern harness!

Read more about it here: https://huggingface.co/spaces/huggingface/physics-intern…

PS: I think the Harness[Model] notation is kind of nice.
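As an illustration of the Harness[Model] notation, here is a minimal sketch of a harness that wraps an arbitrary model and routes each problem through a subagent first. Everything in it is an assumption made for illustration: the `Harness` class, its fields, and the stub functions are hypothetical and are not the physics-intern API, which the post does not describe.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical type: a "model" is any function from prompt text to answer text.
Model = Callable[[str], str]


@dataclass
class Harness:
    """Toy wrapper, not the physics-intern API: subagent first, then the model."""

    name: str
    model_name: str
    model: Model
    subagent: Callable[[str], str]

    def __call__(self, problem: str) -> str:
        # The subagent produces notes that are appended to the model's prompt.
        notes = self.subagent(problem)
        return self.model(f"{problem}\n\nNotes:\n{notes}")

    def __str__(self) -> str:
        # Renders the Harness[Model] notation.
        return f"{self.name}[{self.model_name}]"


def stub_model(prompt: str) -> str:
    # Stand-in for a real model call; it only upper-cases the prompt.
    return prompt.upper()


def stub_subagent(problem: str) -> str:
    # Stand-in for a dedicated subagent; it returns a fixed hint.
    return "check units"


wrapped = Harness("physics-intern", "Gemini 3.1 Pro", stub_model, stub_subagent)
print(wrapped)             # physics-intern[Gemini 3.1 Pro]
print(wrapped("F = ma?"))
```

Because the model is passed in as a plain callable, the same wrapper can be placed around any model, which is the property the post attributes to the harness.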

physics-intern: an Autonomous Agent for Physics Research - a Hugging Face Space by huggingface

Source: https://huggingface.co/spaces/huggingface/physics-intern

Similar Articles

@dlouapre: Meet physics-intern, our agentic framework for theoretical physics. It takes Gemini 3.1 Pro from 17.7% to 31.4% on Crit…

Agentic harness for theoretical physics research

Gemini 3.1 Pro: A smarter model for your most complex tasks

Advancing science and math with GPT-5.2

Start building with Gemini 3
