@FinanceYF5: 2/ SkillOpt: Treating Documents as Trainable Parameters Microsoft treats SKILL.md as trainable model parameters—without changing weights, only optimizing natural language documents, with a validation gate filtering each change. 6 Benchmarks, 52 consecutive wins, GPT-5.5 conversation boost…

X AI KOLs Following Papers

Summary

Microsoft proposes the SkillOpt method, which treats documents as trainable parameters. By optimizing natural language documents without modifying weights, it improves model performance. It achieves 52 consecutive wins across 6 benchmarks, with GPT-5.5 improving by 23.5 points and Claude Code by 19.1 points.

2/📄 SkillOpt: Treating Documents as Trainable Parameters Microsoft treats SKILL.md as trainable model parameters—without changing weights, only optimizing natural language documents, with a validation gate filtering each change. 6 Benchmarks, 52 consecutive wins, GPT-5.5 direct chat improvement +23.5 points, Claude Code improvement +19.1 points.
Original Article
View Cached Full Text

Cached at: 06/01/26, 11:18 AM

2/📄 SkillOpt: Treating Documents as Trainable Parameters

Microsoft treats SKILL.md as a trainable model parameter — without modifying weights, only optimizing natural language documentation, with a validation gate filtering each change.

52 consecutive wins across 6 benchmarks, GPT-5.5 direct chat improved by +23.5 points, Claude Code improved by +19.1 points.

Similar Articles

@omarsar0: New research from Microsoft Research I see a lot of AI engineers handwriting agent skill docs and hope they generalize.…

X AI KOLs Following

Microsoft Research introduces SkillOpt, a method that treats agent skill documents as trainable external state, using an optimizer model to make bounded edits validated by a held-out set. The approach achieves best or tied results across 52 evaluation cells and improves accuracy by over 23 points on GPT-5.5, with zero extra inference cost and transferable skills.

@Xudong07452910: This SkillOpt paper is quite interesting—it actually addresses a very important point: AI agents in the future won't just rely on humans writing prompts; they can train their own 'job descriptions'. Currently, many skills/prompts are written one-off, and when real tasks pile up, various edge cases start to fail...

X AI KOLs Timeline

SkillOpt introduces a systematic controllable text-space optimizer that enables AI agents to train and improve their own skills (like 'work instructions') through iterative edits and validation, outperforming human-crafted and one-shot prompts across multiple benchmarks and models.