StepFun's Step 3.7 Flash, a 198B-parameter sparse MoE model with 11B active parameters, reaches 97% of Claude Opus 4.6's coding performance on SWE-bench Verified at roughly one-ninth the cost. It uses an Advisor Mode strategy that reserves expensive frontier-model calls for critical decision points.
ClawCodex is an open-source Python coding agent that implements an /advisor mode, pairing a cheap worker model with an expensive reviewer model at decision points to reduce cost while maintaining quality. It supports multiple providers and achieves 58.2% on SWE-bench Verified.
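
The worker/reviewer pairing described above can be illustrated with a minimal sketch. This is not ClawCodex's actual code or API: the `Model` and `AdvisorAgent` classes, the `is_decision_point` predicate, and the per-call costs are all hypothetical names and placeholder values chosen for illustration. The idea shown is only the routing: every step goes to the cheap worker, and the expensive model is consulted just on steps flagged as decision points.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Model:
    """Hypothetical stand-in for a model client (not a real provider API)."""
    name: str
    cost_per_call: float  # illustrative units, not real pricing
    respond: Callable[[str], str]


@dataclass
class AdvisorAgent:
    """Routes every step to the worker; adds an advisor review at decision points."""
    worker: Model
    advisor: Model
    is_decision_point: Callable[[str], bool]  # assumed heuristic, supplied by caller
    total_cost: float = 0.0
    log: List[str] = field(default_factory=list)

    def _call(self, model: Model, prompt: str) -> str:
        # Track spend and which model handled each call.
        self.total_cost += model.cost_per_call
        self.log.append(model.name)
        return model.respond(prompt)

    def step(self, task: str) -> str:
        draft = self._call(self.worker, task)
        if self.is_decision_point(task):
            # Only here do we pay for the expensive model.
            return self._call(self.advisor, f"review {task!r}: {draft}")
        return draft


# Stub models that echo their input, so the sketch runs without network access.
worker = Model("worker", 1.0, lambda p: f"worker:{p}")
advisor = Model("advisor", 10.0, lambda p: f"advisor:{p}")

agent = AdvisorAgent(
    worker=worker,
    advisor=advisor,
    is_decision_point=lambda task: task.startswith("plan"),
)

steps = ["plan fix", "edit file", "run tests"]
results = [agent.step(s) for s in steps]
```

With these placeholder costs, the three steps trigger three worker calls and one advisor call. In a real agent, the decision-point predicate is the part that matters most, and how ClawCodex or Step 3.7 Flash decides when to escalate is not specified in the summaries above.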