StepFun_ai highlights a thoughtful take on the Step 3.7 Flash model and its implications for agent efficiency.
StepFun's Step 3.7 Flash, a 198B sparse MoE model with 11B active parameters, matches 97% of Claude Opus 4.6's coding performance on SWE-Bench Verified at roughly one-ninth the cost, using an Advisor Mode strategy that reserves expensive frontier model calls for critical decision points.
Step 3.7 Flash, an open-weight 198B sparse MoE model, is claimed to reach 98% agent reliability on tau2-bench across all difficulty levels, pairing mid-tier raw capability with strong multi-step consistency.
Modal announces day-0 support for the Step 3.7 Flash AI model, a 198B-parameter MoE with 11B active parameters, a 256K context window, three reasoning levels, and native image and video understanding.
StepFun releases Step 3.7 Flash, an open-weight model designed for agentic, coding, search, and multimodal tasks, achieving top scores on several benchmarks.
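The Advisor Mode idea mentioned above, where expensive frontier-model calls are reserved for critical decision points, can be illustrated with a small routing sketch. This is not StepFun's implementation: the summaries do not say how critical points are detected, so the `critical` flag, the `Step` type, `run_with_advisor`, and both stub models are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Step:
    """One unit of agent work. `critical` is a hypothetical flag marking
    a decision point that should be escalated to the advisor model."""
    description: str
    critical: bool = False


def run_with_advisor(
    steps: List[Step],
    cheap_model: Callable[[str], str],
    advisor_model: Callable[[str], str],
) -> Tuple[List[str], int]:
    """Send each step to the cheap model unless it is marked critical.

    Returns the per-step outputs and the number of advisor calls made.
    """
    outputs: List[str] = []
    advisor_calls = 0
    for step in steps:
        if step.critical:
            advisor_calls += 1
            outputs.append(advisor_model(step.description))
        else:
            outputs.append(cheap_model(step.description))
    return outputs, advisor_calls


# Stub models standing in for real inference calls (assumptions, not real APIs).
def cheap_stub(prompt: str) -> str:
    return f"cheap:{prompt}"


def advisor_stub(prompt: str) -> str:
    return f"advisor:{prompt}"


if __name__ == "__main__":
    plan = [
        Step("read failing test"),
        Step("choose fix strategy", critical=True),
        Step("apply patch"),
    ]
    outputs, advisor_calls = run_with_advisor(plan, cheap_stub, advisor_stub)
    print(outputs, advisor_calls)
```

In a real agent the escalation decision would more likely come from a confidence signal or a planner than from a hand-set flag. The sketch only shows why the number of advisor calls, and with it the cost, scales with the number of critical steps rather than with the total step count.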