@nathanhabib1011: Step-3.7-Flash from @StepFun_ai is a silent winner. Super impressive results, the best model under 500B params on HF le…
Summary
Step-3.7-Flash from StepFun_ai is highlighted as the best model under 500B parameters on Hugging Face leaderboards, with strong multimodal performance.
View Cached Full Text
Cached at: 06/02/26, 11:39 PM
Step-3.7-Flash from @StepFun_ai is a silent winner. Super impressive results, the best model under 500B params on HF leaderboards.. All while being multimodal https://t.co/Zo5nUdZTwq
Similar Articles
@AdinaYakup: Step-3.7-Flash New VL model from @StepFun_ai 198B / 11B active - MoE 256K context 3 reasoning level Up to 400 tokens/sec
StepFun releases Step-3.7-Flash, a new large vision-language MoE model with 198B parameters (11B active), 256K context, and up to 400 tokens/sec inference speed.
@NielsRogge: Impressive release by StepFun, explore it at https://paperswithcode.co/paper/83892
StepFun releases Step 3.7 Flash, an open-weight model designed for agentic, coding, search, and multimodal tasks, achieving top scores on several benchmarks.
stepfun-ai/Step-3.7-Flash
Step 3.7 Flash is a 198B-parameter sparse MoE vision-language model with 11B active parameters per token, supporting 256k context and three reasoning levels, designed for high-throughput agentic workflows.
StepFun 3.7 Flash
StepFun released Step 3.7 Flash, a high-efficiency multimodal model optimized for real-world agentic tasks, featuring improved coding benchmarks (SWE-Bench Pro, Terminal-Bench) and compatibility with multiple agent harnesses.
stepfun-ai/Step-3.7-Flash-GGUF
StepFun releases GGUF quantizations of their 198B-parameter sparse MoE vision-language model Step-3.7-Flash, enabling local deployment with up to 256K context and selectable reasoning levels.