Mellum 2 12B A2.5B
Summary
JetBrains released Mellum 2 12B A2.5B, a small, coding-focused Mixture-of-Experts (MoE) model whose reasoning performance is comparable to Qwen 3.5 9B but which is weaker on other tasks.
Similar Articles
Mellum2 Technical Report
Mellum 2 is a 12B-parameter open-weight MoE language model by JetBrains with 2.5B active parameters, specialized in software engineering tasks and optimized for efficient inference on commodity GPUs.
JetBrains's Mellum 2 (49 minute read)
JetBrains releases Mellum 2, a 12B-parameter open-weight Mixture-of-Experts language model specialized in software engineering, with competitive performance in code generation, reasoning, and tool use, available under Apache 2.0.
JetBrains/Mellum2-12B-A2.5B-Thinking
JetBrains releases Mellum2-12B-A2.5B-Thinking, an open-source Mixture-of-Experts reasoning model with a 131k context length, trained with RLVR (reinforcement learning with verifiable rewards) for explicit chain-of-thought reasoning.
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
JetBrains introduces Mellum2, a 12B-parameter Mixture-of-Experts model optimized for code generation and reasoning tasks, with a focus on private deployment and integration into development workflows.
Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog
JetBrains open-sources Mellum2, a fast 12B Mixture-of-Experts model designed for low-latency AI workflows in software engineering, available under the Apache 2.0 license.