MiniMax's new M3 model matches the score of Opus 4.7 on Terminal-Bench 2.1 while using one-twentieth of the compute and cost, a result attributed to its novel MiniMax Sparse Attention architecture.
MiniMax has released a detailed technical report on its M2 series and teased the upcoming M3 model, which uses a novel sparse attention mechanism to achieve up to 15.6× faster decoding at million-token contexts.