@Modular: Our kernel team has been deep in MiniMax M3 all week. The 1M-token context and native multimodality make it a hard mode…

X AI KOLs Following Models

Summary

Modular's kernel team is optimizing serving for MiniMax M3's 1M-token context and native multimodality, with open weights dropping soon for immediate deployment on Modular.

Our kernel team has been deep in MiniMax M3 all week. The 1M-token context and native multimodality make it a hard model to serve well, which is exactly the kind of problem we like! When the open weights drop in the next few days, you'll be able to run it on Modular right away. Stay tuned for @MiniMax_AI x Modular.
Original Article
View Cached Full Text

Cached at: 06/10/26, 12:20 AM

Our kernel team has been deep in MiniMax M3 all week. The 1M-token context and native multimodality make it a hard model to serve well, which is exactly the kind of problem we like!

When the open weights drop in the next few days, you’ll be able to run it on Modular right away.

Stay tuned for @MiniMax_AI x Modular.

Similar Articles

MiniMax M3 (2 minute read)

TLDR AI

MiniMax introduces M3, the first open-weights model to combine coding, agentic, and multimodal capabilities with up to 1M context via sparse attention.