minimax

#minimax

@Phoenixyin13: This kind of assessment method where students create questions that stump AI is indeed very innovative and highly forward-looking. Students need to explore the strengths and weaknesses of the three models: Claude, DeepSeek, and MiniMax. In this process, students no longer blindly trust AI outputs but learn to review AI responses with a critical and discerning eye, which...

X AI KOLs Timeline ↗ · 21h ago Cached

This educational assessment method encourages students to explore the strengths and weaknesses of Claude, DeepSeek, and MiniMax, creating questions that defeat AI, thereby cultivating critical thinking and competitiveness needed in the AI era.

0 favorites 0 likes

#minimax

Minimax M3 vs M2.7

Reddit r/LocalLLaMA ↗ · 2d ago

Discussion comparing the new Minimax M3 model to its predecessor M2.7, seeking user feedback after two weeks of release.

0 favorites 0 likes

#minimax

GLM 5.2 on Dual Strix Halo (256GB): Worth it?

Reddit r/LocalLLaMA ↗ · 6d ago Cached

This article evaluates the performance of running GLM 5.2 (IQ2M quantized version) on Dual Strix Halo (256GB VRAM). The generation speed is only about 7 tokens/s, and coding tasks take twice as long as DeepSeek V4 Flash. Its cost-performance ratio is far inferior to other models, so it is not recommended for use with this hardware configuration.

0 favorites 0 likes

#minimax

MiniMax-M3-EAGLE3-GGUF - Llama.cpp compatible MiniMax M3 EAGLE draft model!

Reddit r/LocalLLaMA ↗ · 2026-06-23

A GGUF conversion of MiniMax M3's EAGLE draft model for llama.cpp is now available, enabling speculative decoding speedups on compatible hardware.

0 favorites 0 likes

#minimax

@rohanpaul_ai: Quite incredible, MiniMax Sparse Attention cuts attention compute by 28.4X at 1M tokens, with 14.2X faster prefill and …

X AI KOLs Following ↗ · 2026-06-15 Cached

MiniMax Sparse Attention (MSA) achieves up to 28.4x reduction in attention compute at 1M tokens by adding a routing branch that selectively chooses key-value blocks for attention, enabling 14.2x faster prefill and 7.6x faster decoding on H800 GPUs while matching full attention benchmark performance.

0 favorites 0 likes

#minimax

@MiaAI_lab: A PR to vLLM to allow TP=3 for MiniMax M3 His NVFP4 quant is 260GB - lukealonso/MiniMax-M3-NVFP4 Hopefully this will wo…

X AI KOLs Timeline ↗ · 2026-06-14 Cached

A pull request to vLLM adds support for tensor parallelism degree 3 for MiniMax M3 with its NVFP4 quantization, enabling the model to run on 3x DGX Sparks with 87GB memory each.

0 favorites 0 likes

#minimax

@askalphaxiv: "MiniMax Sparse Attention" This paper from Minimax adds a tiny Index Branch to GQA that picks top k KV blocks per group…

X AI KOLs Timeline ↗ · 2026-06-13 Cached

This paper from Minimax introduces MiniMax Sparse Attention, which adds a tiny Index Branch to GQA to select top-k KV blocks per group, enabling GPU-native sparsity with exponential speedups on a 109B multimodal MoE.

0 favorites 0 likes

#minimax

MiniMax M3 available on HuggingChat (with Artifacts support)

Reddit r/LocalLLaMA ↗ · 2026-06-12 Cached

MiniMax M3 model is now available on HuggingChat, an open source AI chat app with Artifacts support.

0 favorites 0 likes

#minimax

Minimax M3 sm_120

Reddit r/LocalLLaMA ↗ · 2026-06-12

Minimax's M3 model requires vllm updates to support sm_120 compute capability, as the current repo only supports sm_100.

0 favorites 0 likes

#minimax

Minimax M3 open weights release planned for Friday

Reddit r/LocalLLaMA ↗ · 2026-06-11 Cached

MiniMaxAI announces plans to release open weights for its upcoming M3 model on Friday, following the earlier M2.7 model.

0 favorites 0 likes

#minimax

Tested a batch of free AI tools this week, honest verdicts on Claude, MiniMax, K2Think, and a couple comparison playgrounds

Reddit r/artificial ↗ · 2026-06-08

A review of free AI tools tested this week, including Claude, MiniMax Agent, K2Think, Indic LLM Arena, and Together.ai playground, with honest assessments of their capabilities and limitations.

0 favorites 0 likes

#minimax

MiniMax is digging its own grave

Reddit r/AI_Agents ↗ · 2026-06-08

MiniMax's price increases and model limitations are driving users away to competitors like DeepSeek and premium options like Claude or ChatGPT, reversing its earlier reputation as a cheap, usable daily driver.

0 favorites 0 likes

#minimax

@mnmn94253156337: Let AI make a PPT for you, and it gives you a bunch of divs and a randomly laid out layout. Click to open, and it looks uglier than what you would have done yourself. What's more annoying is Excel—formulas are written incorrectly, formatting is all over the place, and after generation, you have to manually fix everything from start to finish. You might as well do it yourself. MiniMax open-sourced these four doc…

X AI KOLs Timeline ↗ · 2026-06-08 Cached

MiniMax open-sourced four AI document generation skills (PPT, PDF, Excel, Word), usable without an API key, aiming to solve issues like messy formatting and formula errors in AI-generated documents.

0 favorites 0 likes

#minimax

@cyrilXBT: Nemotron 3 Ultra versus DeepSeek V4 versus MiniMax M3 versus Qwen 3.7 Max. Same two prompts. Four frontier models. One …

X AI KOLs Following ↗ · 2026-06-06 Cached

A comparison of four frontier AI models (Nemotron 3 Ultra, DeepSeek V4, MiniMax M3, Qwen 3.7 Max) on the same two prompts, with full results linked.

0 favorites 0 likes

#minimax

M3 scores well on SWE-Bench but that's not why Im impressed its the stuff no benchmark measures.

Reddit r/AI_Agents ↗ · 2026-06-04

M3 achieves solid benchmark scores but impresses with its ability to perform risk assessment and pre-mortem analysis before making code changes, highlighting a more cautious and thorough approach to refactoring in messy legacy repos.

0 favorites 0 likes

#minimax

Big Model Value Wars - DeepSeek V4 Pro vs MiMo-V2.5-Pro vs MiniMax M3

Reddit r/LocalLLaMA ↗ · 2026-06-03

A discussion comparing DeepSeek V4 Pro, MiMo-V2.5-Pro, and MiniMax M3 for best value in local or openrouter use, with a focus on agentic and coding tasks, and mentions of Hermes Agent and Qwen 3.6 variants.

0 favorites 0 likes

#minimax

Minimax M3 appears to have no political censorship

Reddit r/LocalLLaMA ↗ · 2026-06-02

The Minimax M3 model appears to have no political censorship, standing out among Chinese LLMs in a bias benchmark.

0 favorites 0 likes

#minimax

@sdrzn: MiniMax's new m3 model scores the same as opus 4.7 on terminal-bench 2.1 at 1/20th the compute/cost of their previous m…

X AI KOLs Following ↗ · 2026-06-01 Cached

MiniMax's new m3 model achieves the same score as Opus 4.7 on terminal-bench 2.1 while using 1/20th the compute and cost, attributed to their novel MiniMax Sparse Attention architecture.

0 favorites 0 likes

#minimax

Minimax M3 has been released

Reddit r/singularity ↗ · 2026-06-01

Minimax has released its M3 model, as announced in a blog post.

0 favorites 0 likes

#minimax

MiniMax teases upcoming M3 model with new sparse attention mechanism and 15.6X long-context response speed boost (12 minute read)

TLDR AI ↗ · 2026-05-29 Cached

MiniMax has released a detailed technical report on its M2 series and teased the upcoming M3 model, which uses a novel sparse attention mechanism to achieve up to 15.6× faster decoding at million-token contexts.

0 favorites 1 likes

minimax

Submit Feedback