eagle

#eagle

MiniMax-M3-EAGLE3-GGUF - Llama.cpp compatible MiniMax M3 EAGLE draft model!

Reddit r/LocalLLaMA ↗ · yesterday

A GGUF conversion of MiniMax M3's EAGLE draft model for llama.cpp is now available, enabling speculative decoding speedups on compatible hardware.

0 favorites 0 likes

#eagle

Eagle 3.1: Collaboration Between the EAGLE Team, vLLM Team, and TorchSpec Team

Hacker News Top ↗ · 2026-05-26 Cached

EAGLE 3.1 improves speculative decoding robustness with post-norm architecture, achieving up to 2x longer acceptance length in long-context workloads, with training support from TorchSpec and integration into vLLM.

0 favorites 0 likes

#eagle

@Ex0byt: the different flavors of specdec, and why I'm trying produce a Qwen-3.6-27b EAGLE-3 drafter for ya'll

X AI KOLs Timeline ↗ · 2026-05-17 Cached

Discussion of different flavors of speculative decoding and an attempt to produce a Qwen-3.6-27b EAGLE-3 drafter for the community.

0 favorites 0 likes

eagle

MiniMax-M3-EAGLE3-GGUF - Llama.cpp compatible MiniMax M3 EAGLE draft model!

Eagle 3.1: Collaboration Between the EAGLE Team, vLLM Team, and TorchSpec Team

@Ex0byt: the different flavors of specdec, and why I'm trying produce a Qwen-3.6-27b EAGLE-3 drafter for ya'll

Submit Feedback