moe-pruning

Tag

Cards List
#moe-pruning

REAP-pruned Nemotron-3-Super (512 -> 256 experts) + GRPO fine-tune + FP8/AWQ. AIME 2026 90%+. Benchmark inside.

Reddit r/LocalLLaMA · 2026-04-22

Community release of REAP-pruned Nemotron-3-Super-120B to 64B, GRPO fine-tuned on math, quantized to AWQ/FP8, hitting 90%+ on AIME 2026 and runnable on a single H100/RTX PRO 6000.

1 favorites 1 likes
← Back to home

Submit Feedback