fsdp

@plugyawn: Introducing: Megaprop: a library for efficient preconditioned optimization across GPUs! Megaprop is a fork of Megatron …

X AI KOLs Following · 2026-06-15

Megaprop is a new library for efficient preconditioned optimization across GPUs, forked from Megatron and TransformerEngine. It offers FSDP support for the Muon, FOOF, KFAC, and Newton-Muon optimizers, and MuP support for scaling across both width and depth.
