nanotron

Tag

Cards List
#nanotron

Built an LLM training framework that actually runs on older GPUs without crashing

Reddit r/ArtificialInteligence · 2d ago

Introduces Picotron, a clean-room rewrite of Nanotron that eliminates mandatory GPU-specific dependencies, enabling LLM training on older GPUs like T4 and V100. It defaults to standard PyTorch SDPA but supports FlashAttention-2 at runtime.

0 favorites 0 likes
← Back to home

Submit Feedback