scratch-pretrained

Tag

Cards List
#scratch-pretrained

Sumi: Open Uniform Diffusion Language Model from Scratch

Hugging Face Daily Papers · 2026-06-17 Cached

Sumi is a 7B uniform diffusion language model pretrained from scratch on 1.5T tokens, achieving competitive performance on knowledge and reasoning tasks while being fully open-source with released weights and training recipe.

0 favorites 0 likes
← Back to home

Submit Feedback