Tag
Sumi is a 7B uniform diffusion language model pretrained from scratch on 1.5T tokens, achieving competitive performance on knowledge and reasoning tasks while being fully open-source with released weights and training recipe.
Microsoft released Fara-7B, an efficient 7 billion parameter agentic small language model (SLM) for computer use tasks, achieving state-of-the-art performance within its size class and competitive with larger systems.