Tag
Empero released Qwythos-9B-Claude-Mythos-5, a full-parameter reasoning model fine-tuned with 1M context, based on synthetic chain-of-thought data from Fable-5 and Mythos-5 session logs.
Empero AI releases Qwythos-9B, a fine-tuned reasoning model with 1M token context and uncensored capabilities, showing large benchmark improvements over its Qwen3.5-9B base.
A 600M parameter reasoning model trained using SYNTH reportedly outperforms a 397B model and Sonnet 4.5 in an industrial application for the Paris subway, highlighting the effectiveness of small, specialized models.
Weibo AI releases VibeThinker-3B, a 3B parameter open-source reasoning model with MIT license, achieving competitive results on math, coding, and STEM reasoning benchmarks.
WeiboAI released VibeThinker-3B, a small 3B reasoning model tested locally on coding tasks, achieving 3/3 on algorithm problems.
MSL releases a preparedness report for its extreme reasoning model Muse Spark Contemplating, benchmarking its capabilities in biology and cybersecurity.
OpenAI shares links to their podcast episode about how a reasoning model cracked an 80-year-old problem, available on Spotify, Apple Podcasts, and YouTube.
Microsoft AI announces MAI-Thinking-1, a 35B active/1T total MoE reasoning model competitive on STEM and coding tasks, developed using Ray for distributed training and orchestration.
Microsoft announced two new on-device AI models at Build 2026: Aion 1.0 Instruct, an open-weights small language model, and Aion 1.0 Plan, a 14B parameter reasoning and tool-calling model for local agentic workflows.
Microsoft AI introduces MAI-Thinking-1, a 35B-active parameter reasoning model trained from scratch without distillation, achieving strong performance on software engineering and math benchmarks while emphasizing clean data and self-sufficiency.
The article discusses where to add heavy reasoning using Ring-2.6-1T in agent workflows to guard against failure points such as state corruption, tool-contract mismatch, or the final external action.
NVIDIA announces Alpamayo 2 Super, a 32B open reasoning model for Level 4 robotaxis, featuring 360-degree perception, meta-actions, and a full stack including AlpaGym simulation and OmniDreams scenario generation.
Microsoft is set to announce new AI models including its first reasoning model, MAI-Thinking-1, along with Windows 11 developer experience improvements and Copilot updates at its Build conference.
A discussion on where to allocate reasoning budget in AI agents, referencing the trillion-parameter Ring-2.6-1T model with high/xhigh reasoning-effort modes.
JetBrains released Mellum 2 12B A2.5B, a coding-focused small MoE model with reasoning performance comparable to Qwen 3.5 9B but weaker in other tasks.
Discussion about routing failure classes (bad tool choice, bad replanning, final-answer verification) to Ring-2.6-1T, a trillion-parameter reasoning model for agent workflows with high reasoning-effort modes.
Liquid AI released LFM2.5-8B-A1B, an edge MoE model trained on 38T tokens with a 128K context window, improved tool calling, and reasoning capabilities, available on Hugging Face.
A reflection on the trade-offs between using a single trillion-parameter reasoning model with adjustable depth (like Ring-2.6-1T) versus routing between separate specialized models, exploring which approach is cleaner or more cost-effective for agent workflows.
OpenAI claims its unreleased reasoning model has solved the 80-year-old planar unit distance problem in mathematics, producing an original proof that outperforms traditional grid-based arrangements.
OpenAI claims its general-purpose reasoning model discovered a counterexample to the conjectured upper bound in Erdős's planar unit-distance problem, producing a proof reviewed by mathematicians.