low-parameter

#low-parameter

Microsoft's new MAI models

Simon Willison's Blog ↗ · 2026-06-02 Cached

Microsoft announced two new LLMs: MAI-Thinking-1 (35B reasoning model) and MAI-Code-1-Flash (5B code model), both trained on enterprise-grade, clean data without third-party distillation, with MAI-Thinking-1 claimed to be preferred over Sonnet 4.6 in blind evaluations.

0 favorites 0 likes

#low-parameter

Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting

arXiv cs.LG ↗ · 2026-06-01 Cached

Proposes GC-MoE, a graph-conditioned mixture of experts framework for traffic forecasting that assigns each node a personalized combination of frozen pretrained spatio-temporal GNN experts based on graph topology and recent input, training only a lightweight routing module (∼17K parameters) and achieving competitive performance on four benchmarks.

0 favorites 0 likes

low-parameter

Microsoft's new MAI models

Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting

Submit Feedback