low-parameter

Tag

Cards List
#low-parameter

Microsoft's new MAI models

Simon Willison's Blog · yesterday Cached

Microsoft announced two new LLMs: MAI-Thinking-1 (35B reasoning model) and MAI-Code-1-Flash (5B code model), both trained on enterprise-grade, clean data without third-party distillation, with MAI-Thinking-1 claimed to be preferred over Sonnet 4.6 in blind evaluations.

0 favorites 0 likes
#low-parameter

Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting

arXiv cs.LG · 3d ago Cached

Proposes GC-MoE, a graph-conditioned mixture of experts framework for traffic forecasting that assigns each node a personalized combination of frozen pretrained spatio-temporal GNN experts based on graph topology and recent input, training only a lightweight routing module (∼17K parameters) and achieving competitive performance on four benchmarks.

0 favorites 0 likes
← Back to home

Submit Feedback