Mimo 2.5 runs fast with large context windows on dual RTX Pro 6000 GPUs.
Google introduces DiffusionGemma, an experimental 26B MoE open model that uses text diffusion to generate text up to 4x faster on GPUs, targeting speed-critical, interactive local workflows.
Mosaic is a probabilistic weather model that matches state-of-the-art skill while generating a 24-member, 10-day global forecast in under 12 seconds on a single H100.
Santiago (@svpino) highlights MiniMax-M2.7, a 230B open-weight model that rivals top proprietary models like Opus 4.6 and GPT-5.4, achieving 440+ tokens/s inference on SambaNova at low cost.
Baidu releases ERNIE-Image-Turbo, a distilled text-to-image model that generates images in 8 inference steps while maintaining strong text rendering, instruction following, and structured image generation.
Pruna's p-image-edit is a premium model on Replicate that performs state-of-the-art image editing in under one second, combining speed, affordability, and high visual quality with precise prompt adherence and text rendering.