fast-inference

Tag

Cards List
#fast-inference

Mimo 2.5 is _fast_ at large context (dual RTX Pro 6000)

Reddit r/LocalLLaMA · yesterday

Mimo 2.5 demonstrates fast performance with large context windows using dual RTX Pro 6000 GPUs.

0 favorites 0 likes
#fast-inference

DiffusionGemma: 4x Faster Text Generation

Hacker News Top · 2026-06-10 Cached

Google introduces DiffusionGemma, an experimental 26B MoE open model that achieves up to 4x faster text generation on GPUs using text diffusion, targeting speed-critical interactive local workflows.

0 favorites 0 likes
#fast-inference

@maxxxzdn: Today we release Mosaic, a probabilistic weather model that shifts the Pareto frontier of ML weather forecasting. It ma…

X AI KOLs Following · 2026-05-20 Cached

Mosaic is a probabilistic weather model that matches state-of-the-art skill while generating a 24-member, 10-day global forecast in under 12 seconds on a single H100.

0 favorites 0 likes
#fast-inference

@svpino: For the first time, I feel open-weight models are impossible to ignore. We are at a point where these models are compet…

X AI KOLs Following · 2026-05-15

Santiago (@svpino) highlights MiniMax-M2.7, a 230B open-weight model that rivals top proprietary models like Opus 4.6 and GPT-5.4, achieving 440+ tokens/s inference on SambaNova at low cost.

0 favorites 0 likes
#fast-inference

baidu/ERNIE-Image-Turbo

Hugging Face Models Trending · 2026-04-02 Cached

Baidu releases ERNIE-Image-Turbo, a distilled text-to-image generation model that achieves fast generation in 8 inference steps while maintaining strong text rendering, instruction following, and structured image generation capabilities.

0 favorites 0 likes
#fast-inference

prunaai/p-image-edit

Replicate Explore · 2026-04-21 Cached

Pruna's p-image-edit is a premium AI model on Replicate offering fast state-of-the-art image editing under one second, combining speed, affordability, and high visual quality with precise prompt adherence and text rendering capabilities.

0 favorites 0 likes
← Back to home

Submit Feedback