@no_stp_on_snek: In progress

Summary

The post promotes Atlas Inference, an open-source inference serving tool that reportedly achieved 200+ tok/s on a Qwen3.6-35B-A3B benchmark.

In progress https://t.co/DFkWLU43lH

Cached at: 05/24/26, 08:18 AM

Azeez (@AtlasInference): Try Atlas Inference. You’ll be ready to serve in <2 mins. https://t.co/vxZLwBJMub ⚡️

Works with sparkrun out of the box. Happy to share Docker commands as well, but they are all on the website.

Open source too, most recently achieved 200+ tok/s on a Qwen3.6-35B-A3B benchmark!

Similar Articles

@no_stp_on_snek: https://x.com/no_stp_on_snek/status/2052833502475833384

X AI KOLs Following

An open-source stack using Qwen2.5-32B-Instruct with longctx and vllm-turboquant on a single AMD MI300X achieves competitive results (0.601-0.688) versus SubQ's closed model (0.659) on the MRCR v2 1M-context benchmark, demonstrating open-weights approaches are within striking distance.