inference-api

#inference-api

@philipkiely: https://x.com/philipkiely/status/2069212319746506968

X AI KOLs Timeline ↗ · yesterday Cached

Baseten announces the world's fastest API for the GLM-5.2 open model, achieving over 280 tokens per second via NVFP4 quantization, disaggregated inference, and other optimizations.

0 favorites 0 likes

#inference-api

@omarsar0: We are entering an extremely exciting era for open-weight models. Kimi K2.6 now feels like a top agentic model. I took …

X AI KOLs Timeline ↗ · 2026-04-21 Cached

Kimi K2.6 is released as an open-weight model with strong agentic capabilities, accessible via FireworksAI’s fast inference APIs.

0 favorites 0 likes

inference-api

@philipkiely: https://x.com/philipkiely/status/2069212319746506968

@omarsar0: We are entering an extremely exciting era for open-weight models. Kimi K2.6 now feels like a top agentic model. I took …

Submit Feedback