disaggregation

#disaggregation

@CyrusHakha: One pattern we keep seeing with customers serving LLMs at scale: Prefill-decode disaggregation is often treated like a …

X AI KOLs Following ↗ · 4d ago Cached

Discusses the nuanced reality of prefill-decode disaggregation in LLM serving at scale, based on customer patterns and validated on AMD with vLLM.

#disaggregation

@charles_irl: congrats to my colleague @nanjiangwill on getting this important technique merged into slime!

X AI KOLs Following ↗ · 2026-05-30 Cached

A delta-compressed weight sync technique has been merged into slime. It enables lossless delta sync for Megatron ↔ SGLang disaggregation, supporting reinforcement learning at scale.
