ai-model-optimization


@LottoLabs: This is awesome work: DFlash for the Qwen 3.5/6 series

X AI KOLs Timeline · 3d ago

Charles Frye announces the co-release with Z Lab of six new DFlash speculators for Alibaba Qwen 3.x models, achieving over 1k output tokens per second for Qwen 3.5 122B-A10B on a B200.
