together-ai

#together-ai

@h100envy: Dan Fu co-wrote FlashAttention with Tri Dao. Then he co-built Hyena, Monarch Mixer, and ThunderKittens. Now he's distin…

X AI KOLs Timeline · 2026-06-28

Profiles Dan Fu, a key contributor to high-performance kernels and model architectures, including FlashAttention, Hyena, Monarch Mixer, and ThunderKittens. He is now a distinguished researcher at Together AI, and his work is used in ChatGPT, Claude, and Gemini.

#together-ai

@realDanFu: Excited to chat with @olive_jy_song live next week on stage at @aiDotEngineer about MiniMax 3! It’ll be a fun one, come…

X AI KOLs Timeline · 2026-06-28

Dan Fu announces a live on-stage chat with Olive Song at aiDotEngineer about MiniMax 3 and the decisions behind its training.

#together-ai

New KV Quants coming 😍 Welcome OSCAR kv quant open sourced by togetherAI

Reddit r/LocalLLaMA · 2026-05-26

Together AI open-sources OSCAR, an attention-aware 2-bit KV cache quantization system. It redistributes quantization error according to attention importance, making long-context LLM serving more efficient.
