@natolambert: New podcast with @finbarrtimbers! We survey the latest post-training recipes, from GLM 5.1, Kimi K2.6, DeepSeek V4, Xia…

X AI KOLs Timeline Events

Summary

Nathan Lambert and Finbarr Timbers discuss the latest post-training recipes for large language models, including DeepSeek V4, GLM 5.1, Kimi K2.6, and the industry shift to multi-teacher on-policy distillation.

New podcast with @finbarrtimbers! We survey the latest post-training recipes, from GLM 5.1, Kimi K2.6, DeepSeek V4, Xiaomi MiMo V2.5, Nemotron Ultra, etc. and discuss: - Why the industry slowly shifted to multi-teacher on-policy distillation (MOPD). - What an Olmo-style recipe would need improvements in - How post-training works / suits larger organizational efforts - Career advice in the foothills of the singularity - and other topics I heard y'all wanted me to start doing this, so making some time when I'm in funemployment! Chapters: 00:00 Introduction & Olmo reflections 06:28 Post-train recipes review (history) 23:00 2026’s model recipes (MiMo Flash, DeepSeek V4, GLM 5, Kimi K2.6, etc.) 39:05 Open-ended post-training discussions 48:22 Career advice in the LLM race Links below, please follow @interconnectsai and like and subscribe and buy my book?
Original Article
View Cached Full Text

Cached at: 06/17/26, 01:44 AM

New podcast with @finbarrtimbers! We survey the latest post-training recipes, from GLM 5.1, Kimi K2.6, DeepSeek V4, Xiaomi MiMo V2.5, Nemotron Ultra, etc. and discuss:

  • Why the industry slowly shifted to multi-teacher on-policy distillation (MOPD).
  • What an Olmo-style recipe would need improvements in
  • How post-training works / suits larger organizational efforts
  • Career advice in the foothills of the singularity
  • and other topics

I heard y’all wanted me to start doing this, so making some time when I’m in funemployment!

Chapters:

00:00 Introduction & Olmo reflections 06:28 Post-train recipes review (history) 23:00 2026’s model recipes (MiMo Flash, DeepSeek V4, GLM 5, Kimi K2.6, etc.) 39:05 Open-ended post-training discussions 48:22 Career advice in the LLM race

Links below, please follow @interconnectsai and like and subscribe and buy my book?

Similar Articles

Deepseek, kimi etc..

Reddit r/AI_Agents

Mentions of AI models Deepseek and Kimi, possibly discussing recent updates or comparisons.

@tom_doerr: Trained on 13M hours of mixed audio and text data https://github.com/MoonshotAI/Kimi-Audio…

X AI KOLs Timeline

Trained on 13M hours of mixed audio and text data https://t.co/SvoKmvzphI https://t.co/UlKN3OiqG8 --- # MoonshotAI/Kimi-Audio Source: [https://github.com/MoonshotAI/Kimi-Audio](https://github.com/MoonshotAI/Kimi-Audio) <p align="center"> <img src="assets/kimia_logo.png" width="400"/> <p> <p align="center"> Kimi-Audio-7B <a href="https://huggingface.co/moonshotai/Kimi-Audio-7B">🤗</a>&nbsp; | Kimi-Audio-7B-Instruct <a href="https://huggingface.co/moonshotai/Kimi-Audio-7B-Instruct">🤗</a>&nbsp; |