model-training

#model-training

@googledevs: Teach models repeatable behavior with dataset distillation. Train smaller models on clean, structured examples to stabi…

X AI KOLs Following ↗ · 16h ago Cached

Google Devs discusses using dataset distillation to train smaller models on clean, structured examples to stabilize parser outputs and teach models repeatable behavior.

#model-training

@cursor_ai: Three announcements from our keynote at Compile, including how we're training a new model with SpaceX.

X AI KOLs Following ↗ · 2d ago Cached

Cursor AI announced three updates from its Compile keynote, including training a new model in collaboration with SpaceX.

#model-training

@LottoLabs: If you’ve been thinking about training models or like the idea but don’t know where to start This is one of the best re…

X AI KOLs Following ↗ · 3d ago Cached

A tweet recommending 'The Smol Training Playbook' on Hugging Face, a resource that demystifies model training for beginners.

#model-training

@TheTuringPost: https://x.com/TheTuringPost/status/2068474648925216861

X AI KOLs Timeline ↗ · 4d ago Cached

An educational overview of knowledge distillation, covering its history, core concepts like softmax and temperature, types, scaling laws, and practical examples including DeepSeek-R1.
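The softmax-with-temperature idea mentioned in this overview can be illustrated with a minimal sketch. It is not taken from the linked thread; the logits and the function name `softmax_with_temperature` are hypothetical, chosen only to show how a higher temperature flattens a teacher's output distribution so the less likely classes carry more signal.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Return softmax(logits / temperature) as a list of probabilities."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical teacher logits for a 3-class problem (illustration only).
teacher_logits = [4.0, 2.0, 0.5]

sharp = softmax_with_temperature(teacher_logits, temperature=1.0)
soft = softmax_with_temperature(teacher_logits, temperature=4.0)

print("T=1:", [round(p, 3) for p in sharp])
print("T=4:", [round(p, 3) for p in soft])
```

At the higher temperature the top class keeps its rank but gives up probability mass to the others, which is the extra information a student model can learn from.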

#model-training

@JongwonPar9958: GLM-5.2 has a neat trick for reward hacking. They don't penalize the model, they detect the suspicious tool call, block…

X AI KOLs Timeline ↗ · 5d ago Cached

GLM-5.2 uses a technique to counteract reward hacking by detecting and blocking suspicious tool calls rather than penalizing the model, which prevents obfuscation seen in other methods.

#model-training

@servasyy_ai: Cursor just held their first conference. They released: - A 1.5T parameter model trained from scratch - Origin, a direct GitHub alternative - iOS app No one saw this coming

X AI KOLs Timeline ↗ · 2026-06-17 Cached

At its first conference, Cursor released a 1.5T-parameter model trained from scratch, Origin (a direct GitHub alternative), and an iOS app. The poster says no one saw these announcements coming.

#model-training

@mervenoyann: day 2 findings on this pipeline > it works, got map@50=0.8028 on road sign detection against human annotations, with on…

X AI KOLs Timeline ↗ · 2026-06-17 Cached

Merve (@mervenoyann) shares day-two findings from a pipeline that uses multiple small VLMs as judges for road sign detection, reaching mAP@50 = 0.8028 against human annotations with only 1.3k examples. The thread compares model rejection rates and discusses dataset shrinking, highly specific prompts, and plans to generalize the library.

#model-training

Get in here: Community model build thread

Reddit r/LocalLLaMA ↗ · 2026-06-16

A thread proposing a method for creating a community AI model using crowdsourced compute via Branch-Train-Stitch to build a Mixture-of-Experts model from independently trained submodels, with discussion of hardware requirements, participant involvement, and technical challenges.

#model-training

@liumengxinfly: Redis creator speaks out on X, saying that those who keep claiming Chinese models are distilled don't understand machine learning at all.

X AI KOLs Timeline ↗ · 2026-06-16 Cached

This article explains the technical principles of knowledge distillation in machine learning, pointing out that merely collecting output dialogues from ChatGPT/Claude cannot achieve effective distillation due to the lack of probability distribution information, and discusses the limitations of using generated data in SFT and pre-training.
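The point about missing probability distributions can be made concrete with a small sketch. It assumes the classic soft-target formulation of distillation (KL divergence between temperature-softened teacher and student distributions, scaled by T squared); the article may describe it differently. All logits and function names here are hypothetical. Two students that rank the non-sampled tokens very differently receive the same loss when only the teacher's sampled token is available, but different losses when the teacher's full distribution is.

```python
import math

def softmax(logits, temperature=1.0):
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def soft_target_loss(teacher_logits, student_logits, temperature=2.0):
    """Distillation loss that needs the teacher's full distribution."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return temperature ** 2 * kl_divergence(p, q)

def hard_label_loss(sampled_token, student_logits):
    """Cross-entropy against one sampled token, as in SFT on generated text."""
    return -math.log(softmax(student_logits)[sampled_token])

# Hypothetical next-token logits over a 3-token vocabulary.
teacher = [3.0, 2.5, -1.0]    # token 1 is nearly as likely as token 0
student_a = [3.0, 2.5, -1.0]  # matches the teacher
student_b = [3.0, -1.0, 2.5]  # confuses tokens 1 and 2

sampled = 0  # the only thing visible in a collected dialogue

hard_a = hard_label_loss(sampled, student_a)
hard_b = hard_label_loss(sampled, student_b)
soft_a = soft_target_loss(teacher, student_a)
soft_b = soft_target_loss(teacher, student_b)

print("hard-label loss:", round(hard_a, 4), round(hard_b, 4))
print("soft-target loss:", round(soft_a, 4), round(soft_b, 4))
```

The hard-label losses coincide because both students assign the same probability to the sampled token, while the soft-target loss separates them.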

#model-training

@qinzytech: https://x.com/qinzytech/status/2066585405479371092

X AI KOLs Timeline ↗ · 2026-06-15 Cached

A technical analysis of two approaches to building self-evolving AI agents: model-based (through architectures such as SSMs or transformers with fast-weight updates, together with training methods) and harness-based (through memory or a meta-harness that can rewrite itself). The author gives practical recommendations for different audiences.

#model-training

Observation: the best agent harness for each model will be from the model developer themselves

Reddit r/AI_Agents ↗ · 2026-06-01

A discussion on how AI models perform best with harnesses developed by their own creators, as third-party harnesses may cause underperformance despite strong benchmarks, citing examples like Claude Code for Claude and Codex for GPT.

#model-training

100 Trillion+ Pretraining data??? This is the largest data I've seen a model being trained on.

Reddit r/LocalLLaMA ↗ · 2026-06-01

A new AI model is being trained on over 100 trillion tokens, at least double the 27-50 trillion tokens of pretraining data used by other models such as Kimi, Mimo, and DeepSeek.

#model-training

@TheNoise2Signal: How does frontier training use 2,048 GPUs? Because there are five dimensions you can split work across - and at scale, …

X AI KOLs Timeline ↗ · 2026-05-25 Cached

Explains how frontier AI training uses up to 2,048 GPUs by splitting work across five dimensions, demystifying model training frameworks.

#model-training

@BohuTANG: During the development of Evot, I discovered that to get the best out of Anthropic's Opus series models, the official Claude Code approach is basically the optimal solution, hard to bypass. After in-depth analysis and quantitative verification of the Claude Code prompt, I found that during training they already...

X AI KOLs Timeline ↗ · 2026-05-23 Cached

While developing Evot, the author found that the official Claude Code approach is essentially the optimal way to get the most out of Anthropic's Opus models, because the agent harness behavior patterns are baked into the weights during training rather than achieved through prompt engineering alone. The author expects future agent harness competition to push behavior down into the model layer.

#model-training

Cursor Introduces Composer 2.5

Hacker News Top ↗ · 2026-05-18 Cached

Cursor released Composer 2.5, a major update to its AI coding assistant featuring improved intelligence, behavior, and training via targeted reinforcement learning and increased compute, built on Moonshot's Kimi K2.5.

#model-training

@AnjneyMidha: very cool a 2-3x speed up in training by essentially letting the model learn more flexibly in its early stages than rig…

X AI KOLs Following ↗ · 2026-05-14 Cached

A new training method achieves 2-3x speedup by allowing models to learn more flexibly in early stages, akin to homeschooling vs. factory education.

#model-training

@aiDotEngineer: Your Agent Can Now Train Models The argument from @mervenoyann: open source models have caught up. GLM 5.1 is leading t…

X AI KOLs Following ↗ · 2026-05-13 Cached

The talk by @mervenoyann demonstrates that open source models like GLM 5.1 have caught up to closed models, and shows how Hugging Face's ecosystem enables agents to train models, run inference, and build workflows.

#model-training

@WilliamBarrHeld: To train better open models, we need predictable scaling. Delphi is Marin’s first step: we pretrained many small models…

X AI KOLs Following ↗ · 2026-05-11 Cached

Marin AI researchers, led by William Barr Held, introduce Delphi, a methodology that pretrains small models to accurately predict the training outcomes of larger 25B-parameter runs. This research aims to establish predictable scaling for more efficient open-source AI model development.

#model-training

@akseljoonas: 3 weeks since ml-intern launched and we just hit 1M messages exchanged. that's 3.3 agent-years of ML research in 21 day…

X AI KOLs Following ↗ · 2026-05-11 Cached

ml-intern has processed over 1M messages in 3 weeks, enabling accelerated ML research with user projects including model training, architecture replication, and automation tasks.

#model-training

@FireworksAI_HQ: Frontier labs are betting AGI models will be so good you won't ever want to customize them. We think different. Buildin…

X AI KOLs Following ↗ · 2026-05-09 Cached

Fireworks AI announces its training platform in preview, allowing developers to train, fine-tune, and deploy custom AI models with full ownership of data and weights.
