self-training

#self-training

Model Collapse as Cultural Evolution

arXiv cs.CL ↗ · 2026-05-25 Cached

This paper reframes model collapse in LLMs as a cultural transmission phenomenon, showing that iterated learning theory predicts a non-monotonic trajectory of compositionality under self-training, confirmed across multiple languages and models.

0 favorites 0 likes

#self-training

@reach_vb: codex is writing a blogpost about its experiments in training a model all by itself

X AI KOLs Following ↗ · 2026-05-22

Codex is writing a blogpost about its experiments in training a model autonomously.

0 favorites 0 likes

#self-training

Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies

arXiv cs.CL ↗ · 2026-05-21 Cached

This paper presents evidence that self-training on language model outputs does not uniformly flatten language but restructures it, with surface markers (discourse connectives, hedges, em-dashes) increasing while deep syntactic structures (passives, subjunctives, parentheticals) collapse, formalized as the Structural Depth Hypothesis.

0 favorites 0 likes

#self-training

I Let a Small Model Train on Its Own Mistakes. It Reached 80% on HumanEval and Beat GPT-3.5 on Math

Reddit r/LocalLLaMA ↗ · 2026-05-14

A researcher trained small language models on their own self-generated coding mistakes and corrections, achieving 80% on HumanEval and surpassing GPT-3.5 on math, demonstrating effective self-improvement with minimal resources.

0 favorites 0 likes

self-training

Model Collapse as Cultural Evolution

@reach_vb: codex is writing a blogpost about its experiments in training a model all by itself

Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies

I Let a Small Model Train on Its Own Mistakes. It Reached 80% on HumanEval and Beat GPT-3.5 on Math

Submit Feedback