sequential-training

Tag

Cards List
#sequential-training

@kyutai_labs: We train 6B-param models on Common Crawl ordered sequentially from 2018 to 2025, so that the freshest data is seen last…

X AI KOLs Following · 2026-05-26 Cached

Kyutai Labs trains 6B-parameter models on Common Crawl data ordered sequentially from 2018 to 2025, showing that performance drop on recent years disappears, and open-sources the checkpoints for continual learning research.

0 favorites 0 likes
#sequential-training

Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks

arXiv cs.LG · 2026-05-08 Cached

The paper addresses catastrophic forgetting in sequentially trained early-exiting neural networks and proposes two methods based on Elastic Weight Consolidation and Learning without Forgetting to preserve earlier exit performance while adding new ones.

0 favorites 0 likes
← Back to home

Submit Feedback