token-sizes

Tag

Cards List
#token-sizes

100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on.

Reddit r/LocalLLaMA · 3d ago

A new AI model is being trained on over 100 trillion tokens, doubling the typical pretraining data size of 27-50 trillion tokens used by other models like Kimi, Mimo, and DeepSeek.

0 favorites 0 likes
← Back to home

Submit Feedback