capacity

#capacity

Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

This paper investigates why larger models outperform smaller ones, attributing it to reduced gradient interference and better resource allocation, allowing them to learn rare and complex tasks even with infinite data. Experiments on synthetic data and OLMo models verify that larger models avoid overwriting rare-task features due to weaker gradient updates for common tasks.

0 favorites 0 likes

#capacity

@sama: we will offer this until we sell out of our current allocation for this program. (we will make sure to leave enough cap…

X AI KOLs ↗ · 2026-05-19 Cached

Sam Altman announces that a program offering compute capacity will be available until the current allocation sells out, with plans to resume later while reserving capacity for ChatGPT and Codex.

0 favorites 0 likes

#capacity

@sama: customers are increasingly asking us for certainty on capacity. as models get better, we expect that the world will be …

X AI KOLs ↗ · 2026-05-19 Cached

Sam Altman announces OpenAI's Guaranteed Capacity, offering discounted tokens for 1-3 year commitments to provide customers with capacity certainty.

0 favorites 0 likes

capacity

Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention

@sama: we will offer this until we sell out of our current allocation for this program. (we will make sure to leave enough cap…

@sama: customers are increasingly asking us for certainty on capacity. as models get better, we expect that the world will be …

Submit Feedback