@dwarkesh_sp: We pre-train LLMs on the whole of the internet. You might think this explains how they learn so many emergent capabilit…

X AI KOLs Timeline News

Summary

Dwarkesh Patel tweets about Sergey Levine's argument that emergent capabilities in LLMs arise from compositionality, not just from training data.

We pre-train LLMs on the whole of the internet. You might think this explains how they learn so many emergent capabilities: the knowledge is implicit in the training data. But in fact models can do things that were never demonstrated anywhere in training! @svlevine argues that the real source of emergent capabilities is compositionality:
Original Article

Similar Articles