Tag
Surprising new results show that for large LMs with enough compute, the best data filter might be no filter, as they tolerate low-quality data well.