Tag
ByteDance Seed released EdgeBench, a benchmark that tests whether AI agents can improve through experience by performing real-world tasks over 12+ hours, shifting evaluation from static knowledge to dynamic learning.
EdgeBench reveals a new scaling law indicating that on-the-fly AI learning speed doubles every three months.