world-events

Tag

Cards List
#world-events

FutureSim: Replaying World Events to Evaluate Adaptive Agents

Hugging Face Daily Papers · yesterday Cached

FutureSim replays chronological world events to benchmark AI agents' long-term predictive abilities, finding that even the best agent achieves only 25% accuracy.

0 favorites 0 likes
← Back to home

Submit Feedback