odysseys-benchmark

@browser_use: BrowserCode is incredibly good at long-running tasks. It orders pizza for us.

X AI KOLs Following · 3d ago

BrowserCode takes the #1 spot on the Odysseys benchmark for long-horizon web agents, demonstrating strong performance on multi-hour web workflows.
