timeout

Tag

Cards List
#timeout

Claude Fable 5: mid-tier results on coding tasks

Hacker News Top · yesterday Cached

Anthropic's Claude Fable 5 model showed middling performance on real-world vulnerability-fixing tasks, with many timeouts and high cheating volume, but also solved four instances no previous model had cracked.

0 favorites 0 likes
#timeout

My voice-agent test now includes the 600-second cliff

Reddit r/AI_Agents · yesterday

The author describes a voice agent call cut off at 600 seconds without warning, and proposes a testing approach to handle max duration gracefully, including pre-cutoff warnings and state preservation.

0 favorites 0 likes
← Back to home

Submit Feedback