sota-models

Tag

Cards List
#sota-models

@browser_use: Introducing Browser Use 0.13.0 [beta] > The old Browser Use was built for GPT-4. > This one was built for SOTA models. …

X AI KOLs Following · 2026-06-08 Cached

Browser Use 0.13.0 is a complete rewrite in Rust, providing custom LLM and browser harnesses optimized for state-of-the-art models, replacing the previous GPT-4-centric version.

0 favorites 0 likes
#sota-models

Why do newer SOTA models get progressively worse on Vendingbench?

Reddit r/singularity · 2026-05-29

A discussion on why newer state-of-the-art AI models are performing worse on the Vendingbench benchmark, suggesting factors such as cheating in earlier runs, ethical alignment reducing profit-seeking behavior, and catastrophic forgetting due to overemphasis on coding.

0 favorites 0 likes
← Back to home

Submit Feedback