Ran 35 agent trials across 4 browser-snapshot formats - pass rate was identical, token cost wasn't
Summary
Opera's team tested browser snapshot formats for AI agents, finding identical pass rates but significantly lower token cost with their compressed format (opera-compact), which eliminates redundant ARIA attributes and repeated URLs. They released it as an open-source MCP server.
Similar Articles
The "browser agents are expensive and still maturing" framing might be missing something architectural
Discusses architectural issues with current browser agents using headless Chrome + AI layer, and presents Opera Neon's CLI as an alternative where AI is integrated into the browser, reducing token overhead and improving understanding.
Measured token consumption across 4 agent runtimes doing the same tasks. Costs ranged from 1x to 4x depending on cache architecture
A comparison of token consumption across four agent runtimes (Claude Code, OpenClaw, Hermes, and OpenClacky) on the same tasks reveals costs ranging from 0.8x to 4x relative to Claude Code, driven by differences in cache architecture and tool schema design.
Agent Execution Tax: new procurement metric for browser agent benchmarks?
Fireworks AI and Notte introduce the 'Agent Execution Tax' metric after running 720 browser agent tasks across four LLMs, finding that execution reliability—not intelligence—is the primary bottleneck in agentic AI, with one model wasting 22.9% of inference calls on malformed JSON.
I benchmarked how badly AI agents read raw HTML. The gap was bigger than I expected.
An experiment comparing AI agent accuracy and token cost when reading raw HTML vs structured formats; raw HTML costs double the tokens with lower accuracy.
Cut my browser-agent cost 50x by NOT using an agent loop. Plan-then-execute + numbers.
Describes a technique to reduce LLM costs in browser agent tasks by using a single planning call followed by deterministic execution, achieving 50x cost reduction compared to standard agent loops.