Tools: Is This a Technical Victory, or a Price War Victory?
Summary
Analysis of OpenRouter data shows that Chinese AI models have become the most used in Kilo Code's coding agent, accounting for 58% of token usage, challenging the dominance of Claude and GPT due to lower cost and longer context windows.
Similar Articles
Surprising screenshot - Most token usage is non-coders (openrouter ranking)
OpenRouter usage stats show 6 of the top 10 "coding agent" apps are actually used by non-coders, suggesting broader adoption beyond developers.
AI Gateway Production Trends (8 minute read)
Vercel's AI Gateway data shows Anthropic leads in spending, Google in token volume, and agentic workloads carrying 59% of token volume. OpenAI's spend share tripled after recent model updates.
@akshay_pachaar: https://x.com/akshay_pachaar/status/2053166970166772052
The article discusses a shift in AI agent tool usage from the 'MCP vs CLI' debate to 'Code Mode,' where agents write code to dynamically import tools, significantly reducing context window usage. It highlights Anthropic's approach and Cloudflare's implementation, demonstrating a 98.7% reduction in token consumption for specific tasks.
@_avichawla: The No. 1 deep researcher beats Claude and ChatGPT with a trick neither uses. I studied the open-source architecture be…
The Onyx open-source deep research system achieves top ranking by stripping search access from its orchestrator agent, forcing it to decompose queries into focused research threads. Its three-phase pipeline and two-level architecture prevent information distortion and premature answering, outperforming proprietary solutions from OpenAI, Anthropic, and Google.
@_avichawla: A smarter Claude model burns more tokens, not fewer! And it's not a minor 3-5% difference. But 54% higher token usage. …
The article analyzes why smarter AI agents like Claude consume more tokens when interacting with human-centric backends like Supabase due to inefficient context discovery. It introduces InsForge, an open-source backend tool designed for agents that provides structured context to significantly reduce token usage and manual interventions.