Tools: Is This a Technical Victory, or a Price War Victory?

Reddit r/artificial 05/18/26, 03:17 PM News

ai-coding openrouter kilocode chinese-models token-usage price-war technical-victory

Summary

Analysis of OpenRouter data shows that Chinese AI models have become the most used in Kilo Code's coding agent, accounting for 58% of token usage, challenging the dominance of Claude and GPT due to lower cost and longer context windows.

If you only follow discussions on social media, you might think AI coding is still dominated by Claude, GPT, and Gemini. But Kilo Code’s usage data on OpenRouter paints a somewhat counterintuitive picture: over the past 30 days, the top three most-used models on Kilo Code were Step 3.5 Flash, MiniMax M2.5, and Ling-2.6-1T. Together, they accounted for roughly 3.15T tokens, or about 58% of Kilo Code’s total token usage over the same period. In other words, in this real-world AI coding agent usage scenario, Chinese models are no longer just backup options. They have become a major source of token consumption. Kilo Code’s OpenRouter data does not necessarily prove that Chinese models have fully surpassed Claude or GPT. But it does show at least one thing: in high-frequency, high-token, highly automated AI coding agent workflows, Chinese models have already entered the core of real production usage. Why is this happening? Is it because Chinese models are cheaper, offer longer context windows, and are better suited for workloads that consume large amounts of tokens?

Original Article

Similar Articles

Surprising screenshot - Most token usage is non-coders (openrouter ranking)

Reddit r/LocalLLaMA

OpenRouter usage stats show 6 of the top 10 "coding agent" apps are actually used by non-coders, suggesting broader adoption beyond developers.

AI Gateway Production Trends (8 minute read)

TLDR AI

Vercel's AI Gateway data shows Anthropic leads in spending, Google in token volume, and agentic workloads carrying 59% of token volume. OpenAI's spend share tripled after recent model updates.

@akshay_pachaar: https://x.com/akshay_pachaar/status/2053166970166772052

X AI KOLs Timeline

The article discusses a shift in AI agent tool usage from the 'MCP vs CLI' debate to 'Code Mode,' where agents write code to dynamically import tools, significantly reducing context window usage. It highlights Anthropic's approach and Cloudflare's implementation, demonstrating a 98.7% reduction in token consumption for specific tasks.

@_avichawla: The No. 1 deep researcher beats Claude and ChatGPT with a trick neither uses. I studied the open-source architecture be…

X AI KOLs Timeline

The Onyx open-source deep research system achieves top ranking by stripping search access from its orchestrator agent, forcing it to decompose queries into focused research threads. Its three-phase pipeline and two-level architecture prevent information distortion and premature answering, outperforming proprietary solutions from OpenAI, Anthropic, and Google.

@_avichawla: A smarter Claude model burns more tokens, not fewer! And it's not a minor 3-5% difference. But 54% higher token usage. …

X AI KOLs Following

The article analyzes why smarter AI agents like Claude consume more tokens when interacting with human-centric backends like Supabase due to inefficient context discovery. It introduces InsForge, an open-source backend tool designed for agents that provides structured context to significantly reduce token usage and manual interventions.