Tag
An analysis suggesting that AI companies like Anthropic and OpenAI may be spending over $1000 for every $100 in revenue, highlighting unsustainable economics in the LLM market.
Lowfat is a lightweight CLI filter that reduces AI token costs by stripping unnecessary output before it reaches your agent, claiming up to 91.8% token savings. It provides shell integration, plugin support, and transparent rewriting for tools like Claude Code and OpenCode.
AI agents are causing runaway token consumption, turning overspend into a production incident category. The article highlights cases like a single engineer's $1.3M OpenAI bill and Uber burning its annual AI budget in four months, and asks the community how they are capping agent spending.
A comparison of the cost per million tokens for running LLMs locally on Apple Silicon hardware versus using cloud inference via OpenRouter, finding that local inference is typically 3x more expensive and slower.
The article notes that the price of LLM intelligence has dropped 100-fold in 18 months, and argues that this cost reduction will drive demand to expand outward, countering purely pessimistic views.