@morganlinton: I asked Teknium, who is probably one of the smartest agent devs in the world, what he did recently to speed up tool cal…

X AI KOLs Following News

Summary

Teknium shares recent performance improvements for tool calling in AI agents, including deferring imports, cutting 47% of per-conversation function calls, and deferring compression feasibility checks, with links to working code on GitHub.

I asked Teknium, who is probably one of the smartest agent devs in the world, what he did recently to speed up tool calling. This is what he shared. So much better than an article or a deck, real examples of working code.
Original Article
View Cached Full Text

Cached at: 05/21/26, 05:36 PM

I asked Teknium, who is probably one of the smartest agent devs in the world, what he did recently to speed up tool calling.

This is what he shared. So much better than an article or a deck, real examples of working code.

I mean, he’s kinda famously a 10x engineer, not that many of them on the planet, and very few who share as much about his process, code, and workflow than Tek.

We’re very lucky.

Similar Articles

@akshay_pachaar: https://x.com/akshay_pachaar/status/2053166970166772052

X AI KOLs Timeline

The article discusses a shift in AI agent tool usage from the 'MCP vs CLI' debate to 'Code Mode,' where agents write code to dynamically import tools, significantly reducing context window usage. It highlights Anthropic's approach and Cloudflare's implementation, demonstrating a 98.7% reduction in token consumption for specific tasks.