Tag
Google released Agents CLI, a unified command-line tool that integrates scaffolding, evaluation, and deployment of agents on Google Cloud, enabling coding assistants to practice agentic engineering without context switching.
A discussion on techniques for keeping LLM agents consistently in character, highlighting an often overlooked aspect of steering.
A hackathon for building AI agents that can handle business operations, sponsored by NVIDIA, Stripe, and Nous Research, with prizes including cash, DGX Spark, and Stripe credits.
Anton Osika shares that Lovable designer Niklas's personal mission is to create the best interface for agents, emphasizing the importance of extreme ownership in teams.
Jerry Liu agrees that both agents and software have value, but notes that their interfaces are different—agents use simple communication interfaces like chat, while software tools need tailored interfaces for specific tasks.
The article compares two patterns for deploying AI agents in the cloud: directly in sandboxes vs decoupling components. It explains the limitations of the sandbox approach due to cloud failures, and highlights Anthropic's Claude Managed Agent as a solution that decouples session store, agent runtime, and sandbox for resilience.
A detailed article introducing Loop Prompt Engineering, a method to automate prompt optimization by iteratively rewriting prompts based on evaluation against a dataset, with emphasis on avoiding recursive traps.
The LangChain team has open-sourced the tutorial 'Deep Agents from Scratch', which deconstructs the core design patterns of mainstream agents from scratch, covering task planning, context offloading to a file system, and sub-agent isolation. It includes 5 progressive notebooks, allowing you to build a complete deep research agent hands-on.
Vercel launched Eve, an agent framework with built-in observability, debugging, durable execution, and sandboxed compute, aiming to simplify production agent development.
LangChain shares a discussion between Brace Sproul and Jake Broekhuizen comparing the use of MCPs versus CLIs when building agents.
BlitzOS is a free and open-source Mac app that allows users to run multiple AI agents (like Claude and Codex) in one place, connect them to any browser app, and manage their progress via a live kanban board.
OpenAI's Codex saw over 5x growth in active users in the first half of 2026, with non-developer teams like Legal and Finance driving the fastest adoption. Usage shifted towards longer tasks, and Codex now generates 99.8% of OpenAI's weekly output tokens.
A curated list of 93 free projects for AI engineers covering MCP, Agents, and RAG, shared by Matt Dancho.
The article explores the problem of AI agents trusting and selecting among over 1,300 paid services on Coinbase's Base blockchain via the x402 standard, and asks the community for ideas on how to evaluate service reliability.
A staff engineer describes how LLM agents have evolved by 2026 to become reliable collaborators for coding, debugging, and codebase research, while humans retain responsibility for judgment and review.
OpenAI is fully utilizing the agent tool Codex internally for cross-departmental work, demonstrating how agentic tools are reshaping work.
Vercel released AI SDK 7, a major update to their TypeScript SDK for building AI applications, adding enhanced agent development, reasoning control, tool context, runtime context, and more.
Experiences building 6 AI agents reveal that model performance was not the bottleneck; instead, other practical issues consistently caused failures.
OpenAI employees are using ChatGPT Codex to automate personal tasks like groceries and taxes, as shared by an employee, highlighting the internal adoption of agentic tools.
OpenAI's internal use of Codex agents is accelerating work across departments, offering an early glimpse into how agentic tools may reshape workflows.