Tag
This paper introduces Queen-Bee, a governed multi-agent architecture for enterprise MCP orchestration that separates planning and execution via a BeeSpec intermediate representation, achieving high task success rates with zero governance failures in prototype evaluations.
MCP-Persona is a benchmark evaluating LLM agents on personalized tools interacting with individual accounts and local databases. Experiments reveal significant challenges for state-of-the-art agents in personalized tool use.
According to @smthomas3, most companies with multiple engineering teams are building MCP servers, referencing a HN discussion on whether MCP is dead and input from OpenAI's @mxstbr.
A technical critique of the Model Context Protocol (MCP) arguing that it consumes excessive context window tokens, has low operational reliability, and overlaps with existing CLI/API approaches, with measurements from Quandri's stack showing 10.5% context usage.
The article argues that in 2026, the key differentiator for AI value is not model capability but data access through integration protocols like MCP, which connect models to real business data such as CRMs and accounting software, making connected workflows more important than benchmark scores.
Base launched Base MCP, a tool using the Model Context Protocol to let AI agents interact with crypto wallets and DeFi apps via natural language, with OAuth 2.1 authentication.
The MCP 2026-07-28 release candidate introduces a stateless core, extensions for server-rendered UIs and long-running tasks, improved auth, and a formal deprecation policy, simplifying deployment and scaling.
A developer built a zero-code visual MCP client within AgentSwarms that allows testing remote MCP servers directly in the browser, demonstrated with Cloudflare's free MCP server for documentation.
Google launches Gemini Spark, an always-on AI agent powered by Gemini 3.5 Flash, with background operation, integrations across Workspace and third-party apps via MCP, and updates to Antigravity.
A developer built a Model Context Protocol (MCP) index containing 3 million arXiv papers to help LLMs retrieve accurate research citations and reduce hallucinations, and is now seeking testers for feedback.
This paper presents NIMO Controller, a self-driving laboratory orchestrator based on the Model Context Protocol (MCP), which provides a unified interface for both human users and AI agents through a visual programming interface and MCP-based tool discovery.
The author describes a common user onboarding problem with MCP servers—users opening the endpoint in a browser and seeing a 401 error—and shares a simple hack: returning an HTML page that explains how to properly add the server to an LLM client, which drastically reduced support tickets.
A defense of MCP (Model Context Protocol) against criticism that it puts garbage in context, noting that modern tools like Claude Code, Codex, and Cursor implement progressive disclosure and load MCP tools on demand, making the complaint outdated. The author argues MCP is best for cloud-hosted platforms requiring authentication and discoverability.
OpenAI introduces a new generation of apps in ChatGPT with an open-source Apps SDK built on the Model Context Protocol, allowing developers to reach 800+ million users. Initial partner apps from Booking.com, Canva, Coursera, Figma, Expedia, Spotify, and Zillow are available today with more launching later this year.
n8n-MCP is an MCP server that gives AI assistants comprehensive access to n8n's 1,650 workflow automation nodes, enabling them to understand and work with n8n nodes effectively. It provides structured access to node properties, operations, documentation, templates, and community integrations, and can be self-hosted or used via a cloud dashboard.
Anthropic shares engineering best practices for designing, evaluating, and optimizing tools for AI agents, specifically utilizing the Model Context Protocol (MCP) and Claude Code to improve agent performance.
Anthropic publishes a guide defining context engineering as the evolution of prompt engineering, focusing on curating optimal context tokens for AI agents to maintain performance and focus during multi-turn inference.