Tag
A Hacker News user asks if anyone is using Google's A2A agent-to-agent protocol, noting confusion six months ago and the rise of MCP, but now seeing potential for agent interaction.
Agentic Context Engine (ACE) is an open-source Python tool that adds persistent learning to AI agents via a Skillbook of strategies refined from execution traces and feedback.
This paper presents Connect the Dots (CoD), a framework for training LLMs via reinforcement learning to develop meta-capabilities for long-lifecycle agents, enabling continuous learning and cross-domain generalization.
Azalia Mirhoseini highlights DeLM, a decentralized language model approach where agents communicate via shared state, achieving ~10% improvement on SWE-bench Verified with Gemini-3 Flash at less than half the cost.
MeshPilot is an AI workspace for terminals, tasks, and agents.
An open source UI kit with 15 components for document viewing (PDF, DOCX, XLSX) including bounding box citations, file upload, e-signature, and file system integration, released under MIT license.
Propane is a tool that provides automatic customer context for product teams and agents, launching on Product Hunt.
A compilation of six ready-to-use Claude Skills for video, covering auto-generated animated videos, AI-assisted rough cuts, videos rendered from React components, a multimedia generation toolbox, a Chinese-language editing agent, and open-source tools for writing video prompts.
The tweet discusses Microsoft's SkillOpt paper, which improved GPT-5.5 accuracy from 41% to 80% without retraining by using a small skills file to guide the agent.
A developer frustrated with manual trace debugging builds a tool for AI agents that compares replays against reference runs to identify where behavior first drifted.
Membrane released over 3,000 integration skills for AI agents, simplifying SaaS app interactions by handling auth, actions, and glue code. The skills are built on an open spec and include examples like Gmail and Slack.
Hugging Face and collaborators launch Agentic Resource Discovery (ARD), an open specification for dynamically discovering tools, skills, and agents at runtime, moving beyond static installation.
Flue 1.0 Beta is a TypeScript framework for building AI agents with zero LLM lock-in, featuring workflows, autonomous agents, and channel integrations.
A developer building autonomous billing agents discusses the difficulty of reconstructing why an agent made a decision after the fact, and describes building a tool (Attova) that records decisions with evidence, alternatives, and confidence to improve debugging and human review.
The author built a Healthy Food MCP server and learned that agents perform better with many narrow, constrained tools rather than one flexible tool, emphasizing the need for a boring tool surface to reduce LLM hallucination.
Kevin Niparko speaks on stage about how to keep AI agents running continuously for days or even weeks without having to keep a laptop open.
A developer built a new tracer for debugging AI agents that auto-detects loops, logs sessions as readable timelines, and allows side-by-side comparison, and is seeking feedback.
Framer 3.0 launches with new features including agents, branching, community integration, and a redesigned interface.
Someone analyzed the 196 startups from YC's 2026 spring batch and found that 95% use AI, 85% are AI-native, and the real keyword is agents rather than AI.
Introduces IRTS-ToolBench, a benchmark of 1,700 questions for evaluating LLMs and AI agents on irregular time series question answering via tool-grounded reasoning, covering 10 task types across 13 domains.