ActiveGraph announces two new papers on agent memory (LongMemEval) and self-improvement regimes, along with reference agents, pack templates, and upcoming meetups in Seattle and San Francisco.
NVIDIA introduces the Agent Toolkit, an open modular foundation with models, tools, skills, and a secure runtime to help businesses build specialized, trustworthy AI agents for various industries.
A developer introduces agent-rigor, an open-source framework that brings runtime discipline and traditional SDLC mechanics to AI coding assistants to prevent common agent failures such as scope creep and fix-forward loops.
TokenArch Lanterns is a framework for exploring and developing standards for autonomous agents.
A developer presents a tier list of AI agent frameworks for local autonomous workflows, ranking OpenClaw as S-tier and Gemini Spark as A-tier, while criticizing cloud-locked wrappers for requiring excessive manual approvals.
Firecrawl launches Keyless mode, which allows free trials without an API key. It supports real-time web scraping, page interactions, and PDF parsing, with 1,000 free credits per month, and is suitable for agent tools such as Claude Code and Cursor.
LedgerAgent is a method for customer service agents that maintains task state in a separate ledger to improve policy adherence and state management during tool calling. It improves average pass^k over standard approaches across four domains.
zkit is a provider-agnostic toolkit of small, independent Go packages for building AI applications, including agent loops, tool systems, guardrails, history compaction, and an LLM provider layer.
This development pipeline first creates a rough prototype to elicit user feedback, turning every 'that's not right' reaction into a checkable rule. Then AI agents independently develop and validate against those rules, catching issues early and avoiding self-deception.
This paper introduces MemSlides, a hierarchical memory framework for personalized slide generation that separates long-term user profiles, working memory for session constraints, and tool memory for localized edits, enabling multi-turn revision without full regeneration.
Vercel released Eve, an open-source filesystem-first durable backend agent framework, with comprehensive documentation available.
Maka is a local-first desktop AI workbench built with Electron, supporting multi-model connections, tool calls, permission control, and privacy protection, along with integrated bot access, local memory, and more. The author also mentioned rebuilding the OpenCLI app and planning to refactor MakeAgent's UI.
OPD-Evolver proposes a self-evolving agent framework using slow-fast co-evolution and on-policy self-distillation to enhance memory management and policy learning, outperforming existing methods like ReasoningBank and Skill0 across multi-domain benchmarks.
An open-source project teaches you to build a simplified version of Claude Code from scratch, thoroughly explaining the harness engineering of AI agents. It has received 66.5K stars.
OpenSquilla is an open-source project that enables self-organizing skill orchestration for agents via MetaSkill 3.0, combined with intelligent routing to reduce token costs. The author integrated it into WeSight, demonstrating how a single sentence can convert a WeChat public account article into a Xiaohongshu post, showcasing the potential for agents to self-assemble workflows.
Hermes has evolved from a basic agent framework into a multifaceted ecosystem, with the community contributing four high-quality open-source tools covering automated short-video production, local image generation, removing the machine-written tone from AI text, and VS Code integration.
A guide to setting up and using the NousResearch Hermes Agent, a CLI tool for creating and managing autonomous AI agents, including background sessions, context compression, and session storage.
Uniqent is an open-source tool that lets you export an AI agent's memory, persona, and MCP setup as a single file, enabling one-click installation across different frameworks like OpenClaw and Hermes. It's in early development and seeking contributors on GitHub.
The author built SmithersBot, an open-source agent harness that orchestrates Claude Code and Codex to pursue long-term goals over weeks by breaking plans into tasks, requiring approvals, and using checkpoints to avoid degradation.
Memento-Skills is a self-evolving agent framework where agents learn from failures and rewrite their own skills, improving over time through a Read-Execute-Reflect-Write loop. It was tested on HLE and GAIA benchmarks and supports open-source LLMs like Kimi, MiniMax, and GLM.