agent-systems

Tag

Cards List
#agent-systems

@_akhaliq: LongMINT Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems

X AI KOLs Following · 2026-05-21 Cached

LongMINT is a benchmark for evaluating memory under multi-target interference in long-horizon agent systems.

0 favorites 0 likes
#agent-systems

@dair_ai: If you design production agent systems, this matters. Most devs accidentally let their framework defaults make critical…

X AI KOLs Following · 2026-05-20 Cached

This paper introduces the concept of the stochastic-deterministic boundary (SDB) for production LLM agents and provides a methodology for selecting architectural patterns to improve reliability and performance.

0 favorites 0 likes
#agent-systems

@Khazix0918: https://x.com/Khazix0918/status/2056894400320708671

X AI KOLs Timeline · 2026-05-20 Cached

Summary of the core announcements at Google I/O 2026 developer conference, including AI models, products, and Agent systems such as Gemini 3.5 Flash, Gemini Omni Flash, Antigravity 2.0, Gemini Spark, etc.

0 favorites 0 likes
#agent-systems

Learning Transferable Topology Priors for Multi-Agent LLM Collaboration Across Domains

arXiv cs.CL · 2026-05-19 Cached

This paper proposes TopoPrior, a framework that learns transferable topology priors from offline reference collaboration graphs to generate initial topologies for multi-agent LLM collaboration across domains, significantly reducing online search overhead and token consumption.

0 favorites 0 likes
#agent-systems

AI memory demos show week one , Production is a month six problem lol

Reddit r/AI_Agents · 2026-05-18

The article discusses the gap between initial AI memory demos and long-term production challenges, where memory degrades due to contradictions, drift, and outdated preferences, and benchmarks fail to capture these issues.

0 favorites 0 likes
#agent-systems

Are we all quietly rebuilding memory systems because current AI memory doesn’t actually work long-term?

Reddit r/AI_Agents · 2026-05-15

The article discusses the common failures of current AI memory solutions in production, such as stale facts, summary drift, and vendor lock-in, suggesting that the real bottleneck is memory governance rather than retrieval.

0 favorites 0 likes
#agent-systems

How are you actually saving cost on your agent systems?

Reddit r/AI_Agents · 2026-05-10

The article discusses the challenges of cost optimization and FinOps for AI agent systems, highlighting issues with unpredictable token bills, lack of granular attribution tools, and strategies like caching and hard caps.

0 favorites 0 likes
#agent-systems

@apurvasgandhi: Sub-agents are a promising inference-time scaling primitive: • Expand an agent's working memory • Divide-and-conquer ha…

X AI KOLs Timeline · 2026-05-08

RAO (Recursive Agent Optimization) is an end-to-end reinforcement learning approach for training LLM agents to spawn, delegate to, and coordinate with recursive copies of themselves, turning recursive inference into a learned capability.

0 favorites 0 likes
#agent-systems

@ghumare64: https://x.com/ghumare64/status/2052825541057626258

X AI KOLs Timeline · 2026-05-08 Cached

An X thread arguing that production AI agents need operational scaffolding (runbooks, permissions, logs, rollback, verification) rather than just better prompts. The author draws parallels to DevOps evolution, stating that prompts provide advice while runbooks provide control, and that agent systems require platform engineering solutions for permissions, state management, verification, observability, and rollback capabilities.

0 favorites 0 likes
#agent-systems

@fankaishuoai: Understanding Palantir is more valuable than any AI analysis report. Its AIP platform is today's agent platform like Claude Code / Codex. Its Ontology (knowledge graph) is the enterprise Wiki — Markdown…

X AI KOLs Timeline · 2026-05-08

The article analyzes the architecture of Palantir's AIP platform, arguing that its combination of ontology knowledge base, agent platform, and forward deployed engineers represents the future of the software industry. It points out that the platform achieved a breakthrough in 2023 by integrating LLMs (such as Claude), and this model has been copied by Anthropic and OpenAI.

0 favorites 0 likes
← Previous
← Back to home

Submit Feedback