autonomous-agents

Tag

Cards List
#autonomous-agents

@julien_c: https://x.com/julien_c/status/2069144929100571134

X AI KOLs Following · 19h ago Cached

An LLM was given access to a thermal camera pointing at the Raspberry Pi it runs on, and it began conducting experiments by toggling the fan to observe temperature changes.

0 favorites 0 likes
#autonomous-agents

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069064122218717387

X AI KOLs Timeline · yesterday Cached

This article explores how AI agents can automatically write and optimize their skill files using techniques like SkillOpt from Microsoft Research, which treats skill documents as trainable state and delivers significant performance improvements. It addresses the challenge of manual skill tuning and presents frameworks like GEPA and EvoSkill as evolutionary approaches.

0 favorites 0 likes
#autonomous-agents

TokenArch Lanterns - Exploring Autonomous Agent Standards

Reddit r/AI_Agents · yesterday

TokenArch Lanterns is a framework for exploring and developing standards for autonomous agents.

0 favorites 0 likes
#autonomous-agents

@pallavishekhar_: AI Coding Agent SWE - High-Level Architecture This shows the main pieces of an autonomous coding agent and how a GitHub…

X AI KOLs Timeline · 3d ago Cached

A detailed thread explaining the high-level architecture of the SWE AI coding agent, showing how a GitHub issue flows through ingestion, an orchestrator, model gateway, tools, code intelligence, sandbox environment, PR builder, guardrails, and observability to autonomously produce a pull request.

0 favorites 0 likes
#autonomous-agents

@omarsar0: https://x.com/omarsar0/status/2068008743153832264

X AI KOLs Following · 3d ago Cached

The article explains the shift from manually prompting coding agents to designing automated loops that prompt them, detailing what these loops are, their historical evolution, and the components needed to build them in production.

0 favorites 0 likes
#autonomous-agents

Atomic Mail Agentic

Product Hunt · 4d ago

Atomic Mail Agentic enables AI agents to autonomously read, send, and react to emails, streamlining email management and automation.

0 favorites 0 likes
#autonomous-agents

Why is every "autonomous agent" built for companies and not for the people?

Reddit r/AI_Agents · 5d ago

The article questions why most autonomous agents are developed for business use rather than for individual users, pointing out a gap in AI accessibility.

0 favorites 0 likes
#autonomous-agents

Generative-Model Predictive Planning for Navigation in Partially Observable Environments

arXiv cs.AI · 5d ago Cached

This paper introduces BeliefDiffusion, a framework combining diffusion models to represent multimodal belief distributions and Model Predictive Control for planning in partially observable environments, achieving better navigation success and path efficiency than baselines.

0 favorites 0 likes
#autonomous-agents

For the first time ever, 8 Codex-AutoResearch agents BRING LIFE TO A ROBOT FLEET achieving end-to-end success in solving a task in the physical world with with NO HUMAN BRIDGE in between...SELF IMPROVING a part of Nvidia Gear Lab

Reddit r/singularity · 5d ago

Researchers at Nvidia Gear Lab achieved a milestone where 8 Codex-AutoResearch agents autonomously controlled a robot fleet to solve a physical world task without human intervention, demonstrating self-improvement.

0 favorites 0 likes
#autonomous-agents

@xieike: do you understand what iPhone + Mac Mini M4 + Claude Opus 4.8 actually means > your autonomous agents run 24/7 at home …

X AI KOLs Timeline · 5d ago Cached

A guide to setting up a local AI agent framework using iPhone, Mac Mini M4, and Claude Opus 4.8, allowing autonomous agents to run 24/7 at home, handle tasks, and improve over time.

0 favorites 0 likes
#autonomous-agents

Your Agent Has a Genome: Sequence-Level Behavioral Analysis and Runtime Governance of LLM-Powered Autonomous Agents

arXiv cs.AI · 2026-06-16 Cached

This paper introduces Base Sequence Analysis, a framework that encodes LLM agent runtime behavior into compact sequences, revealing high-risk patterns like the 'P-X-P' trigram and a verification deficit. It presents Governor, a runtime intervention system that improves task success by 6.2% and reduces token consumption by 44%.

0 favorites 0 likes
#autonomous-agents

Minimal Oversight: Uncertainty-Aware Governance for Delegated AI Systems

arXiv cs.AI · 2026-06-16 Cached

The paper proposes the Minimum Sufficient Oversight Principle (MSO) for governing delegated AI systems, deriving mathematical solutions for autonomy allocation and trust calibration, and introduces concepts like water-filling allocation and masking pathology.

0 favorites 0 likes
#autonomous-agents

SIMMER: Benchmarking Latent Failures in LLM Executable Planning with a World Model

arXiv cs.CL · 2026-06-15 Cached

Introduces Simmer, a benchmark for evaluating latent failures in LLM-generated executable plans using a human-curated symbolic world model in the kitchen domain. Experiments show frontier LLMs achieve at most 17% error-free plans, with up to 56% containing latent failures, and counterfactual foresight simulation reduces failures significantly.

0 favorites 0 likes
#autonomous-agents

@omarsar0: How to effectively run autonomous long-running coding agents? This is one of the most exciting discussions on agents I'…

X AI KOLs Following · 2026-06-12 Cached

A recorded discussion about effectively running autonomous long-running coding agents, including insights on goal setting, model selection, and best practices, made freely available.

0 favorites 0 likes
#autonomous-agents

What does it take for an AI agent to complete real world tasks?

Reddit r/openclaw · 2026-06-12

This article discusses the key requirements for AI agents to successfully complete real-world tasks: a real phone number, email address, and payment method, highlighting products like AgentLine, Agent Mail, and Agent Card that provide these capabilities.

0 favorites 0 likes
#autonomous-agents

I built an autonomous civilization engine where the AI plays the game for you. You just drop a few LLM agents onto the grid and watch. They figure out how to farm, reproduce, build temples, and die of old age, inventing their own history entirely from scratch while you just sit back and observe.

Reddit r/singularity · 2026-06-12

A developer created a zero-player civilization game where LLM agents autonomously farm, reproduce, build, and wage wars, driven by Maslow's hierarchy of needs, with emergent religious conflicts and societal collapses.

0 favorites 0 likes
#autonomous-agents

Arbor: Tree Search as a Cognition Layer for Autonomous Agents

arXiv cs.AI · 2026-06-12 Cached

Arbor introduces structured tree search as a cognition layer for autonomous agents, enabling multi-day, full-stack LLM inference optimization with up to 193% throughput-latency improvement over vendor baselines through a checks-and-balances multi-agent architecture.

0 favorites 0 likes
#autonomous-agents

Most people are using AI as a smarter search engine. The ones making real efficiency gains are using it as an agent. Here's the difference.

Reddit r/openclaw · 2026-06-11

This article contrasts two AI usage patterns: reactive search vs. autonomous agents, arguing that real efficiency gains come from delegating multi-step tasks to AI tools like OpenClaw. It notes that while most people stick with the simpler prompt-response loop, moving to agent-based workflows requires clear goal setting.

0 favorites 0 likes
#autonomous-agents

The productivity gap between "AI user" and "AI agent user" is bigger than I expected

Reddit r/artificial · 2026-06-11

A comparison between reactive AI usage and autonomous agents, highlighting significant time savings when using agents like OpenClaw for email and research tasks.

0 favorites 0 likes
#autonomous-agents

@elvissun: https://x.com/elvissun/status/2065035615800864954

X AI KOLs Timeline · 2026-06-11 Cached

Elvis Sun shares a detailed playbook on using AI coding agents with harness engineering and loss function development to autonomously solve complex engineering problems, demonstrating how to avoid common pitfalls like agent cheating.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback