agentic-framework

#agentic-framework

@_akhaliq: paper:

X AI KOLs Following ↗ · 2d ago Cached

This paper proposes Robust-TO, an agentic video understanding framework that integrates per-frame trustworthiness to address the Blind Trust Problem, achieving significant accuracy gains under realistic perturbations.

Confidence-Aware Tool Orchestration for Robust Video Understanding

Hugging Face Daily Papers ↗ · 3d ago Cached

Robust-TO addresses the Blind Trust Problem in video reasoning by integrating per-frame trustworthiness into an agentic framework, improving accuracy under realistic perturbations through calibrated evidence weighting and reliability-aware reasoning.

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Hugging Face Daily Papers ↗ · 3d ago Cached

Qwen-Image-Agent proposes a unified agentic framework that addresses the context gap in text-to-image generation by integrating planning, reasoning, searching, and memory mechanisms. It introduces IA-Bench for evaluation and achieves state-of-the-art performance.

OmniPath: A Multi-Modal Agentic Framework for Auditing Wheelchair Accessibility

arXiv cs.AI ↗ · 4d ago Cached

OmniPath is a multi-modal agentic framework that combines OpenStreetMap network topology with aerial LiDAR data to audit wheelchair accessibility by analyzing physical barriers like slope and surface discontinuities at high resolution, validated against field surveys.

@omarsar0: Eve does feel like the "Next.js for agents" as @rauchg puts it. You got to check it out!

X AI KOLs Following ↗ · 5d ago Cached

Eve, a new agentic framework from Vercel, is being compared to 'Next.js for agents' for its file-based approach to tools, skills, and evals, enabling rapid agent building with TypeScript.

@Zhongyi_Zhou_: ML optimizes via mathematical gradients; Loop Engineering needs textual "gradients"! Introducing ToolGrad: an agentic f…

X AI KOLs Timeline ↗ · 2026-06-17 Cached

Introduces ToolGrad, an agentic framework that generates, evaluates, and refines tool-use trajectories using textual 'gradients', achieving a near-100% pass rate and lower dataset-generation cost. Accepted at ACL 2026.

RL-Index: Reinforcement Learning for Retrieval Index Reasoning

Hugging Face Daily Papers ↗ · 2026-06-15 Cached

RL-Index proposes a reinforcement learning-based agentic indexing framework that shifts reasoning from query time to the indexing stage by augmenting documents with LLM-generated rationales, improving retrieval effectiveness and reducing online latency.

AlloSpatial: Agentic Harness Framework for Spatial Reasoning in Foundation Models

Hugging Face Daily Papers ↗ · 2026-06-08 Cached

AlloSpatial is an agentic framework that enhances spatial reasoning in foundation models by converting egocentric observations into structured allocentric representations, using cognitive mapping and tool-use reasoning. It improves performance by 5-18% on benchmarks and outperforms larger models through cold-start reinforcement learning.

ProSPy: A Profiling-Driven SQL-Python Agentic Framework for Enterprise Text-to-SQL

arXiv cs.CL ↗ · 2026-06-05 Cached

ProSPy is a profiling-driven SQL-Python agentic framework for enterprise text-to-SQL that structures reasoning into four stages: automatic profiling, schema pruning, a dialect-agnostic SQL interface, and Python-based analysis. With Claude-4.5-Opus, it achieves execution accuracies of 60.15% on Spider 2.0-Lite and 60.51% on Spider 2.0-Snow, outperforming strong baselines.

QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation

arXiv cs.CL ↗ · 2026-06-05 Cached

QueryAgent-R1 is an agentic framework that bridges query generation and product retrieval in e-commerce using reinforcement learning and memory abstraction, improving query CTR by 2.9% and CVR by 3.1% in online tests.

@rohanpaul_ai: Another great paper from Google. Shows general LLMs can solve formal math by planning proofs and checking each step. Ra…

X AI KOLs Following ↗ · 2026-06-04 Cached

A new Google paper introduces LEAP, an agentic framework that enables general LLMs to solve formal math problems by planning proofs and checking each step, raising the solve rate from under 10% to 70% on Lean-IMO-Bench and solving all 12 problems from the 2025 Putnam Competition.

@tom_doerr: Semi-autonomous agents optimize codebases through parallel experimentation https://github.com/evo-hq/evo

X AI KOLs Timeline ↗ · 2026-06-03 Cached

Evo is an open-source tool in which semi-autonomous agents optimize a codebase through parallel experimentation, using tree search and multiple subagents to discover and improve metrics.

LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks

arXiv cs.AI ↗ · 2026-06-03 Cached

LEAP is an agentic framework that enables general-purpose LLMs to achieve state-of-the-art performance in formal theorem proving in Lean, solving all 12 problems from the 2025 Putnam Competition and boosting formal solve rates from below 10% to 70% on a new benchmark (Lean-IMO-Bench), surpassing specialized systems.

MapAgent: An Industrial-Grade Agentic Framework for City-scale Lane-level Map Generation

Hugging Face Daily Papers ↗ · 2026-06-03 Cached

MapAgent is an industrial-grade agentic framework that combines vision-language processing with constraint-aware reasoning to automatically produce specification-compliant lane-level maps, achieving over 95% automation in Baidu Maps for more than 360 cities.

MOSAIC: Modular Orchestration for Structured Agentic Intelligence and Composition

arXiv cs.AI ↗ · 2026-06-02 Cached

MOSAIC introduces a structured agentic framework for automated data science that uses memory-grounded model selection and workflow construction, validated on financial time-series tasks. It outperforms AutoML and agentic baselines.

HypoAgent: An Agentic Framework for Interactive Abductive Hypothesis Generation over Knowledge Graphs

arXiv cs.AI ↗ · 2026-06-01 Cached

HypoAgent is an agentic framework for interactive abductive hypothesis generation over knowledge graphs, integrating three agents to handle evolving user intents and fine-grained diagnosis, achieving state-of-the-art performance.

Autonomous Multiagent pipeline to create any app, just give an idea.

Reddit r/openclaw ↗ · 2026-05-30

ACO System is an open-source multi-agent framework that autonomously manages the software development pipeline from GitHub Issue to merged PR using six specialized AI agents, with a deterministic architect gate to prevent bad PRs.

Harmonizing Real-Time Constraints and Long-Horizon Reasoning: An Asynchronous Agentic Framework for Dynamic Scheduling

arXiv cs.AI ↗ · 2026-05-29 Cached

This paper introduces RACE-Sched, an asynchronous agentic framework that decouples real-time reactive scheduling from deliberative LLM-based reasoning to handle dynamic job shop scheduling problems, outperforming deep reinforcement learning (DRL) and other baselines.

From Residuals to Reasons: LLM-Guided Mechanism Inference from Tabular Data

arXiv cs.LG ↗ · 2026-05-25 Cached

Introduces Multi-Agent Residual In-Context Learning (MARICL), an agentic framework that uses LLM agents to analyze residuals from a base model on tabular data, hypothesize missing structure, and produce explicit correction terms via textual gradient optimization. Across nine benchmarks, MARICL consistently improves over its base model and demonstrates mechanistic generalization in cell-free protein predictions.

RMA: an Agentic System for Research-Level Mathematical Problems

arXiv cs.AI ↗ · 2026-05-25 Cached

Research Math Agents (RMA) is an agentic framework for automated reasoning on research-level mathematical problems, achieving state-of-the-art results on the First Proof benchmark by solving 8 out of 10 problems, outperforming strong baselines like GPT-5.2R and Aletheia.
