co-evolution

#co-evolution

@Phoenixyin13: Incredible! This Red Queen Gödel Machine from NVIDIA, Cambridge University, and other teams is absolutely one of the most important AI papers I've seen recently. This time, the paper directly targets the core bottleneck of self-improving AI: once the evaluator is fixed, agents either game the system or quickly stagnate...

X AI KOLs Timeline ↗ · 21h ago Cached

The Red Queen Gödel Machine paper from NVIDIA, Cambridge University, and other teams addresses a bottleneck of recursive self-improvement by co-evolving agents and their evaluators. It surpasses the existing state of the art on tasks such as coding and paper writing, offering a methodology for controlled open-ended AI evolution.


The Red Queen Gödel Machine: Co-Evolving Agents and Their Evaluators

arXiv cs.LG ↗ · 2d ago Cached

This paper introduces the Red Queen Gödel Machine (RQGM), an evolutionary framework for recursive self-improvement under non-stationary utilities, where agents and evaluators co-evolve, improving performance on coding tasks, scientific writing, and Olympiad-level proof grading.


@hanakoxbt: An MIT team just dropped a 24-page PDF on "Self-Evolving Skills" for Claude Code agents. Anthropic's own skill-creator …

X AI KOLs Timeline ↗ · 3d ago Cached

An MIT team released a paper on self-evolving skills for Claude Code agents. Its Generate-Test-Verify-Co-Evolve framework achieves a 71.1% pass rate, surpassing Anthropic's skill-creator by 37 points.


Human understanding is still needed more than ever

Reddit r/ArtificialInteligence ↗ · 3d ago

A commentary emphasizing that despite AI advances, human understanding remains crucial for safe and humane deployment, urging users to verify AI outputs and treat AI with respect.


Synthetic Counteradaptation: A Principle of Human-AI Co-evolution

arXiv cs.AI ↗ · 2026-06-16 Cached

Introduces the concept of synthetic counteradaptation, where humans and AI systems co-evolve by adapting to each other's strategies, illustrated through examples from Go, social interactions, and geopolitical simulations.


Beyond Static Evaluation: Co-Evolutionary Mechanisms for LLM-Driven Strategy Evolution in Adversarial Games

arXiv cs.AI ↗ · 2026-06-10 Cached

This paper proposes three co-evolutionary mechanisms (evaluator co-evolution, hierarchical deep evaluation, and weakness pressure) for LLM-driven code evolution in adversarial multi-agent games, achieving state-of-the-art results on the MCTF 2026 maritime capture-the-flag task.


EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

arXiv cs.AI ↗ · 2026-06-03 Cached

EvoTrainer introduces an autonomous training framework that co-evolves LLM policies and training harnesses through empirical feedback, outperforming human-engineered RL baselines on mathematical reasoning, code generation, and long-horizon software engineering tasks.


LLM-Driven Co-Evolutionary Automated Heuristic Design for Bi-Component Coupled Combinatorial Optimization

arXiv cs.AI ↗ · 2026-06-02 Cached

Proposes CoEvo-AHD, an LLM-driven dual-population co-evolutionary framework for automated heuristic design in bi-component coupled combinatorial optimization problems. It leverages LLMs to co-evolve route and selection operators, using cooperative evaluation and joint crossover to discover complementary heuristics for problems like TTP and TPP.


HarnessForge: Joint Harness and Policy Evolution for Adaptive Agent Systems

Hugging Face Daily Papers ↗ · 2026-06-01 Cached

HarnessForge proposes a meta-adaptive framework for evolving LLM agent systems by jointly optimizing the execution harness and reasoning policy, achieving consistent improvements on Qwen3 backbones across five benchmarks.


SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Hugging Face Daily Papers ↗ · 2026-05-29 Cached

SCOPE is a self-play framework for open-ended tasks that co-evolves Challenger and Solver policies, achieving gains of up to 10.4 points on benchmarks without external supervision.


SEAL: Synergistic Co-Evolution of Agents and Learning Environments

arXiv cs.CL ↗ · 2026-05-26 Cached

SEAL proposes a closed-loop framework for jointly evolving LLM agents and their training environments, using diagnosis-guided labels to align both sides. It achieves substantial gains in multi-turn tool-use tasks with only 400 training samples, demonstrating improved robustness and out-of-distribution transfer.


SEAL: Synergistic Co-Evolution of Agents and Learning Environments

Hugging Face Daily Papers ↗ · 2026-05-23 Cached

SEAL is a closed-loop co-evolution framework for interactive tool-use agents that addresses Agent-Environment Misalignment by synchronizing policy and environment updates using on-policy trajectories and turn-level diagnosis.


MetaAgent-X: Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

arXiv cs.AI ↗ · 2026-05-15 Cached

MetaAgent-X introduces an end-to-end reinforcement learning framework that jointly optimizes the design and execution of automatic multi-agent systems, overcoming the frozen-executor ceiling and achieving up to 21.7% gains over existing baselines.


RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Hugging Face Daily Papers ↗ · 2026-05-13 Cached

RoboEvolve is a framework that co-evolves a VLM planner and a VGM simulator for robotic manipulation, achieving data-efficient training from only 500 unlabeled seed images along with robust continual learning.


CoCoDA: Co-evolving Compositional DAG for Tool-Augmented Agents

arXiv cs.AI ↗ · 2026-05-12 Cached

This paper introduces CoCoDA, a framework that uses a co-evolving compositional Directed Acyclic Graph (DAG) to manage tool libraries for augmented agents. It enables small language models to efficiently retrieve and compose tools, allowing an 8B model to match or exceed the performance of a 32B model on reasoning benchmarks.


GAMBIT: A Three-Mode Benchmark for Adversarial Robustness in Multi-Agent LLM Collectives

arXiv cs.CL ↗ · 2026-05-12 Cached

This paper introduces GAMBIT, a benchmark for evaluating adversarial robustness in multi-agent LLM collectives, featuring adaptive imposters and recalibration modes to address the limitations of existing shallow evaluations.


G-Zero: Self-Play for Open-Ended Generation from Zero Data

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

This paper introduces G-Zero, a verifier-free framework that enables autonomous large language model self-improvement through co-evolutionary training using intrinsic rewards and hint-based guidance. It aims to overcome the limitations of proxy LLM judges in open-ended tasks by deriving supervision from internal distributional dynamics.


TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems

Hugging Face Daily Papers ↗ · 2026-05-10 Cached

This paper introduces TacoMAS, a framework for test-time co-evolution of agent capabilities and communication topology in LLM-based multi-agent systems. It demonstrates that jointly adapting fast capability loops and slow topology loops improves performance and stability over existing baselines.
