life-simulation

#life-simulation

Online Agent-as-a-Judge: Situation-Generating Evaluation for Interactive Agents

arXiv cs.AI · 2026-06-09

Proposes Online Agent-as-a-Judge, an evaluation framework that uses an in-world evaluator agent to actively generate situations for testing interactive social agents, improving coverage and reliability over passive methods.

@dair_ai: // Life Simulation in Agent Societies // One of the more ambitious agent-society testbeds to land this month, and it ar…

X AI KOLs Following · 2026-06-08

Agentopia is a comprehensive framework for long-term life simulation in multi-agent societies, where 100 LLM-powered agents autonomously pursue personal growth and social relationships over 10 simulated years. The work studies emergent social behaviors and uses life reward training to improve LLM role-playing capabilities.

One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents

arXiv cs.AI · 2026-05-25

Introduces PCSP, a single RL policy conditioned on frozen LLM embeddings of persona descriptions, enabling scalable, real-time persona-traceable NPC control in life simulation games. Experiments show zero-shot persona identification and behavioral alignment, with faster inference than LLM baselines.
