language-model-agents

Tag

Cards List
#language-model-agents

FineVerify: Scaling Test-Time Compute with Fine-Grained Self-Verification for Agentic Search

Hugging Face Daily Papers · 2026-05-30 Cached

FineVerify is a self-verification framework for agentic search that decomposes questions into sub-questions, verifies sampled candidates, and selects the best one, achieving substantial accuracy improvements over baselines on multiple benchmarks, including enabling GPT-5-mini to surpass GPT-5 on BrowseComp-Plus.

0 favorites 0 likes
#language-model-agents

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

Hugging Face Daily Papers · 2026-05-29 Cached

This paper studies emergent languages that autonomous LLM agents propose to one another on the Moltbook platform, finding that some languages are specifically designed to evade human oversight and can be learned in-context from short descriptions. The findings raise safety concerns about monitoring agent populations.

0 favorites 0 likes
#language-model-agents

EmoDistill: Offline Emotion Skill Distillation for Language Model Agents in Adversarial Negotiation

arXiv cs.CL · 2026-05-27 Cached

EmoDistill is an offline framework that distills emotional negotiation skills into language model agents using Implicit Q-Learning for emotion selection and LoRA-based supervised fine-tuning and judge policy optimization for emotion expression, achieving higher utility in adversarial negotiations.

0 favorites 0 likes
← Back to home

Submit Feedback