implicit-q-learning

EmoDistill: Offline Emotion Skill Distillation for Language Model Agents in Adversarial Negotiation

arXiv cs.CL · 2026-05-27

EmoDistill is an offline framework that distills emotional negotiation skills into language model agents. It splits the problem in two: Implicit Q-Learning (IQL) handles emotion selection, while LoRA-based supervised fine-tuning and judge policy optimization handle emotion expression. The resulting agents achieve higher utility in adversarial negotiations.
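For readers unfamiliar with Implicit Q-Learning, the sketch below illustrates its core ingredient, expectile regression, in which a state value V is fit to an upper expectile of logged Q-values so that no out-of-dataset action ever has to be evaluated. This is a generic illustration of IQL and not EmoDistill's implementation: the function names, the `tau` default, and the toy Q-values are all assumptions made for the example.

```python
from typing import Sequence


def expectile_loss(diffs: Sequence[float], tau: float = 0.7) -> float:
    """Asymmetric squared loss |tau - 1(u < 0)| * u^2, averaged over diffs.

    In IQL, each diff is u = Q(s, a) - V(s) for a logged (s, a) pair.
    """
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must be in (0, 1)")
    total = 0.0
    for u in diffs:
        weight = (1.0 - tau) if u < 0 else tau
        total += weight * u * u
    return total / len(diffs)


def fit_expectile(q_values: Sequence[float], tau: float = 0.7,
                  lr: float = 0.1, steps: int = 500) -> float:
    """Fit a scalar V by gradient descent on expectile_loss(Q - V)."""
    v = 0.0
    for _ in range(steps):
        grad = 0.0
        for q in q_values:
            u = q - v
            weight = (1.0 - tau) if u < 0 else tau
            grad += -2.0 * weight * u  # d/dV of weight * (q - V)^2
        v -= lr * grad / len(q_values)
    return v


# Hypothetical Q-values for three logged emotion choices in one state.
toy_q = [0.0, 1.0, 2.0]
v_mean = fit_expectile(toy_q, tau=0.5)   # tau = 0.5 recovers the mean
v_upper = fit_expectile(toy_q, tau=0.9)  # larger tau leans toward the max
```

With `tau = 0.5` the fitted value is the plain mean of the logged Q-values. Raising `tau` pushes it toward the best logged action, which is how IQL approximates a maximum over actions using only the offline data.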
