skill-distillation

Tag

Cards List
#skill-distillation

EmoDistill: Offline Emotion Skill Distillation for Language Model Agents in Adversarial Negotiation

arXiv cs.CL · 2026-05-27 Cached

EmoDistill is an offline framework that distills emotional negotiation skills into language model agents using Implicit Q-Learning for emotion selection and LoRA-based supervised fine-tuning and judge policy optimization for emotion expression, achieving higher utility in adversarial negotiations.

0 favorites 0 likes
#skill-distillation

PANDO: Efficient Multimodal AI Agents via Online Skill Distillation

Hugging Face Daily Papers · 2026-05-26 Cached

PANDO is a web agent framework that improves efficiency through online skill distillation, reducing token usage by 58-61% while outperforming baselines on VisualWebArena tasks.

0 favorites 0 likes
#skill-distillation

@Voxyz_ai: just checked github trending, the #1 repo this week is a CLAUDE.md file. 44,465 new stars this week. a skill distilling…

X AI KOLs Timeline · 2026-04-19 Cached

A single CLAUDE.md file became GitHub's top-trending repo with 44k weekly stars by distilling Andrej Karpathy's LLM coding advice into four principles.

0 favorites 0 likes
← Back to home

Submit Feedback