human-preferences

Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences

arXiv cs.LG · 2026-06-09

This position paper argues that large language models should learn personalized rather than aggregated human preferences, citing theoretical limits on preference aggregation from social choice theory and practical problems arising from demographic diversity. It proposes bounded personalization frameworks that respect individual autonomy while maintaining universal safety constraints.

SenseJudge: Human-Centric Preference-Driven Judgment Framework

arXiv cs.CL · 2026-06-03

SenseJudge is a human-centric framework for customizable LLM judging that adapts to diverse user preferences, outperforming existing methods. It also introduces SenseBench, a benchmark derived from real-world multi-turn interactions.

@brianchristian: Personal update: Today I officially join @CHAI_Berkeley as a full-time researcher. After almost ten years of affiliatio…

X AI KOLs Timeline · 2026-06-01

Brian Christian announces that he has officially joined CHAI Berkeley as a full-time researcher after nearly a decade of affiliation, building on his PhD work on AI representation of human preferences.

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

Hugging Face Daily Papers · 2026-05-31

This paper introduces 3DCodeBench, a benchmark for evaluating vision-language models on procedural 3D modeling via code, and 3DCodeArena, a ranking platform based on pairwise human preferences.

HP-Edit: A Human-Preference Post-Training Framework for Image Editing

Hugging Face Daily Papers · 2026-04-21

HP-Edit introduces a post-training framework that aligns diffusion-based image editing models with human preferences via RLHF, using a new 50K real-world dataset and an automatic VLM-based evaluator.
