Tag
This position paper argues that large language models should learn from personalized rather than aggregated human preferences, highlighting theoretical limitations from social choice theory and practical issues from demographic diversity. It proposes bounded personalization frameworks that respect individual autonomy while maintaining universal safety constraints.
SenseJudge is a human-centric framework for customizable LLM judging that adapts to diverse user preferences, outperforming existing methods. It also introduces SenseBench, a benchmark derived from real-world multi-turn interactions.
Brian Christian announces his official full-time researcher position at CHAI Berkeley after nearly a decade of affiliation, building on his PhD work on AI representation of human preferences.
This paper introduces 3DCodeBench, a benchmark for evaluating vision-language models on procedural 3D modeling via code, and 3DCodeArena, a ranking platform based on pairwise human preferences.
HP-Edit introduces a post-training framework that aligns diffusion-based image editing models with human preferences via RLHF, using a new 50K real-world dataset and an automatic VLM-based evaluator.