philosophical-alignment

#philosophical-alignment

StoicLLM: Preference Optimization for Philosophical Alignment in Small Language Models

arXiv cs.CL ↗ · 2026-05-13 Cached

This research paper investigates using preference optimization (ORPO, AlphaPO) on small language models like Llama-3.2-3B and Qwen-3-4B to align them with Stoic philosophy using micro-datasets. The study finds that while 300 examples can effectively encode Stoic virtues, small models still struggle with outward-facing cosmopolitan duties.

0 favorites 0 likes

philosophical-alignment

StoicLLM: Preference Optimization for Philosophical Alignment in Small Language Models

Submit Feedback