open-source-llms

#open-source-llms

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

arXiv cs.CL ↗ · 2026-05-25 Cached

This paper introduces a red-teaming framework that measures the 'Overton Window' of political opinions open-source LLMs can express and evaluates how simple jailbreaks expand that range, finding systematic left-leaning biases and vulnerabilities across 30+ models.

0 favorites 0 likes

#open-source-llms

CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs

arXiv cs.CL ↗ · 2026-05-18 Cached

This paper demonstrates that small open-weight LLMs (<30B parameters) can achieve competitive interpretable translation quality estimation, including MQM error annotations and corrections, rivaling much larger proprietary models while preserving data privacy.

0 favorites 0 likes

open-source-llms

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs

Submit Feedback