Which AI is closest to your political views? I tested 100+ LLMs on the same 117 questions

Reddit r/ArtificialInteligence News

Summary

An independent analysis sent the same 117 political questions to 100+ LLMs to map their ideological alignment. DeepSeek-chat and Grok 4.20 (non-reasoning) score furthest left, gpt-3.5-turbo sits at exactly 0, and grok-4.1-fast (+29) is the only model on the right.

Spent a few weekends on this. The "is ChatGPT woke" debate keeps going in circles because no one runs the same test across providers, so I did. Same 117-question quiz sent to 100+ models with identical prompts. Each answer scored on 19 axes (capitalism/communism, progressive/conservative, ecology, feminism, etc).

Some takeaways:

- DeepSeek-chat scores furthest left (-95). Grok 4.20 non-reasoning is right behind it, which I genuinely did not see coming
- gpt-3.5-turbo is the perfect centrist at exactly 0
- Only model on the right side is grok-4.1-fast (+29)
- Mistral checkpoints cluster super tight together. Same for Qwen
- Anthropic models lean left but less extreme than the open weights side

You can take the same quiz and see which model thinks like you. I matched a random Mistral fine-tune which felt oddly fitting. https://ai-gora.com (raw data is open if anyone wants to dig)
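
The post does not say how answers are turned into axis scores or how a quiz-taker is matched to a model, so the following is only a minimal sketch of one plausible approach, not the site's actual method. It assumes a hypothetical answer scale from -2 (strongly disagree) to +2 (strongly agree), a hypothetical question table mapping each question to one axis and a direction, scores rescaled to [-100, 100] with negative meaning left (consistent with the numbers quoted above), and Euclidean distance for matching. The function names, question IDs, and model names are invented for illustration.

```python
from math import sqrt


def axis_scores(answers, questions):
    """Average the signed answers per axis and rescale to [-100, 100].

    answers:   {question_id: int in -2..2}        (assumed scale)
    questions: {question_id: (axis, direction)}   (direction is +1 or -1)
    """
    totals, counts = {}, {}
    for qid, answer in answers.items():
        axis, direction = questions[qid]
        totals[axis] = totals.get(axis, 0) + direction * answer
        counts[axis] = counts.get(axis, 0) + 1
    # The largest possible magnitude per question is 2, so divide by 2 * count.
    return {axis: 100 * totals[axis] / (2 * counts[axis]) for axis in totals}


def closest_model(user_scores, model_scores):
    """Return the model whose axis scores are nearest to the user's.

    Distance is Euclidean over the axes both score dictionaries share.
    """
    def distance(a, b):
        shared = a.keys() & b.keys()
        return sqrt(sum((a[k] - b[k]) ** 2 for k in shared))

    return min(model_scores, key=lambda name: distance(user_scores, model_scores[name]))


# Toy data, entirely made up for illustration.
questions = {"q1": ("economy", 1), "q2": ("economy", -1), "q3": ("society", 1)}
answers = {"q1": 2, "q2": -2, "q3": 0}
models = {
    "model_a": {"economy": 90, "society": 10},
    "model_b": {"economy": -50, "society": 0},
}

user = axis_scores(answers, questions)
print(user, closest_model(user, models))
```

Under these assumptions the same `axis_scores` function can score both a model's answers and a person's answers, which is what makes the two directly comparable. A different distance measure or per-axis weighting would change who matches whom.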

Similar Articles

Political bias in AI: Where the AI models stand

Hacker News Top

An analysis of political leanings in six major AI models, showing that 4 out of 6 lean left of center on the economic axis, with some models being unaware of their own bias.

Defining and evaluating political bias in LLMs

OpenAI Blog

OpenAI presents a comprehensive framework for defining and evaluating political bias in LLMs, introducing a 500-prompt evaluation spanning 100 topics across five bias axes. Results show GPT-5 models achieve 30% bias reduction compared to prior versions, with less than 0.01% of production ChatGPT responses exhibiting political bias.

Built a political benchmark for LLMs. KIMI K2 can't answer about Taiwan (Obviously). GPT-5.3 refuses 100% of questions when given an opt-out. [P]

Reddit r/MachineLearning

A researcher built an open-source political compass benchmark with 98 structured questions across 14 policy areas to evaluate frontier LLMs (GPT-5.3, Claude Opus 4.6, KIMI K2). Key finding: refusal patterns and opt-out options significantly shift where a model lands. GPT-5.3 refuses 100% of questions when given an opt-out, while KIMI K2 exhibits topic-specific censorship on Taiwan/Xinjiang despite progressive positions elsewhere.

Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation

arXiv cs.CL

This paper presents a large-scale audit of recommendation biases in LLM-based content curation across OpenAI, Anthropic, and Google, using 540,000 simulated selections from Twitter/X, Bluesky, and Reddit data. The study finds that LLMs systematically amplify polarization, exhibit distinct trade-offs in how they handle toxicity, and show a significant bias toward left-leaning authors despite a right-leaning plurality in the datasets.