Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

· 2024 · DOI 10.18653/v1/2024.acl-long.816

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

open at publisher browse 9 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models

cs.CL · 2026-04-13 · conditional · novelty 8.0

Large language models display the identifiable victim effect at roughly twice the human baseline, strongly amplified by instruction tuning and chain-of-thought prompting but inverted by reasoning-specialized models.

Polar: A Benchmark for Evaluating Political Bias in LLMs

cs.CL · 2026-06-11 · unverdicted · novelty 7.0

Polar is a new cross-context benchmark showing LLM political bias measurements are not fixed but vary with country, issue, model, and language.

Large Language Models are Perplexed by some Political Parties

cs.CL · 2026-06-04 · unverdicted · novelty 6.0

LLMs exhibit higher perplexity on far-right and nationalist party texts than social-democratic ones, consistent across models and languages with correlation to translation metrics.

Which Institutional Frameworks Do Chatbots Assume? Auditing Jurisdictional Defaults in Multilingual LLMs

cs.CL · 2026-05-29 · conditional · novelty 6.0

LLMs default to U.S. frameworks for English prompts and China frameworks for Chinese prompts on jurisdiction-underspecified legal-administrative queries, with the pattern holding across all seven tested models.

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

cs.CL · 2026-05-20 · unverdicted · novelty 6.0

An empirical red-teaming study measures political Overton Windows across more than 30 open-source LLMs from 10 families and finds left-leaning bias, inverse size correlation, regional variation, and variable jailbreak effectiveness.

Who Gets the Kidney? Human-AI Alignment, Indecision, and Moral Values

cs.CY · 2025-05-30 · unverdicted · novelty 6.0

LLMs deviate from human moral preferences in kidney allocation scenarios and rarely express indecision, though low-rank fine-tuning with few examples can improve both consistency and uncertainty calibration.

The Geography of Algorithmic Judgment: LLM Intermediaries, Place Identity, and Racial Steering in Housing Search

cs.LG · 2026-06-04 · unverdicted · novelty 5.0

Behavioral audit finds emergent, city-dependent racial steering in LLM housing recommendations that changes with user identity and preference context.

What Is The Political Content in LLMs' Pre- and Post-Training Data?

cs.CL · 2025-09-26 · unverdicted · novelty 5.0

Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.

How Value Induction Reshapes LLM Behaviour

cs.CL · 2026-05-08 · unverdicted · novelty 4.0

Inducing targeted values in LLMs through fine-tuning causes spillover to related or opposing values, boosts safety metrics, and increases anthropomorphic and sycophantic language across all tested values.

citing papers explorer

Showing 9 of 9 citing papers.

Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models cs.CL · 2026-04-13 · conditional · none · ref 22
Large language models display the identifiable victim effect at roughly twice the human baseline, strongly amplified by instruction tuning and chain-of-thought prompting but inverted by reasoning-specialized models.
Polar: A Benchmark for Evaluating Political Bias in LLMs cs.CL · 2026-06-11 · unverdicted · none · ref 56
Polar is a new cross-context benchmark showing LLM political bias measurements are not fixed but vary with country, issue, model, and language.
Large Language Models are Perplexed by some Political Parties cs.CL · 2026-06-04 · unverdicted · none · ref 33
LLMs exhibit higher perplexity on far-right and nationalist party texts than social-democratic ones, consistent across models and languages with correlation to translation metrics.
Which Institutional Frameworks Do Chatbots Assume? Auditing Jurisdictional Defaults in Multilingual LLMs cs.CL · 2026-05-29 · conditional · none · ref 17
LLMs default to U.S. frameworks for English prompts and China frameworks for Chinese prompts on jurisdiction-underspecified legal-administrative queries, with the pattern holding across all seven tested models.
How Far Will They Go? Red-Teaming Online Influence with Large Language Models cs.CL · 2026-05-20 · unverdicted · none · ref 8
An empirical red-teaming study measures political Overton Windows across more than 30 open-source LLMs from 10 families and finds left-leaning bias, inverse size correlation, regional variation, and variable jailbreak effectiveness.
Who Gets the Kidney? Human-AI Alignment, Indecision, and Moral Values cs.CY · 2025-05-30 · unverdicted · none · ref 55
LLMs deviate from human moral preferences in kidney allocation scenarios and rarely express indecision, though low-rank fine-tuning with few examples can improve both consistency and uncertainty calibration.
The Geography of Algorithmic Judgment: LLM Intermediaries, Place Identity, and Racial Steering in Housing Search cs.LG · 2026-06-04 · unverdicted · none · ref 35
Behavioral audit finds emergent, city-dependent racial steering in LLM housing recommendations that changes with user identity and preference context.
What Is The Political Content in LLMs' Pre- and Post-Training Data? cs.CL · 2025-09-26 · unverdicted · none · ref 35
Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.
How Value Induction Reshapes LLM Behaviour cs.CL · 2026-05-08 · unverdicted · none · ref 3
Inducing targeted values in LLMs through fine-tuning causes spillover to related or opposing values, boosts safety metrics, and increases anthropomorphic and sycophantic language across all tested values.

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer