Having Beer after Prayer? Measuring Cultural Bias in Large Language Models

10 Pith papers cite this work. Polarity classification is still indexing.

citing papers by year: 2026 (10)

representative citing papers

Steerable Cultural Preference Optimization of Reward Models

cs.CL · 2026-06-17 · unverdicted · novelty 5.0

SCPO is a steerable training method for reward models. On the PRISM and GlobalOpinionQA datasets, it improves minority cultural preference accuracy by up to 7 points and is up to 280% more data-efficient than standard finetuning.
