Counterfactual prompting effects on LLMs are often indistinguishable from those caused by meaning-preserving paraphrases, causing most previously reported demographic sensitivities to disappear under proper statistical comparison.
arXiv preprint arXiv:2307.14117 (2023)
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
LLM-generated image aesthetics labels correlate with Mercari user behavior and produced sales growth in an online experiment.
citing papers explorer
-
Compared to What? Baselines and Metrics for Counterfactual Prompting
Counterfactual prompting effects on LLMs are often indistinguishable from those caused by meaning-preserving paraphrases, causing most previously reported demographic sensitivities to disappear under proper statistical comparison.
-
Image Score: Learning and Evaluating Human Preferences for Mercari Search
LLM-generated image aesthetics labels correlate with Mercari user behavior and produced sales growth in an online experiment.