Recognition: no theorem link
Robust Testing Of the Allais Paradox By Paired Choices vs. Paired Valuations
Pith reviewed 2026-05-10 18:27 UTC · model grok-4.3
The pith
A strong paired choice test for the common ratio effect stays unbiased under stochastic choice and shows the effect remains prevalent in data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
McGranaghan et al. show that standard paired choice tests for the common ratio effect are structurally biased when choice is stochastic and propose valuation tests as a robust alternative, finding no systematic evidence for the effect. We argue that valuation tests are inherently biased and lack predictive power under standard expected utility assumptions. In contrast, we advocate for a strong paired choice test, proving it remains robustly unbiased across common models of stochastic choice. Applying this strong test to existing experimental data, we find that the common ratio effect remains highly prevalent.
What carries the argument
The strong paired choice test, a reinforced version of binary choice comparisons between lotteries that is proven unbiased for detecting the common ratio effect across stochastic choice models.
If this is right
- Valuation tests will systematically under-detect violations of expected utility because they are biased even when the true model satisfies EU.
- The common ratio effect survives as a robust empirical finding once testing procedures account for stochastic choice.
- Stochastic choice models alone do not overturn the Allais paradox in the common ratio domain.
- Existing experimental data on probability scaling in choices continue to support non-expected-utility behavior when analyzed with the strong test.
- Testing protocols for other Allais-type violations should prioritize strong paired choice designs over valuation elicitations.
Where Pith is reading between the lines
- The result implies that explanations for the Allais paradox must go beyond simple randomness in choice and address systematic preference patterns.
- Similar robustness checks could be applied to the common consequence effect or other EU violations to test whether they also survive unbiased methods.
- The bias in valuation tests may arise because continuous value reports introduce different noise structures than binary choices, a distinction worth modeling explicitly.
- If the strong test is adopted widely, meta-analyses of risk preferences could be updated to down-weight older valuation-based studies.
Load-bearing premise
The common stochastic choice models such as logit and probit cover the relevant range of realistic behavior and that reanalysis of existing datasets does not introduce selection or application biases.
What would settle it
A new experiment that applies the strong paired choice test to fresh subjects and finds no systematic common ratio effect would undermine the claim that the effect remains highly prevalent under unbiased methods.
Figures
read the original abstract
McGranaghan, Nielsen, O'Donoghue, Somerville, and Sprenger [2024] show that standard paired choice tests for the common ratio effect are structurally biased when choice is stochastic, proposing valuation tests as a robust alternative. Using valuation tests, they find no systematic evidence for the common ratio effect, seemingly overturning much of the extant literature. We evaluate this conclusion in light of stochastic choice theory. We argue that valuation tests are inherently biased and lack predictive power under standard expected utility assumptions. In contrast, we advocate for a ``strong'' paired choice test, proving it remains robustly unbiased across common models of stochastic choice. Applying this strong test to existing experimental data, we find that the common ratio effect remains highly prevalent.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper critiques McGranaghan et al. (2024) for concluding that the common ratio effect lacks systematic evidence when using valuation tests, which they proposed as robust to stochastic choice. It argues instead that valuation tests are inherently biased and lack predictive power under standard expected utility assumptions with stochastic choice. The authors advocate a 'strong' paired choice test, prove it is robustly unbiased across common stochastic choice models (e.g., logit, probit), and reapply it to existing experimental data to conclude that the common ratio effect remains highly prevalent.
Significance. If the unbiasedness proof for the strong paired choice test holds and the reanalysis avoids selection biases, the result would restore the common ratio effect as a prevalent phenomenon in the Allais paradox literature, challenging recent valuation-based findings and underscoring the sensitivity of anomaly detection to test design under stochastic choice. The work provides a model-independent theoretical argument and an empirical reanalysis that could shift methodological recommendations in behavioral economics experiments.
major comments (2)
- [Empirical reanalysis section] The reanalysis of pre-existing datasets to apply the strong paired choice test: subsetting observations to satisfy the stricter pairing structure (or imputing missing pairs) risks selection on unobservables or altering the effective sample in ways that could inflate prevalence estimates. The manuscript should detail the exact matching procedure, report sample sizes before/after subsetting, and include robustness checks (e.g., comparing to full samples or alternative pairings) to address this load-bearing concern for the empirical claim.
- [Theoretical section on strong test] The proof of robustness for the strong paired choice test across stochastic models: while the abstract states it remains unbiased for common models like logit and probit, the derivation must explicitly enumerate the full set of models covered and demonstrate that the test statistic's expectation is zero under each (independent of the common ratio violation). If any realistic stochastic model is omitted, the 'robustly unbiased' claim is weakened.
minor comments (2)
- [Introduction] Clarify notation for the 'strong' paired choice test versus standard paired choice tests early in the paper to avoid confusion with the McGranaghan et al. terminology.
- [Comparison table] Add a table summarizing the key properties (bias, power) of valuation tests, standard paired choice tests, and the proposed strong test under EU and stochastic choice.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments. We address each major comment point by point below, indicating where revisions will be made to improve the manuscript.
read point-by-point responses
-
Referee: [Empirical reanalysis section] The reanalysis of pre-existing datasets to apply the strong paired choice test: subsetting observations to satisfy the stricter pairing structure (or imputing missing pairs) risks selection on unobservables or altering the effective sample in ways that could inflate prevalence estimates. The manuscript should detail the exact matching procedure, report sample sizes before/after subsetting, and include robustness checks (e.g., comparing to full samples or alternative pairings) to address this load-bearing concern for the empirical claim.
Authors: We agree that the empirical reanalysis requires additional transparency to address potential selection concerns. In the revised manuscript, we will provide a detailed description of the exact matching procedure used to subset observations for the strong paired choice test. We will report sample sizes before and after subsetting for each dataset. We will also add robustness checks, such as comparisons to the full samples and alternative pairing methods, to confirm that the prevalence estimates of the common ratio effect are not driven by the subsetting process. These revisions will directly strengthen the empirical section. revision: yes
-
Referee: [Theoretical section on strong test] The proof of robustness for the strong paired choice test across stochastic models: while the abstract states it remains unbiased for common models like logit and probit, the derivation must explicitly enumerate the full set of models covered and demonstrate that the test statistic's expectation is zero under each (independent of the common ratio violation). If any realistic stochastic model is omitted, the 'robustly unbiased' claim is weakened.
Authors: We thank the referee for this suggestion to enhance the explicitness of the theoretical proof. The current derivation establishes that the strong paired choice test is unbiased under standard stochastic choice models, including logit and probit. In the revised manuscript, we will explicitly enumerate the full set of models covered (logit, probit, and other common variants such as tremble models) and include step-by-step derivations showing that the expectation of the test statistic is zero under each model, independent of any common ratio violation. This will make the robustness claim more precise. revision: yes
Circularity Check
Theoretical unbiasedness proof and external-data reanalysis are independent of each other
full rationale
The paper's core derivation is a mathematical proof that a proposed 'strong' paired-choice test statistic remains unbiased under standard stochastic choice models (logit, probit, etc.). This proof is presented as a first-principles result and does not rely on fitting parameters to the target data or on self-citations for its validity. The subsequent empirical claim—that the common-ratio effect is prevalent—arises from applying the already-proven test to pre-existing experimental datasets collected by other researchers. No equation reduces to its own input by construction, no fitted quantity is relabeled as a prediction, and no load-bearing premise collapses to a self-citation chain. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Common models of stochastic choice (logit, probit, etc.) describe realistic choice behavior.
- domain assumption Valuation tasks lack predictive power under standard expected utility assumptions when choice is stochastic.
invented entities (1)
-
Strong paired choice test
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Agranov, M. and P. Ortoleva (2017): Stochastic choice and preferences for randomization, Journal of Political Economy, 125, 40--68
2017
-
[2]
Allais, M. (1953): Le comportement de l'homme rationnel devant le risque: critique des postulats et axiomes de l' \'e cole am \'e ricaine, Econometrica: journal of the Econometric Society, 503--546
1953
-
[3]
Apesteguia, J. and M. A. Ballester (2018): Monotone stochastic choice models: The case of risk and time preferences, Journal of Political Economy, 126, 74--106
2018
-
[4]
Ballinger, T. P. and N. T. Wilcox (1997): Decisions, Error and Heterogeneity, The Economic Journal, 107, 1090--1105
1997
-
[5]
Barberis, N. C. (2013): Thirty years of prospect theory in economics: A review and assessment, Journal of economic perspectives, 27, 173--196
2013
-
[6]
Ben-Akiva, M. E. (1973): Structure of passenger travel demand models. Ph.D. thesis, Massachusetts Institute of Technology
1973
-
[7]
Levinsohn, and A
Berry, S., J. Levinsohn, and A. Pakes (1995): Automobile Prices in Market Equilibrium, Econometrica, 63, 841--890
1995
-
[8]
Panchenko, and A
Blavatskyy, P., V. Panchenko, and A. Ortmann (2023): How common is the common-ratio effect? Experimental Economics, 26, 253–272
2023
-
[9]
(1995): Individual decision making, in The handbook of experimental economics, ed
Camerer, C. (1995): Individual decision making, in The handbook of experimental economics, ed. by J. H. Kagel and A. E. Roth, Princeton University Press, 587--704
1995
-
[10]
Fishburn, P. C. (1978): Choice probabilities and choice functions, Journal of Mathematical Psychology, 18, 205--219
1978
-
[11]
Gul, F. and W. Pesendorfer (2006): Random expected utility, Econometrica, 74, 121--146
2006
-
[12]
Hausman, J. A. and D. A. Wise (1978): A conditional probit model for qualitative choice: Discrete decisions recognizing interdependence and heterogeneous preferences, Econometrica: Journal of the econometric society, 403--426
1978
-
[13]
He, J. and P. Natenzon (2024): Moderate utility, American Economic Review: Insights, 6, 176--195
2024
-
[14]
Hey, J. D. (2001): Does repetition improve consistency? Experimental economics, 4, 5--54
2001
-
[15]
Kahneman, D. and A. Tversky (1979): Prospect Theory. An Analysis of Decision under Uncertainty, Econometrica, 47, 263--291
1979
-
[16]
(2005): Modelling the Stochastic Component of Behaviour in Experiments: Some Issues for the Interpretation of Data, Experimental Economics, 8, 301–323
Loomes, G. (2005): Modelling the Stochastic Component of Behaviour in Experiments: Some Issues for the Interpretation of Data, Experimental Economics, 8, 301–323
2005
-
[17]
Machina, M. J. (1987): Choice under uncertainty: Problems solved and unsolved, Journal of Economic Perspectives, 1, 121--154
1987
-
[18]
--- -.1pt --- -.1pt --- (2008): Non-expected utility theory, in The New Palgrave Dictionary of Economics, Springer, 1--14
2008
-
[19]
--- -.1pt --- -.1pt --- (2018): Non-expected utility theory, in The New Palgrave Dictionary of Economics, Springer, 9570--9582
2018
-
[20]
(1978): Modeling the Choice of Residential Location, Spatial Interaction Theory and Planning Models, 75--96
McFadden, D. (1978): Modeling the Choice of Residential Location, Spatial Interaction Theory and Planning Models, 75--96
1978
-
[21]
Nielsen, T
McGranaghan, C., K. Nielsen, T. O’Donoghue, J. Somerville, and C. D. Sprenger (2024): Distinguishing common ratio preferences from common ratio effects using paired valuation tasks, American Economic Review, 114, 307--347
2024
-
[22]
(2000): A Practitioner's Guide to Estimation of Random-Coefficients Logit Models of Demand, Journal of Economics & Management Strategy, 9, 513--548
Nevo, A. (2000): A Practitioner's Guide to Estimation of Random-Coefficients Logit Models of Demand, Journal of Economics & Management Strategy, 9, 513--548
2000
-
[23]
(1998): The Probability Weighting Function, Econometrica, 66, 497--527
Prelec, D. (1998): The Probability Weighting Function, Econometrica, 66, 497--527
1998
-
[24]
(2025): Stochastic choice theory, Cambridge Books
Strzalecki, T. (2025): Stochastic choice theory, Cambridge Books
2025
-
[25]
Thurstone, L. L. (1927): Psychophysical analysis, The American journal of psychology, 38, 368--389
1927
-
[26]
(1969): Intransitivity of preferences
Tversky, A. (1969): Intransitivity of preferences. Psychological review, 76, 31
1969
-
[27]
Psychological review, 79, 281
--- -.1pt --- -.1pt --- (1972): Elimination by aspects: A theory of choice. Psychological review, 79, 281
1972
-
[28]
Wilcox, N. T. (2008): Stochastic models for binary discrete choice under risk: A critical primer and econometric comparison, in Risk aversion in experiments, Emerald Group Publishing
2008
-
[29]
--- -.1pt --- -.1pt --- (2011): ‘Stochastically more risk averse:’ A contextual theory of stochastic discrete choice under risk, Journal of Econometrics, 162, 89--104, the Economics and Econometrics of Risk
2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.