pith. machine review for the scientific record. sign in

arxiv: 2604.06050 · v2 · submitted 2026-04-07 · 💰 econ.TH

Recognition: no theorem link

Robust Testing Of the Allais Paradox By Paired Choices vs. Paired Valuations

Authors on Pith no claims yet

Pith reviewed 2026-05-10 18:27 UTC · model grok-4.3

classification 💰 econ.TH
keywords common ratio effectAllais paradoxstochastic choicepaired choicesvaluation testsexpected utilityrobust testingdecision under risk
0
0 comments X

The pith

A strong paired choice test for the common ratio effect stays unbiased under stochastic choice and shows the effect remains prevalent in data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper challenges recent findings that the common ratio effect vanishes under valuation-based tests and instead shows those tests are biased against detecting violations of expected utility. It proves that a strengthened version of the traditional paired choice test avoids bias across standard stochastic choice models such as logit and probit. Reapplying this robust test to existing experimental datasets reveals that the common ratio effect is still highly prevalent. This matters because the common ratio effect is a central empirical challenge to expected utility theory in decisions under risk, and its persistence affects whether stochastic noise can explain away observed anomalies.

Core claim

McGranaghan et al. show that standard paired choice tests for the common ratio effect are structurally biased when choice is stochastic and propose valuation tests as a robust alternative, finding no systematic evidence for the effect. We argue that valuation tests are inherently biased and lack predictive power under standard expected utility assumptions. In contrast, we advocate for a strong paired choice test, proving it remains robustly unbiased across common models of stochastic choice. Applying this strong test to existing experimental data, we find that the common ratio effect remains highly prevalent.

What carries the argument

The strong paired choice test, a reinforced version of binary choice comparisons between lotteries that is proven unbiased for detecting the common ratio effect across stochastic choice models.

If this is right

  • Valuation tests will systematically under-detect violations of expected utility because they are biased even when the true model satisfies EU.
  • The common ratio effect survives as a robust empirical finding once testing procedures account for stochastic choice.
  • Stochastic choice models alone do not overturn the Allais paradox in the common ratio domain.
  • Existing experimental data on probability scaling in choices continue to support non-expected-utility behavior when analyzed with the strong test.
  • Testing protocols for other Allais-type violations should prioritize strong paired choice designs over valuation elicitations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The result implies that explanations for the Allais paradox must go beyond simple randomness in choice and address systematic preference patterns.
  • Similar robustness checks could be applied to the common consequence effect or other EU violations to test whether they also survive unbiased methods.
  • The bias in valuation tests may arise because continuous value reports introduce different noise structures than binary choices, a distinction worth modeling explicitly.
  • If the strong test is adopted widely, meta-analyses of risk preferences could be updated to down-weight older valuation-based studies.

Load-bearing premise

The common stochastic choice models such as logit and probit cover the relevant range of realistic behavior and that reanalysis of existing datasets does not introduce selection or application biases.

What would settle it

A new experiment that applies the strong paired choice test to fresh subjects and finds no systematic common ratio effect would undermine the claim that the effect remains highly prevalent under unbiased methods.

Figures

Figures reproduced from arXiv: 2604.06050 by Federico Echenique, Gerelt Tserenjigmid.

Figure 1
Figure 1. Figure 1: Possible values of (E[mAB], E[mCD]) as a function of γ when y = 30, p = 0.8 and r = 0.4. When γ = 1 we obtain the point (py, py) = (24, 24). As γ → 0 we obtain any vector in R2 ++. Proposition 1. Consider an expected utility agent with a CRRA von-Neumann￾Morgenstern utility function u(x) = x γ , where γ ∈ (0, 1). Fix y > 0 and p, r ∈ (0, 1). Suppose that p > 1/2. (1) For any (z1, z2) ∈ R2 ++, there exists … view at source ↗
Figure 2
Figure 2. Figure 2: Comparison of the EU, CRE, and RCRE regions under paired choice tests. Proposition 6. Consider the model in Equation (1) and let f(x) = u(x). For each of the three assumptions of Proposition 5, there exist ϵp, ϵx, ϵp,x that satisfy the as￾sumption and E[εAB] ̸= E[εCD]. This result and its proof show that even when ϵp, ϵx, ϵp,x are independent and sym￾metric around zero, we may have E[εAB] ̸= E[εCD]. 4. Det… view at source ↗
Figure 3
Figure 3. Figure 3: Implications of [PITH_FULL_IMAGE:figures/full_fig_p022_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Illustration of the construction in the proof of Proposi￾tion 1. Define the random variable X piecewise, conditional on the value of Z and Y , by X =    Z if Z < −d a + bY if Z ≥ −d where a = c−d 2 and b = c+d 2d . This construction is illustrated in [PITH_FULL_IMAGE:figures/full_fig_p029_4.png] view at source ↗
read the original abstract

McGranaghan, Nielsen, O'Donoghue, Somerville, and Sprenger [2024] show that standard paired choice tests for the common ratio effect are structurally biased when choice is stochastic, proposing valuation tests as a robust alternative. Using valuation tests, they find no systematic evidence for the common ratio effect, seemingly overturning much of the extant literature. We evaluate this conclusion in light of stochastic choice theory. We argue that valuation tests are inherently biased and lack predictive power under standard expected utility assumptions. In contrast, we advocate for a ``strong'' paired choice test, proving it remains robustly unbiased across common models of stochastic choice. Applying this strong test to existing experimental data, we find that the common ratio effect remains highly prevalent.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper critiques McGranaghan et al. (2024) for concluding that the common ratio effect lacks systematic evidence when using valuation tests, which they proposed as robust to stochastic choice. It argues instead that valuation tests are inherently biased and lack predictive power under standard expected utility assumptions with stochastic choice. The authors advocate a 'strong' paired choice test, prove it is robustly unbiased across common stochastic choice models (e.g., logit, probit), and reapply it to existing experimental data to conclude that the common ratio effect remains highly prevalent.

Significance. If the unbiasedness proof for the strong paired choice test holds and the reanalysis avoids selection biases, the result would restore the common ratio effect as a prevalent phenomenon in the Allais paradox literature, challenging recent valuation-based findings and underscoring the sensitivity of anomaly detection to test design under stochastic choice. The work provides a model-independent theoretical argument and an empirical reanalysis that could shift methodological recommendations in behavioral economics experiments.

major comments (2)
  1. [Empirical reanalysis section] The reanalysis of pre-existing datasets to apply the strong paired choice test: subsetting observations to satisfy the stricter pairing structure (or imputing missing pairs) risks selection on unobservables or altering the effective sample in ways that could inflate prevalence estimates. The manuscript should detail the exact matching procedure, report sample sizes before/after subsetting, and include robustness checks (e.g., comparing to full samples or alternative pairings) to address this load-bearing concern for the empirical claim.
  2. [Theoretical section on strong test] The proof of robustness for the strong paired choice test across stochastic models: while the abstract states it remains unbiased for common models like logit and probit, the derivation must explicitly enumerate the full set of models covered and demonstrate that the test statistic's expectation is zero under each (independent of the common ratio violation). If any realistic stochastic model is omitted, the 'robustly unbiased' claim is weakened.
minor comments (2)
  1. [Introduction] Clarify notation for the 'strong' paired choice test versus standard paired choice tests early in the paper to avoid confusion with the McGranaghan et al. terminology.
  2. [Comparison table] Add a table summarizing the key properties (bias, power) of valuation tests, standard paired choice tests, and the proposed strong test under EU and stochastic choice.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments. We address each major comment point by point below, indicating where revisions will be made to improve the manuscript.

read point-by-point responses
  1. Referee: [Empirical reanalysis section] The reanalysis of pre-existing datasets to apply the strong paired choice test: subsetting observations to satisfy the stricter pairing structure (or imputing missing pairs) risks selection on unobservables or altering the effective sample in ways that could inflate prevalence estimates. The manuscript should detail the exact matching procedure, report sample sizes before/after subsetting, and include robustness checks (e.g., comparing to full samples or alternative pairings) to address this load-bearing concern for the empirical claim.

    Authors: We agree that the empirical reanalysis requires additional transparency to address potential selection concerns. In the revised manuscript, we will provide a detailed description of the exact matching procedure used to subset observations for the strong paired choice test. We will report sample sizes before and after subsetting for each dataset. We will also add robustness checks, such as comparisons to the full samples and alternative pairing methods, to confirm that the prevalence estimates of the common ratio effect are not driven by the subsetting process. These revisions will directly strengthen the empirical section. revision: yes

  2. Referee: [Theoretical section on strong test] The proof of robustness for the strong paired choice test across stochastic models: while the abstract states it remains unbiased for common models like logit and probit, the derivation must explicitly enumerate the full set of models covered and demonstrate that the test statistic's expectation is zero under each (independent of the common ratio violation). If any realistic stochastic model is omitted, the 'robustly unbiased' claim is weakened.

    Authors: We thank the referee for this suggestion to enhance the explicitness of the theoretical proof. The current derivation establishes that the strong paired choice test is unbiased under standard stochastic choice models, including logit and probit. In the revised manuscript, we will explicitly enumerate the full set of models covered (logit, probit, and other common variants such as tremble models) and include step-by-step derivations showing that the expectation of the test statistic is zero under each model, independent of any common ratio violation. This will make the robustness claim more precise. revision: yes

Circularity Check

0 steps flagged

Theoretical unbiasedness proof and external-data reanalysis are independent of each other

full rationale

The paper's core derivation is a mathematical proof that a proposed 'strong' paired-choice test statistic remains unbiased under standard stochastic choice models (logit, probit, etc.). This proof is presented as a first-principles result and does not rely on fitting parameters to the target data or on self-citations for its validity. The subsequent empirical claim—that the common-ratio effect is prevalent—arises from applying the already-proven test to pre-existing experimental datasets collected by other researchers. No equation reduces to its own input by construction, no fitted quantity is relabeled as a prediction, and no load-bearing premise collapses to a self-citation chain. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The paper rests on standard stochastic choice theory and expected utility as background; it introduces the strong test as a new procedure without additional fitted parameters or new entities.

axioms (2)
  • domain assumption Common models of stochastic choice (logit, probit, etc.) describe realistic choice behavior.
    The proof that the strong test is unbiased is stated to hold across these models.
  • domain assumption Valuation tasks lack predictive power under standard expected utility assumptions when choice is stochastic.
    Central argument against the 2024 valuation approach.
invented entities (1)
  • Strong paired choice test no independent evidence
    purpose: A version of paired choice testing that remains unbiased for the common ratio effect under stochastic choice.
    Newly advocated and proven in the paper.

pith-pipeline@v0.9.0 · 5427 in / 1419 out tokens · 76030 ms · 2026-05-10T18:27:23.716211+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

29 extracted references

  1. [1]

    Agranov, M. and P. Ortoleva (2017): Stochastic choice and preferences for randomization, Journal of Political Economy, 125, 40--68

  2. [2]

    Allais, M. (1953): Le comportement de l'homme rationnel devant le risque: critique des postulats et axiomes de l' \'e cole am \'e ricaine, Econometrica: journal of the Econometric Society, 503--546

  3. [3]

    Apesteguia, J. and M. A. Ballester (2018): Monotone stochastic choice models: The case of risk and time preferences, Journal of Political Economy, 126, 74--106

  4. [4]

    Ballinger, T. P. and N. T. Wilcox (1997): Decisions, Error and Heterogeneity, The Economic Journal, 107, 1090--1105

  5. [5]

    Barberis, N. C. (2013): Thirty years of prospect theory in economics: A review and assessment, Journal of economic perspectives, 27, 173--196

  6. [6]

    Ben-Akiva, M. E. (1973): Structure of passenger travel demand models. Ph.D. thesis, Massachusetts Institute of Technology

  7. [7]

    Levinsohn, and A

    Berry, S., J. Levinsohn, and A. Pakes (1995): Automobile Prices in Market Equilibrium, Econometrica, 63, 841--890

  8. [8]

    Panchenko, and A

    Blavatskyy, P., V. Panchenko, and A. Ortmann (2023): How common is the common-ratio effect? Experimental Economics, 26, 253–272

  9. [9]

    (1995): Individual decision making, in The handbook of experimental economics, ed

    Camerer, C. (1995): Individual decision making, in The handbook of experimental economics, ed. by J. H. Kagel and A. E. Roth, Princeton University Press, 587--704

  10. [10]

    Fishburn, P. C. (1978): Choice probabilities and choice functions, Journal of Mathematical Psychology, 18, 205--219

  11. [11]

    Gul, F. and W. Pesendorfer (2006): Random expected utility, Econometrica, 74, 121--146

  12. [12]

    Hausman, J. A. and D. A. Wise (1978): A conditional probit model for qualitative choice: Discrete decisions recognizing interdependence and heterogeneous preferences, Econometrica: Journal of the econometric society, 403--426

  13. [13]

    He, J. and P. Natenzon (2024): Moderate utility, American Economic Review: Insights, 6, 176--195

  14. [14]

    Hey, J. D. (2001): Does repetition improve consistency? Experimental economics, 4, 5--54

  15. [15]

    Kahneman, D. and A. Tversky (1979): Prospect Theory. An Analysis of Decision under Uncertainty, Econometrica, 47, 263--291

  16. [16]

    (2005): Modelling the Stochastic Component of Behaviour in Experiments: Some Issues for the Interpretation of Data, Experimental Economics, 8, 301–323

    Loomes, G. (2005): Modelling the Stochastic Component of Behaviour in Experiments: Some Issues for the Interpretation of Data, Experimental Economics, 8, 301–323

  17. [17]

    Machina, M. J. (1987): Choice under uncertainty: Problems solved and unsolved, Journal of Economic Perspectives, 1, 121--154

  18. [18]

    --- -.1pt --- -.1pt --- (2008): Non-expected utility theory, in The New Palgrave Dictionary of Economics, Springer, 1--14

  19. [19]

    --- -.1pt --- -.1pt --- (2018): Non-expected utility theory, in The New Palgrave Dictionary of Economics, Springer, 9570--9582

  20. [20]

    (1978): Modeling the Choice of Residential Location, Spatial Interaction Theory and Planning Models, 75--96

    McFadden, D. (1978): Modeling the Choice of Residential Location, Spatial Interaction Theory and Planning Models, 75--96

  21. [21]

    Nielsen, T

    McGranaghan, C., K. Nielsen, T. O’Donoghue, J. Somerville, and C. D. Sprenger (2024): Distinguishing common ratio preferences from common ratio effects using paired valuation tasks, American Economic Review, 114, 307--347

  22. [22]

    (2000): A Practitioner's Guide to Estimation of Random-Coefficients Logit Models of Demand, Journal of Economics & Management Strategy, 9, 513--548

    Nevo, A. (2000): A Practitioner's Guide to Estimation of Random-Coefficients Logit Models of Demand, Journal of Economics & Management Strategy, 9, 513--548

  23. [23]

    (1998): The Probability Weighting Function, Econometrica, 66, 497--527

    Prelec, D. (1998): The Probability Weighting Function, Econometrica, 66, 497--527

  24. [24]

    (2025): Stochastic choice theory, Cambridge Books

    Strzalecki, T. (2025): Stochastic choice theory, Cambridge Books

  25. [25]

    Thurstone, L. L. (1927): Psychophysical analysis, The American journal of psychology, 38, 368--389

  26. [26]

    (1969): Intransitivity of preferences

    Tversky, A. (1969): Intransitivity of preferences. Psychological review, 76, 31

  27. [27]

    Psychological review, 79, 281

    --- -.1pt --- -.1pt --- (1972): Elimination by aspects: A theory of choice. Psychological review, 79, 281

  28. [28]

    Wilcox, N. T. (2008): Stochastic models for binary discrete choice under risk: A critical primer and econometric comparison, in Risk aversion in experiments, Emerald Group Publishing

  29. [29]

    --- -.1pt --- -.1pt --- (2011): ‘Stochastically more risk averse:’ A contextual theory of stochastic discrete choice under risk, Journal of Econometrics, 162, 89--104, the Economics and Econometrics of Risk