Anatomy of the Market: A Body-Tail Test of Factor Models

Useong Shin

arxiv: 2606.23596 · v4 · pith:FEIEZKRWnew · submitted 2026-06-22 · 💱 q-fin.GN

Anatomy of the Market: A Body-Tail Test of Factor Models

Useong Shin This is my paper

Pith reviewed 2026-07-01 07:21 UTC · model grok-4.3

classification 💱 q-fin.GN

keywords factor modelspricing errorsmarket decompositionbody-tail testq5 modeldaily frequencystochastic discount factorCRSP market

0 comments

The pith

A capitalization-ranked split of the market into body and tail legs reveals that the q5 model produces offsetting daily alphas even when it spans the aggregate market best.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines whether factor models that price the full market without error also price its components consistently. It splits the CRSP market into capitalization-ranked body and tail legs that add back to the market return, then tests models on these legs separately. At daily frequency every model clears the overall benchmark, yet only q5 shows a systematic pattern of negative body alphas and positive tail alphas across nine different split ratios. Random splits eliminate the pattern and monthly aggregation weakens the joint rejection while moving relative weakness toward FF3.

Core claim

In an ideal stochastic discount factor zero pricing errors and maximum Sharpe ratio coincide, but in low-dimensional approximations they need not. Decomposing an investible CRSP market into capitalization-ranked body and tail legs that recombine to the market return shows that at daily frequency all models pass the aggregate benchmark, yet q5 alone leaves systematic offsetting leg alphas—negative body, positive tail—at all nine split ratios despite holding the strongest spanning position. Matched random splits remove the pattern. Monthly aggregation attenuates q5's joint rejection and shifts relative weakness toward FF3.

What carries the argument

The capitalization-ranked body-tail decomposition that splits the CRSP market into legs recombining exactly to the market return.

If this is right

Internal consistency of factor models is frequency-dependent rather than a fixed property.
q5 exhibits the strongest market spanning yet still fails the body-tail consistency test at daily frequency.
Aggregate benchmarks alone are insufficient to detect offsetting pricing errors across market segments.
Random splits of the market do not generate the same offsetting alpha pattern seen in the capitalization-ranked split.
Monthly data aggregation changes which model shows the greatest weakness relative to the others.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Daily-frequency tests may surface approximation errors in factor models that monthly data smooth over.
The body-tail method could be applied to other markets or asset classes to check whether the same frequency dependence appears.
Models that survive both aggregate and leg tests might be closer to ideal SDFs than those that pass only aggregates.

Load-bearing premise

Splitting the market by capitalization rank isolates variation that is economically meaningful enough to expose internal inconsistencies in factor models.

What would settle it

Observing that the negative-body positive-tail alpha pattern for q5 disappears under an alternative non-capitalization split or after orthogonalizing the legs to additional factors would undermine the claim of systematic inconsistency.

Figures

Figures reproduced from arXiv: 2606.23596 by Useong Shin.

**Figure 3.1.** Figure 3.1: CRSP-based market return and standard market factors [PITH_FULL_IMAGE:figures/full_fig_p010_3_1.png] view at source ↗

read the original abstract

In an ideal stochastic discount factor, zero pricing errors and maximum Sharpe ratio coincide; in a low-dimensional approximation they need not. I test this separation by decomposing an investible CRSP market into capitalization-ranked body and tail legs that recombine to the market return. At the daily frequency, all models pass the aggregate benchmark, but q5 alone leaves systematic offsetting leg alphas-negative body, positive tail-at all nine split ratios, despite holding the strongest spanning position. Matched random splits remove the pattern. Monthly aggregation attenuates q5's joint rejection and shifts relative weakness toward FF3, showing that internal consistency is frequency-dependent.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The body-tail split test is a clean new diagnostic for factor-model consistency, but the q5 daily-frequency result is under-supported and plausibly explained by microstructure noise in the tail.

read the letter

The paper's main contribution is a capitalization-ranked decomposition of the CRSP market into body and tail legs that add back to the market return, then checking whether standard factor models price both legs without offsetting alphas. The random-split placebo and the frequency comparison (daily versus monthly) are useful controls that prior spanning tests do not directly supply.

What stands out is the reported pattern at daily frequency: q5 produces the cleanest aggregate pricing yet still shows systematic negative body alphas and positive tail alphas across nine split ratios, while the random splits do not. Monthly aggregation weakens that joint rejection. This frequency dependence is worth noting.

The soft spots are straightforward. The abstract gives no standard errors, t-stats, or details on how the legs are constructed or rebalanced, so the size and reliability of the offsetting alphas cannot be judged. The stress-test concern about bid-ask bounce and thin trading in the low-cap tail is plausible and unaddressed in the reported checks; the fact that monthly data attenuates the result is consistent with that alternative. Without liquidity screens, mid-quote returns, or lead-lag adjustments, it is hard to treat the daily pattern as evidence of internal inconsistency rather than data artifact.

The work is aimed at empirical asset-pricing researchers who already run factor-model horse races. A reader already familiar with q5 and the CRSP data will see the incremental test clearly. The central claim is not yet strongly supported, but the test design itself is worth referee attention because it is falsifiable and easy to replicate or extend.

I would send it to review with a request for the missing statistics and microstructure robustness checks.

Referee Report

2 major / 2 minor

Summary. The paper decomposes the CRSP market into capitalization-ranked body and tail legs that recombine to the aggregate return. It reports that at daily frequency all tested factor models span the aggregate market, but q5 alone produces systematic negative body alphas and positive tail alphas across nine split ratios; this pattern is absent in matched random splits. Monthly aggregation attenuates the q5 rejection and shifts relative weakness toward FF3, leading to the conclusion that internal consistency of factor models is frequency-dependent.

Significance. The body-tail decomposition with a random-split placebo is a clean and falsifiable design that directly tests whether spanning the aggregate implies consistency on economically meaningful sub-portfolios. If the daily-frequency pattern survives microstructure controls, the result would show that even the best-spanning model (q5) can be internally inconsistent at high frequency, with implications for the use of daily data in asset-pricing tests and for the interpretation of multi-factor models.

major comments (2)

[Abstract] Abstract and daily-frequency results: the claim of 'systematic offsetting leg alphas' for q5 is presented without error bars, t-statistics, or any statistical test of the joint body-tail rejection; this absence is load-bearing because the central claim rests on the existence and consistency of these alphas across nine splits.
[Daily-frequency results] Daily-frequency leg-alpha results (described in abstract and results section): the tail leg consists of low-capitalization stocks whose daily returns are known to be contaminated by bid-ask bounce and non-synchronous trading; the paper reports that monthly aggregation weakens the rejection but supplies no robustness checks using mid-quote returns, liquidity screens, or lead-lag adjustments, leaving open the possibility that the offsetting alphas reflect microstructure noise rather than model inconsistency.

minor comments (2)

[Method] The nine capitalization-ranked split ratios are described only by their existence; a table or appendix listing the exact percentile cutoffs would improve reproducibility.
[Aggregate benchmark] The paper states that q5 'holds the strongest spanning position' on the aggregate; reporting the precise R² or pricing-error metrics that establish this ranking would strengthen the contrast with the leg results.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The two major comments identify clear opportunities to strengthen the statistical presentation and to address potential microstructure concerns. We respond to each below and will incorporate the suggested changes in the revision.

read point-by-point responses

Referee: [Abstract] Abstract and daily-frequency results: the claim of 'systematic offsetting leg alphas' for q5 is presented without error bars, t-statistics, or any statistical test of the joint body-tail rejection; this absence is load-bearing because the central claim rests on the existence and consistency of these alphas across nine splits.

Authors: We agree that explicit statistical tests would make the central claim more robust. Although the consistency of the sign pattern across all nine split ratios already provides strong informal evidence, we will add t-statistics for the individual body and tail alphas together with a joint Wald test of the body-tail pair in the revised results section. A brief reference to these tests will also be added to the abstract. revision: yes
Referee: [Daily-frequency results] Daily-frequency leg-alpha results (described in abstract and results section): the tail leg consists of low-capitalization stocks whose daily returns are known to be contaminated by bid-ask bounce and non-synchronous trading; the paper reports that monthly aggregation weakens the rejection but supplies no robustness checks using mid-quote returns, liquidity screens, or lead-lag adjustments, leaving open the possibility that the offsetting alphas reflect microstructure noise rather than model inconsistency.

Authors: This is a legitimate concern. While the matched random-split placebo already controls for size-driven artifacts, it does not directly purge bid-ask bounce or non-synchronicity. In the revision we will add three robustness exercises: (i) Dimson (1979) lead-lag adjustments, (ii) exclusion of the smallest 1 % and 5 % of stocks within the tail, and (iii) liquidity screens based on positive trading volume. We will report whether the q5 offsetting pattern survives these filters and will qualify the conclusions if the alphas are materially attenuated. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical test uses external data and standard models

full rationale

The paper performs an empirical decomposition of CRSP market returns into capitalization-ranked body and tail legs, then runs time-series regressions of these legs on standard factor models (q5, FF3, etc.) at daily and monthly frequencies, with random-split placebos. No derivation chain exists that reduces a claimed prediction or result to a fitted parameter defined by the result itself, nor does any load-bearing premise rely on self-citation of an unverified uniqueness theorem or ansatz. The test is externally falsifiable against CRSP returns and does not rename known patterns or smuggle assumptions via prior work by the same author. This is the normal case of a self-contained empirical study.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that leg alphas after aggregate spanning measure internal inconsistency, plus the free choice of nine capitalization split ratios; no new entities are postulated.

free parameters (1)

split ratios
Nine capitalization thresholds chosen to define body and tail legs; values not derived from theory.

axioms (1)

domain assumption Factor models are evaluated by whether they produce zero alphas on sub-portfolios after spanning the aggregate return
Core premise of the body-tail test stated in the abstract.

pith-pipeline@v0.9.1-grok · 5620 in / 1285 out tokens · 48673 ms · 2026-07-01T07:21:27.842408+00:00 · methodology

Review history (3 revisions) →

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Cap-Axis Integral Diagnostic of Factor Models
q-fin.GN 2026-07 unverdicted novelty 6.0

Introduces cap-axis integral diagnostic revealing zero-alpha violations on cap-rank subspaces in factor models using 1967-2024 CRSP data.

Reference graph

Works this paper leans on

26 extracted references · 21 canonical work pages · cited by 1 Pith paper · 1 internal anchor

[1]

R., Ross, S

Gibbons, M. R., Ross, S. A., & Shanken, J. (1989). A test of the efficiency of a given portfolio. Econometrica, 57(5), 1121--1152. https://www.jstor.org/stable/1913625

work page arXiv 1989
[2]

W., & MacKinlay, A

Lo, A. W., & MacKinlay, A. C. (1990). Data-snooping biases in tests of financial asset pricing models. The Review of Financial Studies, 3(3), 431--467. https://doi.org/10.1093/rfs/3.3.431

work page doi:10.1093/rfs/3.3.431 1990
[3]

P., & Jagannathan, R

Hansen, L. P., & Jagannathan, R. (1997). Assessing specification errors in stochastic discount factor models. The Journal of Finance, 52(2), 557--590. https://doi.org/10.1111/j.1540-6261.1997.tb04813.x

work page doi:10.1111/j.1540-6261.1997.tb04813.x 1997
[4]

Cochrane, J. H. (2005). Asset Pricing (Revised ed.). Princeton University Press. https://www.johnhcochrane.com/asset-pricing

2005
[5]

Lewellen, J., Nagel, S., & Shanken, J. (2010). A skeptical appraisal of asset-pricing tests. Journal of Financial Economics, 96(2), 175--194. https://doi.org/10.1016/j.jfineco.2009.09.001

work page doi:10.1016/j.jfineco.2009.09.001 2010
[6]

Barillas, F., & Shanken, J. (2017). Which alpha? The Review of Financial Studies, 30(4), 1316--1338. https://doi.org/10.1093/rfs/hhw101

work page doi:10.1093/rfs/hhw101 2017
[7]

Barillas, F., & Shanken, J. (2018). Comparing asset pricing models. The Journal of Finance, 73(2), 715--754. https://doi.org/10.1111/jofi.12607

work page doi:10.1111/jofi.12607 2018
[8]

Kozak, S., Nagel, S., & Santosh, S. (2018). Interpreting factor models. The Journal of Finance, 73(3), 1183--1223. https://doi.org/10.1111/jofi.12612

work page doi:10.1111/jofi.12612 2018
[9]

Y., & Zimmermann, T

Chen, A. Y., & Zimmermann, T. (2022). Open source cross-sectional asset pricing. Critical Finance Review, 11(2), 207--264. https://doi.org/10.1561/104.00000112

work page doi:10.1561/104.00000112 2022
[10]

Giglio, S., Xiu, D., & Zhang, D. (2025). Test assets and weak factors. The Journal of Finance, 80(1), 259--319. https://doi.org/10.1111/jofi.13415

work page doi:10.1111/jofi.13415 2025
[11]

Shin, U. (2026). Which portfolios? The construction dependence of factor model performance. arXiv preprint arXiv:2606.19550. https://doi.org/10.48550/arXiv.2606.19550

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2606.19550 2026
[12]

Jegadeesh, N., & Titman, S. (1993). Returns to buying winners and selling losers: Implications for stock market efficiency. The Journal of Finance, 48(1), 65--91. https://doi.org/10.1111/j.1540-6261.1993.tb04702.x

work page doi:10.1111/j.1540-6261.1993.tb04702.x 1993
[13]

Carhart, M. M. (1997). On persistence in mutual fund performance. The Journal of Finance, 52(1), 57--82. https://doi.org/10.1111/j.1540-6261.1997.tb03808.x

work page doi:10.1111/j.1540-6261.1997.tb03808.x 1997
[14]

F., & French, K

Fama, E. F., & French, K. R. (1992). The cross-section of expected stock returns. The Journal of Finance, 47(2), 427--465. https://doi.org/10.1111/j.1540-6261.1992.tb04398.x

work page doi:10.1111/j.1540-6261.1992.tb04398.x 1992
[15]

F., & French, K

Fama, E. F., & French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33(1), 3--56. https://doi.org/10.1016/0304-405X(93)90023-5

work page doi:10.1016/0304-405x(93)90023-5 1993
[16]

F., & French, K

Fama, E. F., & French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116(1), 1--22. https://doi.org/10.1016/j.jfineco.2014.10.010

work page doi:10.1016/j.jfineco.2014.10.010 2015
[17]

F., & French, K

Fama, E. F., & French, K. R. (2018). Choosing factors. Journal of Financial Economics, 128(2), 234--252. https://doi.org/10.1016/j.jfineco.2018.02.012

work page doi:10.1016/j.jfineco.2018.02.012 2018
[18]

Hou, K., Xue, C., & Zhang, L. (2015). Digesting anomalies: An investment approach. The Review of Financial Studies, 28(3), 650--705. https://doi.org/10.1093/rfs/hhu068

work page doi:10.1093/rfs/hhu068 2015
[19]

Hou, K., Mo, H., Xue, C., & Zhang, L. (2019). Which factors? Review of Finance, 23(1), 1--35. https://doi.org/10.1093/rof/rfy032

work page doi:10.1093/rof/rfy032 2019
[20]

Hou, K., Xue, C., & Zhang, L. (2020). Replicating anomalies. The Review of Financial Studies, 33(5), 2019--2133. https://doi.org/10.1093/rfs/hhy131

work page doi:10.1093/rfs/hhy131 2020
[21]

Hou, K., Mo, H., Xue, C., & Zhang, L. (2021). An augmented q-factor model with expected growth. Review of Finance, 25(1), 1--41. https://doi.org/10.1093/rof/rfaa004

work page doi:10.1093/rof/rfaa004 2021
[22]

Hou, K., Mo, H., Xue, C., & Zhang, L. (2024). The economics of security analysis. Management Science, 70(1), 164--186. https://doi.org/10.1287/mnsc.2022.4640

work page doi:10.1287/mnsc.2022.4640 2024
[23]

Center for Research in Security Prices, LLC. (2026). CRSP US Stock Databases [Data set]. Accessed via Wharton Research Data Services, June 5, 2026. https://www.crsp.org/research/

2026
[24]

French, K. R. (2026). Kenneth R. French Data Library [Data set]. Accessed May 5, 2026. https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html

2026
[25]

Global-q.org. (2026). Factors and testing portfolios [Data set]. Accessed May 5, 2026. https://global-q.org/factors.html

2026
[26]

Open Source Asset Pricing. (2026). Open Source Asset Pricing [Data set]. Accessed June 24, 2026. https://www.openassetpricing.com

2026

[1] [1]

R., Ross, S

Gibbons, M. R., Ross, S. A., & Shanken, J. (1989). A test of the efficiency of a given portfolio. Econometrica, 57(5), 1121--1152. https://www.jstor.org/stable/1913625

work page arXiv 1989

[2] [2]

W., & MacKinlay, A

Lo, A. W., & MacKinlay, A. C. (1990). Data-snooping biases in tests of financial asset pricing models. The Review of Financial Studies, 3(3), 431--467. https://doi.org/10.1093/rfs/3.3.431

work page doi:10.1093/rfs/3.3.431 1990

[3] [3]

P., & Jagannathan, R

Hansen, L. P., & Jagannathan, R. (1997). Assessing specification errors in stochastic discount factor models. The Journal of Finance, 52(2), 557--590. https://doi.org/10.1111/j.1540-6261.1997.tb04813.x

work page doi:10.1111/j.1540-6261.1997.tb04813.x 1997

[4] [4]

Cochrane, J. H. (2005). Asset Pricing (Revised ed.). Princeton University Press. https://www.johnhcochrane.com/asset-pricing

2005

[5] [5]

Lewellen, J., Nagel, S., & Shanken, J. (2010). A skeptical appraisal of asset-pricing tests. Journal of Financial Economics, 96(2), 175--194. https://doi.org/10.1016/j.jfineco.2009.09.001

work page doi:10.1016/j.jfineco.2009.09.001 2010

[6] [6]

Barillas, F., & Shanken, J. (2017). Which alpha? The Review of Financial Studies, 30(4), 1316--1338. https://doi.org/10.1093/rfs/hhw101

work page doi:10.1093/rfs/hhw101 2017

[7] [7]

Barillas, F., & Shanken, J. (2018). Comparing asset pricing models. The Journal of Finance, 73(2), 715--754. https://doi.org/10.1111/jofi.12607

work page doi:10.1111/jofi.12607 2018

[8] [8]

Kozak, S., Nagel, S., & Santosh, S. (2018). Interpreting factor models. The Journal of Finance, 73(3), 1183--1223. https://doi.org/10.1111/jofi.12612

work page doi:10.1111/jofi.12612 2018

[9] [9]

Y., & Zimmermann, T

Chen, A. Y., & Zimmermann, T. (2022). Open source cross-sectional asset pricing. Critical Finance Review, 11(2), 207--264. https://doi.org/10.1561/104.00000112

work page doi:10.1561/104.00000112 2022

[10] [10]

Giglio, S., Xiu, D., & Zhang, D. (2025). Test assets and weak factors. The Journal of Finance, 80(1), 259--319. https://doi.org/10.1111/jofi.13415

work page doi:10.1111/jofi.13415 2025

[11] [11]

Shin, U. (2026). Which portfolios? The construction dependence of factor model performance. arXiv preprint arXiv:2606.19550. https://doi.org/10.48550/arXiv.2606.19550

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2606.19550 2026

[12] [12]

Jegadeesh, N., & Titman, S. (1993). Returns to buying winners and selling losers: Implications for stock market efficiency. The Journal of Finance, 48(1), 65--91. https://doi.org/10.1111/j.1540-6261.1993.tb04702.x

work page doi:10.1111/j.1540-6261.1993.tb04702.x 1993

[13] [13]

Carhart, M. M. (1997). On persistence in mutual fund performance. The Journal of Finance, 52(1), 57--82. https://doi.org/10.1111/j.1540-6261.1997.tb03808.x

work page doi:10.1111/j.1540-6261.1997.tb03808.x 1997

[14] [14]

F., & French, K

Fama, E. F., & French, K. R. (1992). The cross-section of expected stock returns. The Journal of Finance, 47(2), 427--465. https://doi.org/10.1111/j.1540-6261.1992.tb04398.x

work page doi:10.1111/j.1540-6261.1992.tb04398.x 1992

[15] [15]

F., & French, K

Fama, E. F., & French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33(1), 3--56. https://doi.org/10.1016/0304-405X(93)90023-5

work page doi:10.1016/0304-405x(93)90023-5 1993

[16] [16]

F., & French, K

Fama, E. F., & French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116(1), 1--22. https://doi.org/10.1016/j.jfineco.2014.10.010

work page doi:10.1016/j.jfineco.2014.10.010 2015

[17] [17]

F., & French, K

Fama, E. F., & French, K. R. (2018). Choosing factors. Journal of Financial Economics, 128(2), 234--252. https://doi.org/10.1016/j.jfineco.2018.02.012

work page doi:10.1016/j.jfineco.2018.02.012 2018

[18] [18]

Hou, K., Xue, C., & Zhang, L. (2015). Digesting anomalies: An investment approach. The Review of Financial Studies, 28(3), 650--705. https://doi.org/10.1093/rfs/hhu068

work page doi:10.1093/rfs/hhu068 2015

[19] [19]

Hou, K., Mo, H., Xue, C., & Zhang, L. (2019). Which factors? Review of Finance, 23(1), 1--35. https://doi.org/10.1093/rof/rfy032

work page doi:10.1093/rof/rfy032 2019

[20] [20]

Hou, K., Xue, C., & Zhang, L. (2020). Replicating anomalies. The Review of Financial Studies, 33(5), 2019--2133. https://doi.org/10.1093/rfs/hhy131

work page doi:10.1093/rfs/hhy131 2020

[21] [21]

Hou, K., Mo, H., Xue, C., & Zhang, L. (2021). An augmented q-factor model with expected growth. Review of Finance, 25(1), 1--41. https://doi.org/10.1093/rof/rfaa004

work page doi:10.1093/rof/rfaa004 2021

[22] [22]

Hou, K., Mo, H., Xue, C., & Zhang, L. (2024). The economics of security analysis. Management Science, 70(1), 164--186. https://doi.org/10.1287/mnsc.2022.4640

work page doi:10.1287/mnsc.2022.4640 2024

[23] [23]

Center for Research in Security Prices, LLC. (2026). CRSP US Stock Databases [Data set]. Accessed via Wharton Research Data Services, June 5, 2026. https://www.crsp.org/research/

2026

[24] [24]

French, K. R. (2026). Kenneth R. French Data Library [Data set]. Accessed May 5, 2026. https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html

2026

[25] [25]

Global-q.org. (2026). Factors and testing portfolios [Data set]. Accessed May 5, 2026. https://global-q.org/factors.html

2026

[26] [26]

Open Source Asset Pricing. (2026). Open Source Asset Pricing [Data set]. Accessed June 24, 2026. https://www.openassetpricing.com

2026