Designing Recommendation Exposure and Favorite Lists: A Field Experiment in a Spot-Work Platform

Kazuki Sekiya; Shunsuke Ozeki; Shunya Noda; Suguru Otani; Yuki Fujii; Yuki Komatsu

arXiv: 2606.17397 · v3 · pith:N6OLVJ3Unew · submitted 2026-06-16 · econ.GN · cs.GT · cs.IR · q-fin.EC


Pith reviewed 2026-06-26 22:15 UTC · model grok-4.3

classification: econ.GN · cs.GT · cs.IR · q-fin.EC

keywords: recommendation systems · field experiment · spot work · matching platforms · exposure control · TEC · favorite lists


The pith

Thresholded eligibility control reallocates exposure to job templates based on unfilled capacity and raises matching rates on a spot-work platform.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines recommender design for platforms matching workers to short-lived spot jobs, where favoring popular templates can leave other openings unfilled. It introduces thresholded eligibility control (TEC) to reallocate exposure according to recent posting activity and unfilled capacity rather than predicted favoriting alone. Simulations calibrated to platform data show the per-round job-finding rate rising from 57.6 percent to 70.0 percent. A prefecture-level randomized field experiment finds higher realized matches, greater exposure per active template, fewer low-exposure templates, and better impression-level favoriting plus downstream matching.
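For scale, the two simulation rates quoted above imply the following gain. This is arithmetic on the reported figures only, not an additional result from the paper.

```latex
\[
\Delta = 70.0\% - 57.6\% = 12.4 \text{ percentage points},
\qquad
\frac{70.0 - 57.6}{57.6} \approx 0.215
\quad (\text{about a } 21.5\% \text{ relative increase}).
\]
```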

Core claim

Thresholded eligibility control (TEC) is a parallelizable mechanism that reallocates template exposure based on posting activity and unfilled capacity; when applied to Timee data it raises the per-round job-finding rate from 57.6 percent to 70.0 percent in simulation and, in a randomized field experiment, increases realized matches, exposure per template, and favoriting while reducing the share of low-exposure templates.

What carries the argument

Thresholded eligibility control (TEC): a mechanism that reallocates template exposure based on recent posting activity and unfilled capacity, so that recommendations track actual labor demand.
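One way to picture the idea is an eligibility filter applied before ranking by predicted favoriting. The sketch below is an illustration under assumptions, not the paper's algorithm: the `Template` fields, the two threshold parameters, and the function names `greedy_top_k` and `tec_top_k` are all hypothetical, and the actual TEC rule may combine activity and capacity differently.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Template:
    template_id: str
    predicted_favoriting: float  # hypothetical model score for one worker
    recent_posts: int            # shifts posted in some recent window (assumed)
    unfilled_slots: int          # open capacity not yet matched (assumed)


def greedy_top_k(templates: List[Template], k: int) -> List[str]:
    """Baseline: rank purely by predicted favoriting."""
    ranked = sorted(templates, key=lambda t: t.predicted_favoriting, reverse=True)
    return [t.template_id for t in ranked[:k]]


def tec_top_k(
    templates: List[Template],
    k: int,
    min_recent_posts: int = 1,    # assumed threshold, not from the paper
    min_unfilled_slots: int = 1,  # assumed threshold, not from the paper
) -> List[str]:
    """Keep only templates that clear both thresholds, then rank by score.

    Eligibility uses template-level statistics only, so each worker's list
    can be built independently of every other worker's list.
    """
    pool = [
        t for t in templates
        if t.recent_posts >= min_recent_posts
        and t.unfilled_slots >= min_unfilled_slots
    ]
    ranked = sorted(pool, key=lambda t: t.predicted_favoriting, reverse=True)
    return [t.template_id for t in ranked[:k]]


if __name__ == "__main__":
    # Invented data: A is popular but inactive, D is active but already full.
    demo = [
        Template("A", 0.9, recent_posts=0, unfilled_slots=0),
        Template("B", 0.6, recent_posts=3, unfilled_slots=2),
        Template("C", 0.4, recent_posts=5, unfilled_slots=4),
        Template("D", 0.8, recent_posts=2, unfilled_slots=0),
    ]
    print(greedy_top_k(demo, 2))  # ['A', 'D']
    print(tec_top_k(demo, 2))     # ['B', 'C']
```

In this toy case the score-only ranking spends both slots on templates with no open capacity, while the thresholded version redirects them to templates that can actually absorb workers.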

If this is right

Increases realized matches and exposure per active template.
Reduces the share of low-exposure templates.
Improves impression-level favoriting and downstream matching.
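The exposure outcomes listed above can be made concrete with a small calculation. This is a sketch under assumptions: the function name `exposure_summary`, the `low_cutoff` parameter, and the definition of "low exposure" as fewer than `low_cutoff` impressions are illustrative choices, since the page does not state the paper's exact definitions.

```python
from typing import Dict, Iterable, List


def exposure_summary(
    impressions: Iterable[str],  # one template id per recommendation shown
    active_ids: List[str],       # templates counted as active (assumed given)
    low_cutoff: int,             # assumed definition of "low exposure"
) -> Dict[str, float]:
    """Return mean impressions per active template and the low-exposure share."""
    counts = {tid: 0 for tid in active_ids}
    for tid in impressions:
        if tid in counts:  # ignore impressions on templates that are not active
            counts[tid] += 1
    n = len(counts)
    if n == 0:
        return {"exposure_per_active": 0.0, "low_exposure_share": 0.0}
    low = sum(1 for c in counts.values() if c < low_cutoff)
    return {
        "exposure_per_active": sum(counts.values()) / n,
        "low_exposure_share": low / n,
    }
```

Comparing these two numbers between treated and control regions is one simple way to read the "more exposure per active template, fewer low-exposure templates" claims.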

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same exposure-control logic could be tested on other gig platforms where recommendations shape access to time-sensitive tasks.
Dynamic updates to the eligibility thresholds could be compared against static rules in follow-on experiments to measure robustness to demand shifts.
Longer-run data would reveal whether firms adjust their posting behavior in response to changed worker visibility.

Load-bearing premise

Reallocating exposure solely on the basis of recent posting activity and unfilled capacity will not create new bottlenecks or reduce overall platform participation.

What would settle it

A measurable drop in total platform participation or an increase in unfilled jobs after rollout would falsify the claim that the mechanism improves matching without side effects.

Figures

Figures reproduced from arXiv: 2606.17397 by Kazuki Sekiya, Shunsuke Ozeki, Shunya Noda, Suguru Otani, Yuki Fujii, Yuki Komatsu.

**Figure 2.** Template Recommendation Flow in the Timee App

**Figure 3.** Template-Level Exposure and Subscriber Distributions in the Baseline Simu

**Figure 4.** Exposure Allocation and Per-Round Fill Rates by Template Activity

**Figure 5.** Per-Round Job-Finding Rates across Market Sizes and Worker-to-Template

**Figure 6.** Distribution-Regression DID Treatment Effects

**Figure 7.** Transition from Greedy to TEC in the Simulation. Note: Transition simulations from the Greedy steady state. At round 0, the platform either continues with Greedy or switches to TEC. Each panel plots the cumulative mean of the indicated outcome from round 0 through horizon t, averaged over 200 simulated sample paths. Shaded areas indicate ±2 standard deviations across simulated sample paths.

**Figure 8.** Estimated Recommendation-Position Fixed Effects on Favoriting Probability

**Figure 9.** Observed vs. Counterfactual CDFs of Template-Day-Level Outcomes

Original abstract

How should recommender systems be designed when recommendations shape access to scarce, short-lived opportunities? We study this question in a production setting: Timee, Japan's largest platform for spot work, where workers favorite job templates and receive notifications when firms post shifts from those templates. Maximizing predicted favoriting can generate misdirected concentration: recommendations accumulate on popular templates that create few viable job openings, while templates with unmet labor demand receive too little exposure. We design exposure-control mechanisms for favorite-list management, reallocating template exposure based on posting activity and unfilled capacity. The proposed recommender, thresholded eligibility control (TEC), is fully parallelizable and suitable for large-scale digital platforms. In simulations calibrated to Timee data, TEC raises the per-round job-finding rate from 57.6% to 70.0%. A prefecture-level randomized field experiment increases realized matches and exposure per active template, reduces the share of low-exposure templates, and improves impression-level favoriting and downstream matching.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

TEC gives a practical, parallelizable rule for rebalancing template exposure on a gig platform, and the field experiment is the real contribution here.


The paper introduces thresholded eligibility control (TEC), which ties recommendation exposure to recent posting volume and unfilled shifts on Timee. In data-calibrated simulations this lifts the per-round job-finding rate from 57.6% to 70.0%. A prefecture-level randomized field experiment then reports gains in realized matches and exposure per active template, fewer low-exposure templates, and better impression-level favoriting and downstream matching.

What the work does cleanly is move the test out of pure simulation into a live two-sided market. The rule itself is simple enough to run at scale without central coordination, and the authors measure effects on both sides of the market. That combination is not common in the recommender literature they cite.

The soft spot is the missing experimental detail. The abstract and stress-test note give no sample sizes, randomization protocol, or pre-analysis plan, so the numerical claims cannot be checked for robustness or for how much the baseline 57.6% figure depends on the same data used for calibration. The claim of no participation drop or new bottlenecks is asserted but not directly tested with auxiliary metrics.

This is useful for anyone running or studying labor platforms where recommendations allocate scarce shifts. It is not a deep theoretical advance, but the field evidence is the part worth referee time. I would send it out for review so the authors can supply the missing design and power details.

Referee Report

2 major / 0 minor

Summary. The paper proposes thresholded eligibility control (TEC), a parallelizable exposure-reallocation mechanism for recommender systems on spot-work platforms. It reallocates template exposure according to recent posting activity and unfilled capacity to reduce misdirected concentration on popular but low-opportunity templates. Simulations calibrated to Timee data report an increase in per-round job-finding rate from 57.6% to 70.0%. A prefecture-level randomized field experiment is reported to increase realized matches and per-template exposure, reduce the share of low-exposure templates, and improve impression-level favoriting and downstream matching.

Significance. If the quantitative claims hold, the work contributes a practical, scalable mechanism for managing exposure in matching platforms where recommendations affect access to scarce, time-sensitive opportunities. The combination of a field experiment with calibrated simulations provides direct evidence on both realized and counterfactual performance, which is rare in this domain. The mechanism's emphasis on observable posting activity and capacity makes it implementable without requiring new data collection.

major comments (2)

[Abstract] The simulation reports a rise from 57.6% to 70.0% in the per-round job-finding rate, but provides no information on how the 57.6% baseline is computed from the same Timee data used for calibration. This creates a circularity that directly affects the magnitude of the reported improvement and must be clarified with explicit out-of-sample checks or hold-out validation.
[Abstract] The prefecture-level randomized field experiment is described only at a high level; the randomization procedure, number of prefectures or templates assigned, sample size, and whether a pre-analysis plan was registered are not stated. These details are load-bearing for interpreting the reported increases in realized matches, exposure per active template, and downstream matching.
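To show what kind of design detail the second comment is asking for, here is a minimal two-group, two-period difference-in-differences on prefecture-level outcomes. It is a generic sketch with invented numbers and a hypothetical function name `did_estimate`; Figure 6 indicates the paper uses a distribution-regression DID, which this simple mean-based version does not reproduce.

```python
from statistics import mean
from typing import List, Tuple

# Each row: (treated prefecture?, post-rollout period?, outcome such as matches)
Row = Tuple[bool, bool, float]


def did_estimate(rows: List[Row]) -> float:
    """(treated post - treated pre) - (control post - control pre), on means."""

    def cell_mean(treated: bool, post: bool) -> float:
        vals = [y for t, p, y in rows if t == treated and p == post]
        if not vals:
            raise ValueError("every treatment-by-period cell needs data")
        return mean(vals)

    treated_change = cell_mean(True, True) - cell_mean(True, False)
    control_change = cell_mean(False, True) - cell_mean(False, False)
    return treated_change - control_change


if __name__ == "__main__":
    # Invented data for illustration only.
    rows = [
        (True, False, 10.0), (True, False, 12.0),
        (True, True, 15.0), (True, True, 17.0),
        (False, False, 9.0), (False, False, 11.0),
        (False, True, 11.0), (False, True, 13.0),
    ]
    print(did_estimate(rows))  # 3.0
```

With few prefectures, the number of treated and control units drives how precisely such an estimate is measured, which is why the referee treats those counts as load-bearing.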

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for these constructive comments on the abstract. We respond to each point below and indicate planned revisions.


Referee: [Abstract] The simulation reports a rise from 57.6% to 70.0% in the per-round job-finding rate, but provides no information on how the 57.6% baseline is computed from the same Timee data used for calibration. This creates a circularity that directly affects the magnitude of the reported improvement and must be clarified with explicit out-of-sample checks or hold-out validation.

Authors: We agree the abstract omits this detail. The 57.6% baseline reflects the observed per-round job-finding rate under the platform's existing recommendation policy in the calibration sample. To eliminate any appearance of circularity, we will revise the abstract to briefly state the baseline construction and add an explicit description of the hold-out validation procedure (including the temporal split used) in the simulation section of the main text. Revision: yes.

Referee: [Abstract] The prefecture-level randomized field experiment is described only at a high level; the randomization procedure, number of prefectures or templates assigned, sample size, and whether a pre-analysis plan was registered are not stated. These details are load-bearing for interpreting the reported increases in realized matches, exposure per active template, and downstream matching.

Authors: We agree the abstract is high-level. The full manuscript details the prefecture-level randomization, the number of treated and control prefectures, template and worker sample sizes, and the exact outcome measures. We will expand the abstract to report the number of prefectures and overall sample size. A pre-analysis plan was not registered for this platform-partnered experiment; we will add an explicit statement to that effect in both the abstract and the experimental design section. Revision: yes.

Circularity Check

0 steps flagged

No significant circularity identified


The paper's central results derive from a prefecture-level randomized field experiment measuring realized matches, exposure, and favoriting under the TEC mechanism, plus standard counterfactual simulations calibrated to observed Timee data. No quoted equations, self-citations, or steps reduce the reported improvements to fitted inputs by construction; the baseline job-finding rate and intervention effects are distinct empirical quantities. The design reallocates exposure using observable posting activity and capacity without redefining outcomes via the same parameters. This is a self-contained empirical evaluation with no load-bearing self-definitional or renaming patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the design implicitly relies on standard platform-economics assumptions about worker response to notifications and firm posting behavior.

pith-pipeline@v0.9.1-grok · 5730 in / 1141 out tokens · 22970 ms · 2026-06-26T22:15:53.323474+00:00 · methodology


