pith. machine review for the scientific record. sign in

arxiv: 2604.15564 · v1 · submitted 2026-04-16 · 💰 econ.EM

Recognition: unknown

Mobility Behaviour of Immigrants in Canada: Analyzing Mode Choice Using GPS Panel Data and Mixed Logit Models

Bilal Farooq, Tareq Alsaleh, Zachary Patterson

Authors on Pith no claims yet

Pith reviewed 2026-05-10 09:10 UTC · model grok-4.3

classification 💰 econ.EM
keywords immigrant mobilitymode choicemixed logit modelsGPS panel dataintegration indexvalue of travel timepublic transitCanada
0
0 comments X

The pith

New immigrants in Canada show about 66 percent lower sensitivity to in-vehicle travel time than Canadian-born residents, with higher integration linked to greater car use.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper analyzes GPS-tracked trips from 100 participants totaling over 80,000 observations to compare travel mode choices between immigrants and Canadian-born people using mixed logit models. It establishes that immigrants in the sample are less sensitive to travel time costs and that better social and economic integration correlates with choosing cars over public transit. This pattern would matter for urban planning because it implies that transit systems in cities with growing immigrant populations may need targeted improvements in access times rather than just fare adjustments to maintain ridership. The work combines real observed behavior with hypothetical choices for new options like e-mobility and checks the models against unseen data for reliability. If the findings hold, integration processes could gradually increase overall car dependence in these communities.

Core claim

The joint revealed and stated preference mixed logit models applied to the panel data indicate that new immigrants exhibit lower sensitivity to in-vehicle travel time than Canadian-born respondents, producing a value of travel time for immigrants that is approximately 66 percent lower with a consistent ratio of 0.34 across specifications. Higher levels of integration, measured by a composite index of economic, social, civic, and health dimensions, are associated with reduced transit use, where a one standard deviation increase lowers the probability of choosing public transit by about five percentage points and increases car reliance. The framework also shows that unobserved e-mobility modes

What carries the argument

The composite integration index paired with mixed logit models in a joint revealed-stated preference setup, which captures preference heterogeneity and allows testing of unobserved alternatives on equal footing with observed modes.

If this is right

  • Transit policies in immigrant-receiving cities should prioritize service quality improvements such as reduced access time, which the models show is about three times more effective than fare reductions at increasing transit use among immigrants.
  • Emerging e-mobility options can compete directly with existing modes based on level-of-service attributes alone, without any built-in preference penalty.
  • As integration levels rise, the shift toward car reliance suggests that land-use and transport planning must anticipate changing demand patterns in these populations.
  • The high out-of-sample prediction accuracy of 80 to 82 percent supports using these models for scenario testing of new mobility services.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the lower time sensitivity persists, cities could experience rising congestion and emissions as immigrant cohorts integrate over time unless transit alternatives improve substantially.
  • Applying the same GPS panel approach in other immigrant-receiving countries would test whether the 0.34 ratio and integration-car link are specific to Canadian conditions or more general.
  • The findings point to a potential feedback loop where economic integration enables car ownership, which in turn may slow further social integration if transit networks weaken.

Load-bearing premise

The 100-participant sample with over 80,000 trips is representative of broader immigrant and Canadian-born populations, and the composite integration index validly measures the intended dimensions without omitted factors.

What would settle it

Re-estimating the models on a larger, randomly selected GPS panel dataset from multiple Canadian cities and checking whether the immigrant-to-Canadian-born value of travel time ratio stays near 0.34 and the integration effect on transit probability remains around five percentage points.

Figures

Figures reproduced from arXiv: 2604.15564 by Bilal Farooq, Tareq Alsaleh, Zachary Patterson.

Figure 1
Figure 1. Figure 1: BDMobility Application User Interface and Layout [PITH_FULL_IMAGE:figures/full_fig_p006_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Data processing pipeline from raw GPS traces to estimation-ready dataset. Stages are numbered sequentially; [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Worked example: transit journey decomposition and choice-set generation for a single observed bus trip [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Spatial distribution of recorded trip origins across the two study areas: (a) Greater Toronto Area and (b) [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Sociodemographic profile of study participants by immigrant status. [PITH_FULL_IMAGE:figures/full_fig_p017_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: immigrant status and integration indicators: [PITH_FULL_IMAGE:figures/full_fig_p018_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Recorded trip diary summary: a mode share by trip count, b by distance, c by travel time, d average trip distance and duration by mode, e mode share by immigrant status, and f trip distance distribution (box = IQR; line = median; diamond = mean). The SP scenarios were triggered dynamically within the app based on participants’ actual observed trips, presenting hypothetical mode-switching alternatives durin… view at source ↗
Figure 8
Figure 8. Figure 8: Stated preference scenarios and behavioural shift: [PITH_FULL_IMAGE:figures/full_fig_p020_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Parameter estimates across all four model specifications with 95% confidence intervals from robust standard [PITH_FULL_IMAGE:figures/full_fig_p024_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Integration effect on mode choice probabilities (Model 3, representative 10 km work commute): [PITH_FULL_IMAGE:figures/full_fig_p026_10.png] view at source ↗
read the original abstract

We examine these relationships using a panel dataset of more than 80,000 trip observations from 100 participants through a custom-built mobile application. A joint revealed preference (RP) and stated preference (SP) framework is used to estimate multinomial logit (MNL) and mixed logit (MXL) models. The level of integration is represented through a composite index capturing economic, social, civic, and health dimensions of integration. Results indicate two distinct patterns. First, the estimated models suggest that new immigrants in the sample exhibit lower sensitivity to in-vehicle travel time than Canadian-born respondents. The mixed logit specification suggests that the value of travel time for the sampled immigrants is approximately 66% lower than that of Canadian-born residents, with a immigrant-to-Canadian-born ratio of 0.34 that is consistent across both MXL specifications. Second, higher levels of integration are associated with reduced transit use and greater car reliance. A one standard deviation increase in the integration index decreases the probability of choosing public transit by approximately five percentage points. The joint RP-SP specification allows the inclusion of emerging e-mobility alternatives not yet observed in revealed behaviour; these face no inherent preference penalty, competing purely on their level-of-service attributes. Out-of-sample validation using five-fold cross-validation produces a mean prediction accuracy between 80% and 82% across model specifications. The findings suggest that transit policies in immigrant-receiving cities could prioritize service quality improvements, particularly reductions in access time, which are approximately three times more effective than fare reductions in shifting immigrants toward transit use.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 3 minor

Summary. The manuscript analyzes mode choice behavior of immigrants versus Canadian-born residents in Canada using a GPS panel dataset of over 80,000 trips from 100 participants. It estimates multinomial logit and mixed logit models in a joint revealed preference (RP) and stated preference (SP) framework, incorporating a composite integration index across economic, social, civic, and health dimensions. Central claims are that the value of travel time (VTT) for sampled immigrants is approximately 66% lower than for Canadian-born residents (immigrant-to-Canadian-born ratio of 0.34, consistent across MXL specifications) and that a one standard deviation increase in the integration index reduces the probability of choosing public transit by approximately five percentage points. The joint RP-SP setup allows inclusion of e-mobility alternatives, and five-fold cross-validation yields 80-82% out-of-sample prediction accuracy.

Significance. If the results hold, the paper offers policy-relevant evidence on how immigrant integration shapes mobility preferences, suggesting that transit agencies in immigrant-receiving cities should prioritize service quality improvements (particularly access time reductions, claimed to be three times more effective than fare reductions) over price incentives. The use of high-frequency GPS panel data, joint RP-SP estimation, and explicit out-of-sample validation via cross-validation are strengths that enhance credibility and reproducibility compared to typical cross-sectional studies in this area.

major comments (3)
  1. [Mixed Logit Models and Results] Mixed logit results: The headline immigrant-to-Canadian-born VTT ratio of 0.34 is reported as consistent across two MXL specifications, but with random parameters identified from only 100 independent individuals (despite the large trip count), no standard errors, delta-method variances, or bootstrap intervals are provided for the derived ratio. This leaves open whether the ratio is statistically distinguishable from 1 or sensitive to the assumed mixing distribution (normal/lognormal) and simulation draws.
  2. [Integration Index and Choice Probabilities] Integration index and marginal effects: The claim that a one-SD increase in the composite integration index decreases transit choice probability by five percentage points rests on the index construction, but no details are given on variable selection, weighting scheme, or robustness to alternative aggregations. This is load-bearing for the second main finding and could be affected by omitted variable bias in the dimensions captured.
  3. [Data Description and Estimation] Sample and identification: The effective sample for between-person heterogeneity in the mixed logit is N=100; the paper does not report sensitivity analyses (e.g., alternative distributional assumptions, leave-one-out cross-validation for parameters, or checks on the stability of the 0.34 ratio) to address potential imprecision in the mixing distribution.
minor comments (3)
  1. [Model Specification] The full model equations for the mixed logit (including how random parameters are specified for time and cost, and how RP and SP data are jointly estimated) are not presented; explicit equations would aid replication.
  2. [Value of Travel Time Calculation] Clarify whether the 66% lower VTT (and the 0.34 ratio) is computed from means or medians of the random parameter distributions, as this affects interpretation given the panel structure.
  3. [Policy Implications] The policy statement that access time reductions are approximately three times more effective than fare reductions should be tied directly to specific marginal effects or elasticities from the estimated models.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive comments, which highlight important areas for strengthening the manuscript's transparency and robustness. We address each major comment below and will revise the paper accordingly to incorporate additional analyses and details.

read point-by-point responses
  1. Referee: Mixed logit results: The headline immigrant-to-Canadian-born VTT ratio of 0.34 is reported as consistent across two MXL specifications, but with random parameters identified from only 100 independent individuals (despite the large trip count), no standard errors, delta-method variances, or bootstrap intervals are provided for the derived ratio. This leaves open whether the ratio is statistically distinguishable from 1 or sensitive to the assumed mixing distribution (normal/lognormal) and simulation draws.

    Authors: We agree that uncertainty measures for the derived VTT ratio are essential for assessing its reliability. In the revised manuscript, we will compute and report standard errors for the 0.34 ratio using the delta method, supplemented by bootstrap confidence intervals. We will also present results under alternative mixing distributions (e.g., lognormal for time parameters) and varying simulation draws to evaluate sensitivity, while noting that the panel structure with over 80,000 trips supports identification despite the 100 individuals. revision: yes

  2. Referee: Integration index and marginal effects: The claim that a one-SD increase in the composite integration index decreases transit choice probability by five percentage points rests on the index construction, but no details are given on variable selection, weighting scheme, or robustness to alternative aggregations. This is load-bearing for the second main finding and could be affected by omitted variable bias in the dimensions captured.

    Authors: The referee is correct that greater transparency on the integration index is required. We will add a dedicated subsection (and appendix) detailing the variable selection for each dimension (economic, social, civic, health), the weighting scheme (e.g., equal weights or principal components), and robustness checks using alternative aggregations or subsets of dimensions. This will directly address potential biases and strengthen the marginal effects interpretation. revision: yes

  3. Referee: Sample and identification: The effective sample for between-person heterogeneity in the mixed logit is N=100; the paper does not report sensitivity analyses (e.g., alternative distributional assumptions, leave-one-out cross-validation for parameters, or checks on the stability of the 0.34 ratio) to address potential imprecision in the mixing distribution.

    Authors: We acknowledge the concern regarding the sample size for heterogeneity. In the revision, we will include sensitivity analyses such as alternative distributional assumptions for random parameters, stability checks on the VTT ratio across specifications, and where computationally feasible, leave-one-out or k-fold validation for key parameters. These will be reported alongside the main results to demonstrate robustness. revision: yes

Circularity Check

0 steps flagged

No circularity: results are direct statistical estimates from choice data

full rationale

The paper estimates MNL and MXL models on RP/SP panel data to obtain coefficients for in-vehicle time, cost, and the integration index. The reported VTT ratio (0.34) is the ratio of those estimated coefficients, which is the standard definition of value of travel time in discrete choice models and does not reduce to any input by the paper's own equations. The integration effect on transit probability is likewise a direct marginal effect from the fitted model. No self-definitional loops, fitted parameters renamed as predictions, or load-bearing self-citations appear in the derivation chain. Out-of-sample cross-validation is performed on held-out data, confirming the results are not tautological with the estimation sample. This is a conventional applied econometrics workflow whose central claims remain independent of the paper's inputs.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

Central claims rest on standard discrete-choice assumptions plus fitted parameters from the mixed logit model applied to the 80k-trip panel; no new physical entities are postulated.

free parameters (2)
  • Random parameters in mixed logit
    Distribution parameters for travel time, cost, and integration effects estimated from data
  • Integration index construction weights
    Composite index across economic, social, civic, health dimensions; exact weighting or aggregation rule fitted or chosen
axioms (2)
  • domain assumption Individuals maximize random utility when choosing travel mode
    Core assumption of all MNL and MXL specifications
  • domain assumption Stated preference responses are consistent with revealed preference behavior up to scale
    Required for joint RP-SP estimation

pith-pipeline@v0.9.0 · 5590 in / 1398 out tokens · 33473 ms · 2026-05-10T09:10:42.708127+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

3 extracted references · 3 canonical work pages

  1. [1]

    doi: 10.1016/j.trip.2023.100852. T. Alsaleh, I. Otchere, B. Farooq, Z. Patterson, and E. Cherchi. Do perceptions migrate with us? an investigation using automated travel scenarios generation from semi-passive travel diary recording. Transportation Research Procedia, 2025. In press. H.Asgari, N.Zaman, andX.Jin. Understandingimmigrants’modechoicebehaviorinF...

  2. [2]

    doi: 10.1016/j.trpro.2017.05.319. B. Azoulay and Z. Patterson. Towards the standardization of reporting in smartphone travel surveys: the development and application of the smartphone survey reporting guidelines (ssrgs).Transporta- tion research procedia, 76:574–585, 2024. M. Ben-Akiva and T. Morikawa. Estimation of switching models from revealed preferen...

  3. [3]

    URLhttps://assets.metrolinx.com/image/upload/v1663237565/Documents/Metrolinx/ Metrolinx-Business-Case-Guidance-Volume-2.pdf. M. M. Monteiro, J. de Abreu e Silva, S. Haustein, and J. Pinho de Sousa. Urban travel behaviour adaptation of temporary transnational residents.Journal of Transport Geography, 90:102935, 2021. doi: 10.1016/j.jtrangeo.2020.102935. I....