pith. machine review for the scientific record. sign in

arxiv: 2605.11569 · v1 · submitted 2026-05-12 · 💻 cs.AI · cs.LG

Recognition: 2 theorem links

· Lean Theorem

Dual-Temporal LSTM with Hybrid Attention for Airline Passenger Load Factor Forecasting: Integrating Intra-Flight and Inter-Flight Booking Dynamics

Authors on Pith no claims yet

Pith reviewed 2026-05-13 01:44 UTC · model grok-4.3

classification 💻 cs.AI cs.LG
keywords airline demand forecastingdual LSTMhybrid attentionload factor predictionbooking time seriesrevenue managementintra-flight dynamicsinter-flight patterns
0
0 comments X

The pith

A dual-stream LSTM with hybrid attention improves airline passenger load factor forecasts by jointly processing intra-flight booking accumulation and inter-flight historical patterns.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Airline revenue management requires short-term forecasts of seat occupancy, yet most systems examine booking data along only one time axis: either the build-up of reservations for a single upcoming flight or the historical profile of the same route. This paper demonstrates that feeding both sequences into a shared LSTM architecture with hybrid attention yields more accurate load-factor predictions, which remain stable even when aircraft capacity changes. On reservation records from Biman Bangladesh Airlines the hybrid model records a mean absolute error of 2.8167 and an R-squared value of 0.9495, surpassing single-stream LSTMs, tree-based methods, and earlier dual-LSTM designs. The same architecture generalizes across domestic versus international, direct versus transit, and short- versus long-haul flights, and the carrier has placed the method into live operations.

Core claim

The central claim is that a dual-temporal LSTM integrated with attention simultaneously processes a horizontal intra-flight sequence of booking accumulation over the days before departure and a vertical inter-flight sequence of booking patterns at fixed days-before-departure offsets across historical flights. Multiple variants that combine self-attention, cross-attention, and hybrid attention with concatenation, residual, and gated fusion are evaluated on real reservation data; the hybrid-attention model achieves a mean absolute error of 2.8167 and an R-squared of 0.9495, outperforming single-stream baselines, tree-based models, and three prior dual-LSTM architectures on the same dataset, as

What carries the argument

Dual-stream LSTM with hybrid attention that fuses one intra-flight accumulation sequence and one inter-flight fixed-offset sequence.

If this is right

  • Load-factor predictions remain usable when planned aircraft type or total seat capacity changes, because the model does not forecast absolute passenger counts.
  • The same architecture generalizes across domestic versus international, direct versus transit, high versus low frequency, and short versus long-haul routes.
  • Hybrid attention fusion outperforms both single-stream LSTMs and earlier dual-LSTM designs on the identical Biman Bangladesh Airlines dataset.
  • Real-world deployment at an operating airline demonstrates immediate applicability to revenue-management systems.
  • Forecasts can be produced for any future departure date once the two input sequences are assembled from the reservation system.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same dual-stream pattern could be applied to hotel occupancy or event-ticketing data, where both per-item accumulation and cross-item historical curves exist.
  • Adding a lightweight module to detect sudden external events would address the current reliance on the two streams alone.
  • Re-training and testing the identical architecture on datasets from carriers in other regions would test whether the reported gains are specific to one airline's booking system.
  • Extending the vertical stream to include variable rather than fixed offsets could capture longer-term seasonal effects not yet modeled.

Load-bearing premise

The two chosen temporal streams together capture the dominant booking dynamics without additional exogenous features or explicit modeling of external shocks.

What would settle it

A head-to-head test on reservation data from a second airline in which the hybrid dual-stream model shows higher mean absolute error than a carefully tuned single-stream LSTM would falsify the claim that the dual architecture is required for superior performance.

Figures

Figures reproduced from arXiv: 2605.11569 by ASM Nazrul Islam, Joydeb Kumar Sana, Md. Hasanul Kabir, Md. Liakot Ali.

Figure 1
Figure 1. Figure 1: Pipeline of proposed approach. approach aligns with traditional revenue management, which focuses on available seats. However, this method introduces a dependency on aircraft capacity. If the aircraft type changes, the model’s prediction may no longer be valid, limiting the model’s applicability. Passenger Load Factor (PLF), defined as the ratio of revenue passenger kilometers to available seat kilometers,… view at source ↗
Figure 2
Figure 2. Figure 2: Feature selection pipeline [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: DLSTM-HA model architecture. increase the span of booking history captured. Approach 1 produced consistently better performance across all model variants and was adopted for the final architecture. Both sequences use a window size of 3 (H=3, V =3), determined through a systematic grid search over candidate sizes ranging from 3 to 18, evaluated on validation loss across all four base model architectures. Th… view at source ↗
Figure 4
Figure 4. Figure 4: Trends of MAE (top) and R 2 (bottom) across dynamic prediction days (D-21 to D-0). accumulates, flights become harder to predict in the final week before departure. This pattern reflects the influence of last-minute booking dynamics: cancellations, upgrades, group bookings, and no-shows, which are harder to predict despite having complete booking information. The R2 trend follows a similarly non-monotonic … view at source ↗
Figure 5
Figure 5. Figure 5: Metric values for Domestic vs. International flights. [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Metric values for Direct vs. Transit flights. [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Metric values for High vs. Low frequency flights. [PITH_FULL_IMAGE:figures/full_fig_p010_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Metric values for Short vs. Mid vs. Long Haul flights. [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗
read the original abstract

Accurate short-term demand forecasting is crucial to airline revenue management, yet most existing systems fail to meet this need because current models treat booking data as a single temporal dimension, either the accumulation of bookings for a specific flight or the historical booking profile of the same route. This unidimensional view discards information carried by the other temporal stream and forecasting absolute passenger counts introduces a further operational fragility when change in planned aircraft type alters total seat capacity. This study addresses both limitations. A dual-stream Long Short-Term Memory (LSTM) integrated with attention framework is proposed that simultaneously processes two complementary input sequences: a horizontal sequence capturing intra-flight booking accumulation over the days preceding departure, and a vertical sequence capturing inter-flight booking patterns at fixed days-before-departure offsets across historical flights. Multiple dual-stream architectural variants, combining self-attention, cross-attention, and hybrid attention with concatenation, residual, and gated fusion strategies, are developed and evaluated. Experiments on real-world reservation data from the national airline of Bangladesh, Biman Bangladesh Airlines (BBA), demonstrate that the proposed hybrid model achieves a Mean Absolute Error of 2.8167 and a coefficient of determination ($R^{2}$) of 0.9495, outperforming single-stream baselines, tree-based models, and three prior dual-LSTM architectures applied to the same data. Validation across four flight category pairs; domestic versus international, direct versus transit, high versus low frequency, and short versus mid versus long haul confirms that the model generalizes across operationally diverse route types. Biman Bangladesh Airlines (BBA) has officially integrated this methodology into its operations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper claims to introduce a dual-temporal LSTM with hybrid attention for forecasting airline passenger load factors by processing intra-flight booking accumulation and inter-flight patterns. On Biman Bangladesh Airlines data, the hybrid model achieves MAE 2.8167 and R² 0.9495, outperforming baselines and generalizing across four route categories, with claimed operational integration.

Significance. If the results hold under scrutiny, this work could meaningfully advance short-term demand forecasting in revenue management by capturing complementary temporal information from booking data. The specific performance numbers and airline adoption suggest potential for practical impact, though the absence of robustness checks to external events limits the assessed significance.

major comments (2)
  1. Abstract and Results: The reported metrics (MAE of 2.8167 and R² of 0.9495) are presented without error bars, ablation studies on the attention fusion strategies, or statistical significance tests against baselines, which is required to substantiate the outperformance and generalization claims across route pairs.
  2. Model and Experiments: The central assumption that the two temporal streams capture dominant booking dynamics without exogenous features is not tested against external shocks such as promotions or disruptions; this is load-bearing for the claim that the model generalizes and has been operationally adopted.
minor comments (1)
  1. Notation: Clarify whether the load factor is expressed as a percentage or fraction, as the MAE value of 2.8167 is consistent with percentage points.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which highlight opportunities to strengthen the empirical support for our claims. We address each major comment below and will incorporate the necessary revisions.

read point-by-point responses
  1. Referee: [—] Abstract and Results: The reported metrics (MAE of 2.8167 and R² of 0.9495) are presented without error bars, ablation studies on the attention fusion strategies, or statistical significance tests against baselines, which is required to substantiate the outperformance and generalization claims across route pairs.

    Authors: We agree that error bars, ablation studies, and statistical significance tests are important for rigorously substantiating the reported performance. In the revised manuscript, we will add error bars derived from multiple independent training runs with different random seeds and report standard deviations alongside the mean metrics. We will include a dedicated ablation subsection comparing all dual-stream variants (self-attention, cross-attention, and the three hybrid fusion strategies) with quantitative results. We will also apply paired statistical tests (e.g., Wilcoxon signed-rank test) across the route categories to establish significance of improvements over the single-stream, tree-based, and prior dual-LSTM baselines. These additions will be reflected in the results tables, figures, and a brief mention in the abstract. revision: yes

  2. Referee: [—] Model and Experiments: The central assumption that the two temporal streams capture dominant booking dynamics without exogenous features is not tested against external shocks such as promotions or disruptions; this is load-bearing for the claim that the model generalizes and has been operationally adopted.

    Authors: The dual-temporal architecture is deliberately restricted to booking-curve inputs because these are the only real-time signals available to revenue-management systems at the short-term forecasting horizon. Historical inter-flight patterns implicitly encode the effects of past promotions and disruptions that occurred in the training data. We acknowledge that explicit robustness testing against labeled external events is absent. In revision we will (i) add a limitations paragraph discussing this modeling choice and the potential benefit of future exogenous features, (ii) provide additional context on Biman Bangladesh Airlines’ internal validation and deployment process (without disclosing proprietary details), and (iii) qualify the generalization and adoption statements to reflect the scope of the current experiments. We do not claim the model is invariant to all external shocks; rather, the cross-category results demonstrate practical utility on the observed distribution of routes. revision: partial

Circularity Check

0 steps flagged

No circularity: empirical results from held-out training data, not algebraic reduction

full rationale

The paper introduces a dual-stream LSTM architecture with hybrid attention to process intra-flight and inter-flight booking sequences, then reports performance (MAE 2.8167, R² 0.9495) obtained by standard supervised training and evaluation on real reservation data from Biman Bangladesh Airlines. These metrics arise from fitting the model parameters to training splits and measuring generalization on held-out test data, rather than any derivation that reduces the claimed outputs to the inputs by construction. No equations, uniqueness theorems, or self-citations are invoked as load-bearing justifications for the architecture or results; comparisons to baselines and prior dual-LSTM models are external empirical checks on the same dataset. The derivation chain is therefore self-contained as a data-driven modeling contribution.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The model rests on standard LSTM and attention assumptions plus hyperparameters tuned to the BBA dataset; no new physical entities or ad-hoc constants are introduced beyond typical deep-learning practice.

free parameters (2)
  • LSTM hidden size, layers, and attention heads
    Standard architectural hyperparameters whose values are chosen to fit the training data.
  • Fusion gating or residual weights
    Learned or hand-tuned parameters controlling how the two streams are combined.
axioms (2)
  • domain assumption LSTM cells can capture sequential dependencies in booking curves.
    Core justification for using LSTM streams rather than simpler recurrent or feed-forward models.
  • domain assumption Attention mechanisms can identify the most informative time steps within each stream.
    Used to justify the self-, cross-, and hybrid attention modules.

pith-pipeline@v0.9.0 · 5614 in / 1345 out tokens · 56148 ms · 2026-05-13T01:44:47.085055+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages

  1. [1]

    Airline profitability to strengthen slightly in 2025 despite headwinds,

    “Airline profitability to strengthen slightly in 2025 despite headwinds,” https://www.iata.org/en/pressroom/2025-releases/2025-06-02-01/, 2025, accessed: Jul. 18, 2025

  2. [2]

    ARIMA + GARCH + Bootstrap forecasting method applied to the airline industry,

    M. R. Nieto and R. B. Carmona-Benítez, “ARIMA + GARCH + Bootstrap forecasting method applied to the airline industry,”Journal of Air Transport Management, vol. 71, pp. 1–8, 2018

  3. [3]

    Econometric forecasting models for air traffic passenger of Indonesia,

    V . Suryan, “Econometric forecasting models for air traffic passenger of Indonesia,”Journal of Civil Engineering Forum, vol. 3, no. 1, 2017

  4. [4]

    A novel LSTM-based daily airline demand forecasting method using vertical and horizontal time series,

    B. Pan, D. Yuan, W. Sun, C. Liang, and D. Li, “A novel LSTM-based daily airline demand forecasting method using vertical and horizontal time series,” inTrends and Applications in Knowledge Discovery and Data Mining (PAKDD Workshops). Cham: Springer, 2018, pp. 168– 173

  5. [5]

    Flight short-term booking demand forecasting based on a long short-term memory network,

    H. He, L. Chen, and S. Wang, “Flight short-term booking demand forecasting based on a long short-term memory network,”Computers & Industrial Engineering, vol. 186, p. 109707, 2023

  6. [6]

    SARIMA damp trend grey forecasting model for airline industry,

    R. B. Carmona-Benítez and M. R. Nieto, “SARIMA damp trend grey forecasting model for airline industry,”Journal of Air Transport Management, vol. 82, p. 101736, 2020

  7. [7]

    Forecasting air passenger traffic flow based on the two-phase learning model,

    X. Wu, Y . Xiang, G. Mao, M. Du, X. Yang, and X. Zhou, “Forecasting air passenger traffic flow based on the two-phase learning model,” Journal of Supercomputing, vol. 77, no. 5, pp. 4221–4243, 2021

  8. [8]

    An analysis of the prospects of ultra-long-haul airline operations using passenger demand data,

    W. Grimme, S. Bingemer, and S. Maertens, “An analysis of the prospects of ultra-long-haul airline operations using passenger demand data,” Transportation Research Procedia, vol. 51, pp. 208–216, 2020

  9. [9]

    An airport level framework for examining the impact of COVID-19 on airline demand,

    S. Dey Tirtha, T. Bhowmik, and N. Eluru, “An airport level framework for examining the impact of COVID-19 on airline demand,”Transporta- tion Research Part A: Policy and Practice, vol. 159, pp. 169–181, 2022

  10. [10]

    Air passenger demand model (APDM): Econometric model for forecasting demand in passenger air transports in Nepal,

    D. P. Bastola, “Air passenger demand model (APDM): Econometric model for forecasting demand in passenger air transports in Nepal,” vol. 1, no. 4, 2017

  11. [11]

    Estimating unconstrained cus- tomer choice set demand: A case study on airline reservation data,

    A. Haensel, G. Koole, and J. Erdman, “Estimating unconstrained cus- tomer choice set demand: A case study on airline reservation data,” Journal of Choice Modelling, vol. 4, no. 3, pp. 75–87, 2011

  12. [12]

    Dynamic airline booking forecasting,

    T. van Ostaijen, B. F. Santos, and M. Mitici, “Dynamic airline booking forecasting,” unpublished manuscript

  13. [13]

    Dynamic airline booking demand forecasting,

    T. L. E. Marques, “Dynamic airline booking demand forecasting,” Master’s thesis, 2016

  14. [14]

    Forecasting air travel demand for selected destinations using machine learning methods,

    M. Firat, D. Yiltas-Kaplan, and R. Samli, “Forecasting air travel demand for selected destinations using machine learning methods,”Journal of Universal Computer Science, vol. 27, no. 6, 2021

  15. [15]

    A practical approach to determining critical macroeconomic factors in air-traffic volume based on K-means clustering and decision-tree classification,

    J.-H. Chen, H.-H. Wei, C.-L. Chen, H.-Y . Wei, Y .-P. Chen, and Z. Ye, “A practical approach to determining critical macroeconomic factors in air-traffic volume based on K-means clustering and decision-tree classification,”Journal of Air Transport Management, vol. 82, p. 101743, 2020

  16. [16]

    A machine learning approach to itinerary-level booking prediction in competitive airline markets,

    D. Hopman, G. Koole, and R. V . D. Mei, “A machine learning approach to itinerary-level booking prediction in competitive airline markets,” International Journal of Revenue Management, vol. 12, no. 3–4, pp. 153–191, 2021

  17. [17]

    Exploring deep learning approaches for short-term passenger demand prediction,

    Z. Ghandeharioun, P. Zendehdel Nobari, and W. Wu, “Exploring deep learning approaches for short-term passenger demand prediction,”Data Science for Transportation, vol. 5, no. 3, p. 19, 2023

  18. [18]

    Learning internal representations by error propagation,

    D. E. Rumelhart and J. L. McClelland, “Learning internal representations by error propagation,” inParallel Distributed Processing: Explorations in the Microstructure of Cognition. MIT Press, 1987, pp. 318–362

  19. [19]

    Long short-term memory,

    S. Hochreiter and J. Schmidhuber, “Long short-term memory,”Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997

  20. [20]

    Forecasting air passenger demand: a comparison of LSTM and SARIMA,

    Q. H. Do, S.-K. Lo, J.-F. Chen, C.-L. Le, and L. H. Anh, “Forecasting air passenger demand: a comparison of LSTM and SARIMA,”Journal of Computer Science, vol. 16, no. 7, pp. 1063–1084, 2020

  21. [21]

    Deep learning models for forecasting aviation demand time series,

    A. Kanavos, F. Kounelis, L. Iliadis, and C. Makris, “Deep learning models for forecasting aviation demand time series,”Neural Computing and Applications, vol. 33, no. 23, pp. 16 329–16 343, 2021

  22. [22]

    Estimating and projecting air passenger traffic during the COVID-19 coronavirus outbreak and its socio-economic impact,

    S. M. Iacus, F. Natale, C. Santamaria, S. Spyratos, and M. Vespe, “Estimating and projecting air passenger traffic during the COVID-19 coronavirus outbreak and its socio-economic impact,”Safety Science, vol. 129, p. 104791, 2020

  23. [23]

    Efficient two-stream network for violence detection using separable convolutional LSTM,

    Z. Islam, M. Rukonuzzaman, R. Ahmed, M. H. Kabir, and M. Farazi, “Efficient two-stream network for violence detection using separable convolutional LSTM,” inProc. International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1–8

  24. [24]

    Forecast- ing agricultural commodity prices using dual input attention LSTM,

    Y . H. Gu, D. Jin, H. Yin, R. Zheng, X. Piao, and S. J. Yoo, “Forecast- ing agricultural commodity prices using dual input attention LSTM,” Agriculture, vol. 12, no. 2, 2022

  25. [25]

    Remaining useful life prediction using dual-channel LSTM with time feature and its difference,

    C. Peng, J. Wu, Q. Wang, W. Gui, and Z. Tang, “Remaining useful life prediction using dual-channel LSTM with time feature and its difference,”Entropy, vol. 24, no. 12, 2022

  26. [26]

    Modeling and predicting passenger load factor in air transportation: A deep assessment methodology with fractional calculus approach utilizing reservation data,

    K. ¸ Sim¸ sek, N. Ö. Ö. Tu ˘grul, K. Karaçuha, V . Tabatadze, and E. Karaçuha, “Modeling and predicting passenger load factor in air transportation: A deep assessment methodology with fractional calculus approach utilizing reservation data,”Fractal and Fractional, vol. 8, no. 4, p. 214, 2024