Discovering Entity-Conditioned Lag Heterogeneity: A Lag-Gated Neural Audit Framework for Panel Time Series

Andi Xu

arxiv: 2605.21542 · v1 · pith:AG6RLUNTnew · submitted 2026-05-20 · 💻 cs.LG

Discovering Entity-Conditioned Lag Heterogeneity: A Lag-Gated Neural Audit Framework for Panel Time Series

Andi Xu This is my paper

Pith reviewed 2026-05-22 00:42 UTC · model grok-4.3

classification 💻 cs.LG

keywords panel time serieslag heterogeneitydistributed lag modelsentity conditioningneural audit frameworkAC-GATEtemporal miningscale-invariant gate

0 comments

The pith

AC-GATE conditions lag weights on entity proxies to produce heterogeneous lags as direct structural outputs rather than post-hoc results.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper treats entity-conditioned heterogeneous lag discovery as a temporal panel mining task. It introduces AC-GATE, an Adaptive-Conditioning Encoder with a Scale-Invariant Lag Gate, that uses observable entity-level proxies to shape lag-weight distributions over historical observations. This design makes the resulting effective lags integral outputs of the model instead of separate explanations added afterward. Evaluation follows a layered audit that checks predictive calibration separately from lag recovery, using synthetic panels with known ground-truth lags and real country-level panels for external structure testing. A sympathetic reader would care because the approach supplies directly auditable, entity-specific summaries of how different units respond to past signals over varying time horizons.

Core claim

AC-GATE instantiates conditional Moderated Distributed Lag by conditioning lag-weight distributions on observable entity-level proxies, thereby making effective lags structural outputs of the model. The framework recovers heterogeneous lag structure in synthetic data with known ground truth and generates non-degenerate, externally structured effective lags in real country panels under a layered audit protocol that separates calibration from discovery.

What carries the argument

The Scale-Invariant Lag Gate inside the Adaptive-Conditioning Encoder, which modulates lag weights over historical observations according to entity proxies.

If this is right

The model recovers known heterogeneous lag structures from synthetic panel data.
Real data yields non-degenerate effective lags that exhibit external structure.
The audit protocol separates predictive calibration from lag discovery validation.
Entity-specific lag summaries become direct model outputs available for auditing.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could support finer-grained policy analysis by revealing which entities respond on shorter or longer horizons to the same signals.
Proxy-based conditioning gates of this type might extend to other panel settings such as firm-level or patient-level time series.
Additional stress tests on panels with weaker proxy-lag correlations would clarify the limits of the structural-output claim.

Load-bearing premise

Observable entity-level proxies are sufficient to condition lag-weight distributions so that the resulting effective lags reflect structural heterogeneity instead of post-hoc artifacts.

What would settle it

A synthetic panel test in which the model fails to recover the known ground-truth heterogeneous lag structures even though the provided proxies are supplied as conditioning inputs.

Figures

Figures reproduced from arXiv: 2605.21542 by Andi Xu.

**Figure 2.** Figure 2: Overview of the AC-GATE architecture. D. Training Objective Training minimizes the domain-agnostic objective: L = Ltask + λrLrecon. (6) The task loss Ltask is the forecasting mean squared error (MSE) between the predicted target and the true target at the corresponding valid post-warmup step. The reconstruction loss Lrecon compares the original proxy vector pi with the reconstructed proxy vector pbi . The … view at source ↗

**Figure 3.** Figure 3: Relationship between entity-level stratifiers and learned effective lag. [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Evidence for AC-GATE recovery, forecast-mechanism decoupling, and stratifier alignment. [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 5.** Figure 5: Forecast R2 on the two real-world panels. Notes. All deep baselines (incl. AC-GATE and ablations) fall in the near-zero band [−0.05, 0.10]. Grouped ARDL flips sign across domains (−0.089 in Economics; +0.607 in Energy), so forecast R2 alone gives unstable rankings; interpret it alongside L1/L2 audit evidence (Sec. V). TABLE IV PROXY-SHUFFLE NEGATIVE CONTROL ON REAL-DATA L2 ALIGNMENT. Dom. Model Mean |ρ| sd… view at source ↗

read the original abstract

Country-level temporal panels are widely used in empirical analysis. Researchers often need to audit how different entities respond to historical signals over different time horizons. Current approaches typically do not provide directly auditable entity-specific lag summaries. We formulate entity-conditioned heterogeneous lag discovery as a temporal panel mining task and propose AC-GATE, an Adaptive-Conditioning Encoder with a Scale-Invariant Lag Gate. It instantiates conditional Moderated Distributed Lag by using observable entity-level proxies to condition lag-weight distributions over historical observations, thereby making effective lags structural outputs of the model rather than post-hoc explanations. The evaluation is based on a layered audit protocol that separates predictive calibration from lag discovery. A synthetic panel with known ground-truth lags is used for mechanism recovery testing, and two real-world country-level panels are used for external audit and stress testing. The results show that AC-GATE can recover heterogeneous lag structure in synthetic data, and generates non-degenerate, externally structured effective lags in real data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

AC-GATE turns entity proxies into a conditioning mechanism for lag weights in panel series, but the abstract leaves the quantitative gains and baseline comparisons unclear.

read the letter

The main point is that this paper presents AC-GATE, a neural framework to discover entity-specific lag structures in panel time series by conditioning on observable proxies, making those lags part of the model output rather than something derived afterward. It frames the problem as entity-conditioned heterogeneous lag discovery and implements it with an adaptive-conditioning encoder plus a scale-invariant lag gate. This is a direct extension of moderated distributed lag models into a neural setting where the conditioning is baked into the architecture. The layered audit protocol is a practical choice: it first tests recovery on synthetic panels with known ground-truth lags, then checks external structure on real country-level data. That separation keeps the lag claims from being purely circular with the prediction task. The synthetic recovery and the non-degenerate lags on real panels are the concrete results shown so far. The approach is aimed at researchers who already work with temporal panels and want entity-level lag summaries that can be audited directly. A reader in applied social science or policy analysis who needs a tool for making lag heterogeneity visible would find the setup useful. The soft spots are mainly in the missing details. The abstract gives no recovery metrics, no error bars, and no head-to-head numbers against standard distributed lag or conditional neural baselines, so the size of the improvement is hard to judge. The circularity from fitting the model on the same data used for lag extraction is real, even if the external audit step reduces its impact on real panels. The paper shows clear thinking on the task and a reproducible evaluation plan, so it deserves a serious referee. I would recommend sending it for peer review.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces AC-GATE, an Adaptive-Conditioning Encoder with a Scale-Invariant Lag Gate, to formulate entity-conditioned heterogeneous lag discovery as a temporal panel mining task. It instantiates conditional Moderated Distributed Lag by conditioning lag-weight distributions over historical observations using observable entity-level proxies, thereby treating effective lags as structural model outputs. Evaluation relies on a layered audit protocol that separates predictive calibration from lag discovery, using a synthetic panel with known ground-truth lags for recovery testing and two real-world country-level panels for external audit and stress testing. The central claim is that AC-GATE recovers heterogeneous lag structure in synthetic data and produces non-degenerate, externally structured effective lags in real data.

Significance. If the central claims hold under rigorous verification, the framework would address a genuine gap in providing directly auditable, entity-specific lag summaries for panel time series, moving beyond post-hoc explanations. The layered audit protocol and use of synthetic data with known ground-truth lags for mechanism recovery are notable strengths that support falsifiability. The approach could have impact in empirical domains relying on country-level panels, such as economics or policy analysis, by offering a neural method to surface lag heterogeneity conditioned on observable proxies.

major comments (3)

[Abstract / Evaluation section] Abstract and evaluation description: The claim of 'successful recovery' on synthetic data with known ground-truth lags is not supported by any reported quantitative metrics (e.g., lag recovery error, precision/recall on lag weights, or comparison to baselines such as standard distributed lag models or entity-specific ARDL). Without these, the mechanism recovery test cannot be assessed for robustness or superiority.
[Abstract] Abstract: The framing of effective lags as 'structural outputs of the model rather than post-hoc explanations' creates a circularity risk, as the lag weights are produced by parameters fitted to the same panel data used in the external audit. The layered audit protocol is described as separating predictive calibration from lag discovery, but no details are given on how it breaks the dependence on the training objective (e.g., via held-out entities, regularization, or out-of-sample lag validation).
[Method (AC-GATE construction)] Method description: The Scale-Invariant Lag Gate and its conditioning via entity proxies are central to the claim of heterogeneous lag discovery, yet no equations or ablation results are referenced showing that the resulting lag distributions are invariant to scale or that conditioning improves recovery over unconditioned baselines.

minor comments (2)

[Abstract] The abstract would benefit from explicit mention of the number of entities, time periods, and lag orders used in the synthetic and real panels to allow readers to gauge the scale of the experiments.
[Method] Notation for the conditional lag weights and the moderation mechanism should be introduced with a clear equation early in the methods to improve readability for readers unfamiliar with moderated distributed lag models.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments, which highlight areas where the manuscript can be strengthened. We respond to each major comment below and indicate planned revisions.

read point-by-point responses

Referee: [Abstract / Evaluation section] Abstract and evaluation description: The claim of 'successful recovery' on synthetic data with known ground-truth lags is not supported by any reported quantitative metrics (e.g., lag recovery error, precision/recall on lag weights, or comparison to baselines such as standard distributed lag models or entity-specific ARDL). Without these, the mechanism recovery test cannot be assessed for robustness or superiority.

Authors: We agree that the current presentation relies primarily on qualitative descriptions of recovery. In the revised manuscript we will add quantitative metrics, including L2 lag recovery error against ground-truth weights in the synthetic panel and direct comparisons to baselines such as standard distributed lag models and entity-specific ARDL, to allow rigorous assessment of the mechanism recovery test. revision: yes
Referee: [Abstract] Abstract: The framing of effective lags as 'structural outputs of the model rather than post-hoc explanations' creates a circularity risk, as the lag weights are produced by parameters fitted to the same panel data used in the external audit. The layered audit protocol is described as separating predictive calibration from lag discovery, but no details are given on how it breaks the dependence on the training objective (e.g., via held-out entities, regularization, or out-of-sample lag validation).

Authors: The audit protocol separates the steps by using held-out entities for lag-structure validation, so that conditioning parameters fitted on training entities are evaluated on unseen entities. We will revise the abstract and evaluation section to explicitly describe this held-out entity split and out-of-sample lag validation procedure, thereby clarifying how dependence on the training objective is broken. revision: yes
Referee: [Method (AC-GATE construction)] Method description: The Scale-Invariant Lag Gate and its conditioning via entity proxies are central to the claim of heterogeneous lag discovery, yet no equations or ablation results are referenced showing that the resulting lag distributions are invariant to scale or that conditioning improves recovery over unconditioned baselines.

Authors: Equation 3 in the methods defines the Scale-Invariant Lag Gate with explicit normalization to enforce scale invariance. We acknowledge that dedicated ablation results are not currently reported. The revised manuscript will include ablation experiments contrasting the full conditioned model against an unconditioned variant, with quantitative recovery metrics to demonstrate the benefit of entity-proxy conditioning. revision: yes

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The provided abstract and description frame AC-GATE as instantiating conditional Moderated Distributed Lag via observable proxies to produce effective lags as model outputs, with evaluation separated into synthetic recovery testing and real-data external audit. No equations, self-citations, or definitional steps are quoted that reduce the lag discovery directly to a fitted parameter or input by construction. The synthetic ground-truth test and layered audit protocol are presented as independent verification mechanisms, keeping the derivation self-contained against external benchmarks rather than tautological.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No specific free parameters, axioms, or invented entities are identifiable from the abstract alone; the model presumably contains standard neural-network weights and hyperparameters whose values are not reported.

pith-pipeline@v0.9.0 · 5694 in / 1148 out tokens · 57602 ms · 2026-05-22T00:42:08.074592+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

AC-GATE ... instantiates conditional Moderated Distributed Lag by using observable entity-level proxies to condition lag-weight distributions ... effective lag k*_i = sum k ω_i,k

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages · 2 internal anchors

[1]

Heterogeneous effects of energy efficiency and renewable energy on carbon emissions: Evidence from developing countries

Rabia Akram et al. “Heterogeneous effects of energy efficiency and renewable energy on carbon emissions: Evidence from developing countries”. In:Journal of Cleaner Production247 (2020), p. 119122.ISSN: 0959- 6526.DOI: https : / / doi . org / 10 . 1016 / j . jclepro . 2019 . 119122.URL: https://www.sciencedirect.com/science/ article/pii/S0959652619339927

work page 2020
[2]

Causal mod- els for longitudinal and panel data: a survey

Dmitry Arkhangelsky and Guido Imbens. “Causal mod- els for longitudinal and panel data: a survey”. In:The Econometrics Journal27.3 (Sept. 2024), pp. C1–C61. ISSN: 1368-4221.DOI: 10 . 1093 / ectj / utae014. eprint: https://academic.oup.com/ectj/article- pdf/27/3/C1/ 59830430/utae014.pdf.URL: https://doi.org/10.1093/ ectj/utae014

work page 2024
[3]

Andrii Babii et al.Machine Learning Panel Data Regressions with Heavy-tailed Dependent Data: The- ory and Application. 2021. arXiv: 2008 . 03600 [econ.EM].URL: https://arxiv.org/abs/2008.03600

work page arXiv 2021
[4]

A simple approach to quantile regres- sion for panel data

Ivan A. Canay. “A simple approach to quantile regres- sion for panel data”. In:The Econometrics Journal14.3 (Oct. 2011), pp. 368–386.ISSN: 1368-4221.DOI: 10. 1111 / j . 1368 - 423X . 2011 . 00349 . x. eprint: https : / / academic.oup.com/ectj/article-pdf/14/3/368/27670447/ ectj0368.pdf.URL: https://doi.org/10.1111/j.1368- 423X.2011.00349.x

work page doi:10.1111/j.1368- 2011
[5]

On the (Mis)Use of Machine Learning With Panel Data

Augusto Cerqua, Marco Letta, and Gabriele Pinto. “On the (Mis)Use of Machine Learning With Panel Data”. In:Oxford Bulletin of Economics and Statisticsn/a.n/a ().DOI: https://doi.org/10.1111/obes.70019. eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/obes. 70019.URL: https://onlinelibrary.wiley.com/doi/abs/10. 1111/obes.70019

work page doi:10.1111/obes.70019
[6]

Ilias Chronopoulos et al.Deep Neural Network Estima- tion in Panel Data Models. 2023. arXiv: 2305.19921 [econ.EM].URL: https://arxiv.org/abs/2305.19921

work page arXiv 2023
[7]

Absorptive Capacity: A New Perspective on Learning and Innova- tion

Wesley M. Cohen and Daniel A. Levinthal. “Absorptive Capacity: A New Perspective on Learning and Innova- tion”. In:Administrative Science Quarterly35.1 (1990), pp. 128–152.ISSN: 00018392.URL: http://www.jstor. org/stable/2393553 (visited on 05/01/2026)

work page arXiv 1990
[8]

The Next Generation of the Penn World Table

Robert C. Feenstra, Robert Inklaar, and Marcel P. Tim- mer. “The Next Generation of the Penn World Table”. In:American Economic Review105.10 (Oct. 2015), pp. 3150–82.DOI: 10.1257/aer.20130954.URL: https: //www.aeaweb.org/articles?id=10.1257/aer.20130954

work page doi:10.1257/aer.20130954.url: 2015
[9]

Threshold effects in non-dynamic panels: Estimation, testing, and inference

Bruce E. Hansen. “Threshold effects in non-dynamic panels: Estimation, testing, and inference”. In:Journal of Econometrics93.2 (1999), pp. 345–368.ISSN: 0304- 4076.DOI: https://doi.org/10.1016/S0304- 4076(99) 00025-1.URL: https://www.sciencedirect.com/science/ article/pii/S0304407699000251

work page doi:10.1016/s0304- 1999
[10]

The Worldwide Governance Indicators: Methodology and Analytical Issues

Daniel Kaufmann, Aart Kraay, and Massimo Mastruzzi. “The Worldwide Governance Indicators: Methodology and Analytical Issues”. In:Hague Journal on the Rule of Law3.2 (2011), pp. 220–246.DOI: 10 . 1017 / S1876404511200046

work page 2011
[11]

Temporal Fusion Transformers for interpretable multi-horizon time series forecasting

Bryan Lim et al. “Temporal Fusion Transformers for interpretable multi-horizon time series forecasting”. In:International Journal of Forecasting37.4 (2021), pp. 1748–1764.ISSN: 0169-2070.DOI: https : / / doi . org / 10 . 1016 / j . ijforecast . 2021 . 03 . 012.URL: https : / / www. sciencedirect . com / science / article / pii / S0169207021000637

work page 2021
[12]

The Mythos of Model Interpretabil- ity: In machine learning, the concept of interpretability is both important and slippery

Zachary C Lipton. “The Mythos of Model Interpretabil- ity: In machine learning, the concept of interpretability is both important and slippery”. eng. In:ACM queue 16.3 (2018), pp. 31–57.ISSN: 1542-7730.DOI: 10.1145/ 3236386.3241340

work page arXiv 2018
[13]

Yong Liu et al.iTransformer: Inverted Transformers Are Effective for Time Series Forecasting. 2024. arXiv: 2310.06625[cs.LG].URL: https://arxiv.org/abs/2310. 06625

work page internal anchor Pith review Pith/arXiv arXiv 2024
[14]

Renew- able energy and CO2 emissions intensity in the top carbon intense countries

Ziroat Mirziyoyeva and Raufhon Salahodjaev. “Renew- able energy and CO2 emissions intensity in the top carbon intense countries”. In:Renewable Energy192 (2022), pp. 507–512.ISSN: 0960-1481.DOI: https : / / doi . org / 10 . 1016 / j . renene . 2022 . 04 . 137.URL: https : / / www. sciencedirect . com / science / article / pii / S096014812200605X

work page 2022
[15]

de Groote

M.Hashem Pesaran and Ron Smith. “Estimating long- run relationships from dynamic heterogeneous panels”. In:Journal of Econometrics68.1 (1995), pp. 79–113. ISSN: 0304-4076.DOI: https://doi.org/10.1016/0304- 4076(94)01644-F.URL: https://www.sciencedirect.com/ science/article/pii/030440769401644F

work page doi:10.1016/0304- 1995
[16]

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Cynthia Rudin. “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead”. eng. In:Nature machine intelligence1.5 (May 2019), pp. 206–215.ISSN: 2522- 5839.DOI: 10.1038/s42256-019-0048-x

work page doi:10.1038/s42256-019-0048-x 2019
[17]

Lessons from three decades of IT productivity research: towards a better understanding of IT-induced productivity effects: S. Schweikl, R. Obermaier

Stefan Schweikl and Robert Obermaier. “Lessons from three decades of IT productivity research: towards a better understanding of IT-induced productivity effects: S. Schweikl, R. Obermaier”. In:Management review quarterly70.4 (2020), pp. 461–507.DOI: 10 . 1007 / s11301- 019- 00173- 6.URL: https://doi.org/10.1007/ s11301-019-00173-6

work page 2020
[18]

A comprehensive sur- vey on statistical and deep learning models for panel data analysis: A comprehensive survey on statistical and deep learning

Uthayasanker Thayasivam et al. “A comprehensive sur- vey on statistical and deep learning models for panel data analysis: A comprehensive survey on statistical and deep learning...” In:Knowl. Inf. Syst.67.12 (Oct. 2025), pp. 11359–11379.ISSN: 0219-1377.DOI: 10 . 1007/s10115- 025- 02607- y.URL: https://doi.org/10. 1007/s10115-025-02607-y

work page 2025
[19]

Counterfactual Explanations for Time Series Forecasting

Zhendong Wang et al. “Counterfactual Explanations for Time Series Forecasting”. In:2023 IEEE International Conference on Data Mining (ICDM). 2023, pp. 1391– 1396.DOI: 10.1109/ICDM58522.2023.00180

work page doi:10.1109/icdm58522.2023.00180 2023
[20]

Haixu Wu et al.TimesNet: Temporal 2D-Variation Mod- eling for General Time Series Analysis. 2023. arXiv: 2210 . 02186[cs.LG].URL: https : / / arxiv. org / abs / 2210.02186

work page internal anchor Pith review Pith/arXiv arXiv 2023
[21]

Not All Attention Is Needed: Gated Attention Network for Sequence Data

Lanqing Xue, Xiaopeng Li, and Nevin L. Zhang. “Not All Attention Is Needed: Gated Attention Network for Sequence Data”. In:Proceedings of the AAAI Con- ference on Artificial Intelligence34.04 (Apr. 2020), pp. 6550–6557.DOI: 10.1609/aaai.v34i04.6129.URL: https://ojs.aaai.org/index.php/AAAI/article/view/6129

work page doi:10.1609/aaai.v34i04.6129.url: 2020
[22]

Yucheng Yang, Zhong Zheng, and Weinan E.Inter- pretable Neural Networks for Panel Data Analysis in Economics. 2020. arXiv: 2010 . 05311[econ.EM]. URL: https://arxiv.org/abs/2010.05311

work page arXiv 2020
[23]

Covariate-Adjusted Deep Causal Learning for Hetero- geneous Panel Data Models

Guanhao Zhou, Yuefeng Han, and Xiufan Yu. Covariate-Adjusted Deep Causal Learning for Hetero- geneous Panel Data Models. 2025. arXiv: 2505.20536 [stat.ML].URL: https://arxiv.org/abs/2505.20536

work page arXiv 2025

[1] [1]

Heterogeneous effects of energy efficiency and renewable energy on carbon emissions: Evidence from developing countries

Rabia Akram et al. “Heterogeneous effects of energy efficiency and renewable energy on carbon emissions: Evidence from developing countries”. In:Journal of Cleaner Production247 (2020), p. 119122.ISSN: 0959- 6526.DOI: https : / / doi . org / 10 . 1016 / j . jclepro . 2019 . 119122.URL: https://www.sciencedirect.com/science/ article/pii/S0959652619339927

work page 2020

[2] [2]

Causal mod- els for longitudinal and panel data: a survey

Dmitry Arkhangelsky and Guido Imbens. “Causal mod- els for longitudinal and panel data: a survey”. In:The Econometrics Journal27.3 (Sept. 2024), pp. C1–C61. ISSN: 1368-4221.DOI: 10 . 1093 / ectj / utae014. eprint: https://academic.oup.com/ectj/article- pdf/27/3/C1/ 59830430/utae014.pdf.URL: https://doi.org/10.1093/ ectj/utae014

work page 2024

[3] [3]

Andrii Babii et al.Machine Learning Panel Data Regressions with Heavy-tailed Dependent Data: The- ory and Application. 2021. arXiv: 2008 . 03600 [econ.EM].URL: https://arxiv.org/abs/2008.03600

work page arXiv 2021

[4] [4]

A simple approach to quantile regres- sion for panel data

Ivan A. Canay. “A simple approach to quantile regres- sion for panel data”. In:The Econometrics Journal14.3 (Oct. 2011), pp. 368–386.ISSN: 1368-4221.DOI: 10. 1111 / j . 1368 - 423X . 2011 . 00349 . x. eprint: https : / / academic.oup.com/ectj/article-pdf/14/3/368/27670447/ ectj0368.pdf.URL: https://doi.org/10.1111/j.1368- 423X.2011.00349.x

work page doi:10.1111/j.1368- 2011

[5] [5]

On the (Mis)Use of Machine Learning With Panel Data

Augusto Cerqua, Marco Letta, and Gabriele Pinto. “On the (Mis)Use of Machine Learning With Panel Data”. In:Oxford Bulletin of Economics and Statisticsn/a.n/a ().DOI: https://doi.org/10.1111/obes.70019. eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/obes. 70019.URL: https://onlinelibrary.wiley.com/doi/abs/10. 1111/obes.70019

work page doi:10.1111/obes.70019

[6] [6]

Ilias Chronopoulos et al.Deep Neural Network Estima- tion in Panel Data Models. 2023. arXiv: 2305.19921 [econ.EM].URL: https://arxiv.org/abs/2305.19921

work page arXiv 2023

[7] [7]

Absorptive Capacity: A New Perspective on Learning and Innova- tion

Wesley M. Cohen and Daniel A. Levinthal. “Absorptive Capacity: A New Perspective on Learning and Innova- tion”. In:Administrative Science Quarterly35.1 (1990), pp. 128–152.ISSN: 00018392.URL: http://www.jstor. org/stable/2393553 (visited on 05/01/2026)

work page arXiv 1990

[8] [8]

The Next Generation of the Penn World Table

Robert C. Feenstra, Robert Inklaar, and Marcel P. Tim- mer. “The Next Generation of the Penn World Table”. In:American Economic Review105.10 (Oct. 2015), pp. 3150–82.DOI: 10.1257/aer.20130954.URL: https: //www.aeaweb.org/articles?id=10.1257/aer.20130954

work page doi:10.1257/aer.20130954.url: 2015

[9] [9]

Threshold effects in non-dynamic panels: Estimation, testing, and inference

Bruce E. Hansen. “Threshold effects in non-dynamic panels: Estimation, testing, and inference”. In:Journal of Econometrics93.2 (1999), pp. 345–368.ISSN: 0304- 4076.DOI: https://doi.org/10.1016/S0304- 4076(99) 00025-1.URL: https://www.sciencedirect.com/science/ article/pii/S0304407699000251

work page doi:10.1016/s0304- 1999

[10] [10]

The Worldwide Governance Indicators: Methodology and Analytical Issues

Daniel Kaufmann, Aart Kraay, and Massimo Mastruzzi. “The Worldwide Governance Indicators: Methodology and Analytical Issues”. In:Hague Journal on the Rule of Law3.2 (2011), pp. 220–246.DOI: 10 . 1017 / S1876404511200046

work page 2011

[11] [11]

Temporal Fusion Transformers for interpretable multi-horizon time series forecasting

Bryan Lim et al. “Temporal Fusion Transformers for interpretable multi-horizon time series forecasting”. In:International Journal of Forecasting37.4 (2021), pp. 1748–1764.ISSN: 0169-2070.DOI: https : / / doi . org / 10 . 1016 / j . ijforecast . 2021 . 03 . 012.URL: https : / / www. sciencedirect . com / science / article / pii / S0169207021000637

work page 2021

[12] [12]

The Mythos of Model Interpretabil- ity: In machine learning, the concept of interpretability is both important and slippery

Zachary C Lipton. “The Mythos of Model Interpretabil- ity: In machine learning, the concept of interpretability is both important and slippery”. eng. In:ACM queue 16.3 (2018), pp. 31–57.ISSN: 1542-7730.DOI: 10.1145/ 3236386.3241340

work page arXiv 2018

[13] [13]

Yong Liu et al.iTransformer: Inverted Transformers Are Effective for Time Series Forecasting. 2024. arXiv: 2310.06625[cs.LG].URL: https://arxiv.org/abs/2310. 06625

work page internal anchor Pith review Pith/arXiv arXiv 2024

[14] [14]

Renew- able energy and CO2 emissions intensity in the top carbon intense countries

Ziroat Mirziyoyeva and Raufhon Salahodjaev. “Renew- able energy and CO2 emissions intensity in the top carbon intense countries”. In:Renewable Energy192 (2022), pp. 507–512.ISSN: 0960-1481.DOI: https : / / doi . org / 10 . 1016 / j . renene . 2022 . 04 . 137.URL: https : / / www. sciencedirect . com / science / article / pii / S096014812200605X

work page 2022

[15] [15]

de Groote

M.Hashem Pesaran and Ron Smith. “Estimating long- run relationships from dynamic heterogeneous panels”. In:Journal of Econometrics68.1 (1995), pp. 79–113. ISSN: 0304-4076.DOI: https://doi.org/10.1016/0304- 4076(94)01644-F.URL: https://www.sciencedirect.com/ science/article/pii/030440769401644F

work page doi:10.1016/0304- 1995

[16] [16]

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Cynthia Rudin. “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead”. eng. In:Nature machine intelligence1.5 (May 2019), pp. 206–215.ISSN: 2522- 5839.DOI: 10.1038/s42256-019-0048-x

work page doi:10.1038/s42256-019-0048-x 2019

[17] [17]

Lessons from three decades of IT productivity research: towards a better understanding of IT-induced productivity effects: S. Schweikl, R. Obermaier

Stefan Schweikl and Robert Obermaier. “Lessons from three decades of IT productivity research: towards a better understanding of IT-induced productivity effects: S. Schweikl, R. Obermaier”. In:Management review quarterly70.4 (2020), pp. 461–507.DOI: 10 . 1007 / s11301- 019- 00173- 6.URL: https://doi.org/10.1007/ s11301-019-00173-6

work page 2020

[18] [18]

A comprehensive sur- vey on statistical and deep learning models for panel data analysis: A comprehensive survey on statistical and deep learning

Uthayasanker Thayasivam et al. “A comprehensive sur- vey on statistical and deep learning models for panel data analysis: A comprehensive survey on statistical and deep learning...” In:Knowl. Inf. Syst.67.12 (Oct. 2025), pp. 11359–11379.ISSN: 0219-1377.DOI: 10 . 1007/s10115- 025- 02607- y.URL: https://doi.org/10. 1007/s10115-025-02607-y

work page 2025

[19] [19]

Counterfactual Explanations for Time Series Forecasting

Zhendong Wang et al. “Counterfactual Explanations for Time Series Forecasting”. In:2023 IEEE International Conference on Data Mining (ICDM). 2023, pp. 1391– 1396.DOI: 10.1109/ICDM58522.2023.00180

work page doi:10.1109/icdm58522.2023.00180 2023

[20] [20]

Haixu Wu et al.TimesNet: Temporal 2D-Variation Mod- eling for General Time Series Analysis. 2023. arXiv: 2210 . 02186[cs.LG].URL: https : / / arxiv. org / abs / 2210.02186

work page internal anchor Pith review Pith/arXiv arXiv 2023

[21] [21]

Not All Attention Is Needed: Gated Attention Network for Sequence Data

Lanqing Xue, Xiaopeng Li, and Nevin L. Zhang. “Not All Attention Is Needed: Gated Attention Network for Sequence Data”. In:Proceedings of the AAAI Con- ference on Artificial Intelligence34.04 (Apr. 2020), pp. 6550–6557.DOI: 10.1609/aaai.v34i04.6129.URL: https://ojs.aaai.org/index.php/AAAI/article/view/6129

work page doi:10.1609/aaai.v34i04.6129.url: 2020

[22] [22]

Yucheng Yang, Zhong Zheng, and Weinan E.Inter- pretable Neural Networks for Panel Data Analysis in Economics. 2020. arXiv: 2010 . 05311[econ.EM]. URL: https://arxiv.org/abs/2010.05311

work page arXiv 2020

[23] [23]

Covariate-Adjusted Deep Causal Learning for Hetero- geneous Panel Data Models

Guanhao Zhou, Yuefeng Han, and Xiufan Yu. Covariate-Adjusted Deep Causal Learning for Hetero- geneous Panel Data Models. 2025. arXiv: 2505.20536 [stat.ML].URL: https://arxiv.org/abs/2505.20536

work page arXiv 2025