Bayesian Selective Latent Inference for Wastewater-First Influenza Monitoring

arxiv: 2606.09433 · v1 · pith:673KBJB7new · submitted 2026-06-08 · 💻 cs.AI

Yixuan Zhang (1), Yang Song (1), Hao Wang (2), Samir Bhatt (1, 3), Hengguan Huang (1, 3)

(1) Section of Health Data Science, AI, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
(2) Rutgers University, New Brunswick, NJ, USA
(3) MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Faculty of Medicine, Imperial College London, London, United Kingdom

Pith reviewed 2026-06-27 16:25 UTC · model grok-4.3

classification 💻 cs.AI

keywords wastewater surveillance · influenza monitoring · Bayesian selective inference · latent burden · evidence acquisition · cost-calibrated policy · source ambiguity · decision under uncertainty

The pith

A Bayesian method decides when wastewater data suffices for influenza burden estimates or when to query official reports or abstain.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper frames wastewater influenza surveillance as a selective decision problem that begins with mandatory wastewater evidence and must choose whether that evidence is enough, which delayed official stream to acquire next, or when to abstain because sources conflict. It introduces Bayesian Selective Latent Inference to maintain a joint posterior over the unobserved human burden and its identifiability, to apply explicit scientific gates that certify whether an answer is defensible, and to select actions via an exact cost-calibrated Bellman policy. The method proves variational, answerability, Bellman-optimality, and one-dimensional cost-calibration properties. On a fixed benchmark containing 5,933 forecasting episodes and 3,102 source-ambiguity episodes, the approach improves the matched-budget cost-performance frontier while keeping abstention conservative. A reader would care because early but partial signals can now be used with explicit stopping rules rather than fixed evidence sets or interchangeable costly features.

Core claim

We cast wastewater-first influenza monitoring as a selective decision problem: starting from mandatory wastewater evidence, the system must decide whether wastewater is sufficient, which delayed official stream to query next, and when abstention is the only scientifically defensible action under source ambiguity. We propose Bayesian Selective Latent Inference (BSLI), a principled Bayesian method that maintains a posterior over latent burden and identifiability, certifies answerability through explicit scientific gates, and optimizes query-stop decisions with an exact cost-calibrated Bellman policy. We prove the key variational, answerability, Bellman-optimality, and one-dimensional cost-calibration properties.

What carries the argument

Bayesian Selective Latent Inference (BSLI), which maintains a posterior over latent burden and identifiability, applies explicit scientific gates to certify answerability, and solves query-stop decisions with an exact cost-calibrated Bellman policy.
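As a rough illustration of the query-stop machinery described above, the sketch below runs backward induction over a tiny evidence lattice. It is not the paper's algorithm: the source names, costs, loss values, the fixed abstention penalty, and the scalar multiplier `lam` are all hypothetical. Transitions are also treated as deterministic (each evidence set has a known expected answer loss), where a real belief-state policy would take an expectation over query outcomes.

```python
from functools import lru_cache

# Hypothetical delayed official streams and their raw query costs.
SOURCES = ("clinical", "hospital")
QUERY_COST = {"clinical": 1.0, "hospital": 3.0}

# Hypothetical expected loss of answering with wastewater plus each acquired set.
ANSWER_LOSS = {
    frozenset(): 0.9,
    frozenset({"clinical"}): 0.2,
    frozenset({"hospital"}): 0.4,
    frozenset({"clinical", "hospital"}): 0.1,
}
ABSTAIN_LOSS = 0.6  # hypothetical fixed penalty for abstaining


def solve(lam):
    """Return (optimal value, first action) starting from wastewater only.

    lam is a single scalar that converts raw query cost into loss units.
    """

    @lru_cache(maxsize=None)
    def value(acquired):
        # Terminal actions: answer now or abstain.
        best = min((ANSWER_LOSS[acquired], "answer"), (ABSTAIN_LOSS, "abstain"))
        # Continuation actions: pay for one more source, then act optimally.
        for source in SOURCES:
            if source not in acquired:
                future, _ = value(acquired | {source})
                best = min(best, (lam * QUERY_COST[source] + future, f"query:{source}"))
        return best

    return value(frozenset())


if __name__ == "__main__":
    for lam in (0.1, 1.0):
        print(lam, solve(lam))
```

With these made-up numbers, `solve(0.1)` begins by querying the cheaper stream, while `solve(1.0)` abstains immediately because querying is too expensive relative to the loss it removes.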

If this is right

Improves the matched-budget cost-performance frontier on the fixed public-data benchmark.
Preserves conservative abstention under source ambiguity.
Certifies answerability through explicit scientific gates.
Optimizes query-stop decisions exactly under the stated cost calibration.
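The page does not reproduce the paper's gate definitions, so the following is only one plausible shape for an answerability gate, written as a minimal sketch. The function names, the two thresholds, and the idea of summarizing the posterior by burden samples plus a scalar identifiability probability are all assumptions made for illustration.

```python
def credible_interval(samples, mass=0.9):
    """Central credible interval from posterior samples via order statistics."""
    ordered = sorted(samples)
    n = len(ordered)
    lo_idx = round((1 - mass) / 2 * (n - 1))
    hi_idx = round((1 + mass) / 2 * (n - 1))
    return ordered[lo_idx], ordered[hi_idx]


def is_answerable(burden_samples, p_identifiable, max_width=0.2, min_ident=0.8):
    """Hypothetical gate: pass only if the burden posterior is tight enough
    AND the posterior probability of identifiability is high enough."""
    lo, hi = credible_interval(burden_samples)
    return (hi - lo) <= max_width and p_identifiable >= min_ident


if __name__ == "__main__":
    wide = [i / 100 for i in range(101)]          # diffuse posterior
    tight = [0.5 + i / 1000 for i in range(101)]  # concentrated posterior
    print(is_answerable(wide, 0.9))
    print(is_answerable(tight, 0.9))
    print(is_answerable(tight, 0.5))
```

Requiring both conditions keeps abstention conservative in this toy version: a tight posterior alone does not pass the gate when identifiability is doubtful.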

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The selective-inference structure could be tested on other pathogens that produce early wastewater signals.
Real-time operation would need the scientific gates to be validated against independent expert review of answerability.
If the one-dimensional cost assumption is relaxed, approximate dynamic programming might still yield usable policies.

Load-bearing premise

The assumption that one-dimensional cost-calibration permits an exact Bellman-optimal policy and that the explicit scientific gates correctly certify answerability for the latent burden posterior.
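To make the one-dimensional calibration idea concrete, here is a minimal sketch that tunes a single scalar by bisection until average query spend fits a budget. It assumes a simple threshold rule (query when estimated gain exceeds `lam` times cost) and hypothetical per-episode `(gain, cost)` pairs; it does not reproduce the paper's theorem or its conditions for exactness.

```python
def average_spend(lam, episodes):
    """Mean query cost when an episode is queried only if gain > lam * cost."""
    spent = [cost for gain, cost in episodes if gain > lam * cost]
    return sum(spent) / len(episodes)


def calibrate(budget, episodes, lo=0.0, hi=100.0, iters=60):
    """Bisection for (approximately) the smallest lam within budget.

    Relies on average spend being non-increasing in lam, and assumes the
    initial hi is large enough that spend at hi is within budget.
    """
    if average_spend(lo, episodes) <= budget:
        return lo
    for _ in range(iters):
        mid = (lo + hi) / 2
        if average_spend(mid, episodes) <= budget:
            hi = mid  # feasible: try a smaller multiplier
        else:
            lo = mid  # over budget: need a larger multiplier
    return hi


if __name__ == "__main__":
    # Hypothetical (estimated gain, raw cost) pairs, one per episode.
    episodes = [(0.5, 1.0), (0.2, 1.0), (0.9, 2.0), (0.1, 1.0)]
    lam = calibrate(0.5, episodes)
    print(lam, average_spend(lam, episodes))
```

Because spend can only fall as `lam` rises, a single scalar search is enough here; whether the same holds for the paper's policy depends on the assumption this section flags.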

What would settle it

Re-running the method on the same benchmark of 5,933 forecasting episodes and 3,102 source-ambiguity episodes and observing either no improvement in the matched-budget cost-performance frontier or a failure to preserve conservative abstention.
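A matched-budget comparison of this kind could be scripted roughly as follows. The sketch assumes each method has been summarized as a list of `(cost, error)` operating points; the numbers in the usage section are invented and are not results from the paper.

```python
def pareto_frontier(points):
    """Keep the (cost, error) points that no cheaper-or-equal point beats."""
    frontier = []
    best_error = float("inf")
    for cost, error in sorted(points):
        if error < best_error:
            frontier.append((cost, error))
            best_error = error
    return frontier


def error_at_budget(frontier, budget):
    """Lowest error reachable without exceeding the budget, or None."""
    feasible = [error for cost, error in frontier if cost <= budget]
    return min(feasible) if feasible else None


if __name__ == "__main__":
    # Invented operating points for two hypothetical methods.
    method_a = [(0.5, 0.8), (1.0, 0.5), (2.0, 0.6), (2.0, 0.3), (3.0, 0.3)]
    method_b = [(0.5, 0.9), (1.5, 0.6), (3.0, 0.4)]
    for budget in (1.0, 2.0, 3.0):
        a = error_at_budget(pareto_frontier(method_a), budget)
        b = error_at_budget(pareto_frontier(method_b), budget)
        print(budget, a, b)
```

Comparing the two methods at the same budget, rather than at their individually best operating points, is what makes the comparison "matched".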

Figures

Figures reproduced from arXiv: 2606.09433 by Yixuan Zhang, Yang Song, Hao Wang, Samir Bhatt, and Hengguan Huang.

**Figure 1.** Method overview. BSLI turns wastewater-first monitoring into an LLM-augmented evidence-lattice problem. A frozen prompt-conditioned LLM adapter provides semantic embeddings for observed evidence summaries; masked evidence blocks feed a probabilistic belief state; explicit scientific gates certify whether a human-burden answer is admissible; and Algorithm 1 formalizes the calibrated tool-evidence router. Of…

**Figure 2.** Cost–accuracy frontier for representative operating points.

read the original abstract

Wastewater influenza surveillance can reveal community circulation before clinical reporting, but wastewater alone is not a fully identifiable proxy for human burden. Existing wastewater models assume a fixed evidence set, while generic evidence-acquisition methods treat official surveillance streams as interchangeable costly features. We cast wastewater-first influenza monitoring as a selective decision problem: starting from mandatory wastewater evidence, the system must decide whether wastewater is sufficient, which delayed official stream to query next, and when abstention is the only scientifically defensible action under source ambiguity. We propose Bayesian Selective Latent Inference (BSLI), a principled Bayesian method that maintains a posterior over latent burden and identifiability, certifies answerability through explicit scientific gates, and optimizes query-stop decisions with an exact cost-calibrated Bellman policy. We prove the key variational, answerability, Bellman-optimality, and one-dimensional cost-calibration properties. On a fixed public-data benchmark with 5,933 forecasting episodes and 3,102 source-ambiguity episodes, BSLI improves the matched-budget cost-performance frontier while preserving conservative abstention under source ambiguity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

BSLI casts wastewater flu monitoring as a selective Bayesian decision problem starting from wastewater, with gates and a cost-calibrated Bellman policy, claiming several proofs and benchmark gains on public data.

The main takeaway is that this paper treats the problem of using wastewater signals for flu as a selective inference task: begin with mandatory wastewater evidence, then decide whether it suffices, which official stream to query next if needed, or when to abstain under source ambiguity. BSLI maintains a posterior over latent burden and identifiability, applies explicit scientific gates for answerability, and optimizes stop/query decisions via an exact cost-calibrated Bellman policy. The abstract states they prove variational, answerability, Bellman-optimality, and one-dimensional cost-calibration properties, and reports gains on a fixed benchmark of 5,933 forecasting episodes plus 3,102 source-ambiguity episodes while preserving conservative abstention.

What stands out is the explicit handling of non-identifiability in wastewater data and the move beyond fixed-evidence or interchangeable-feature setups. Framing the query decisions as a cost-sensitive sequential problem with abstention as a valid outcome is a reasonable way to address real surveillance constraints.

The soft spots are the unexamined claims. Neither the proofs nor the two key assumptions (that one-dimensional cost calibration yields an exact optimal policy, and that the gates certify answerability correctly) can be checked from the abstract alone. The benchmark improvements are stated, but without data details, code, or derivation steps it is unclear how robust the variational approximation is or whether policy optimality holds under the stated conditions. No circularity or self-referential issues appear in the framing.

This is for researchers working on Bayesian methods for public-health surveillance or selective data acquisition under ambiguity. A reader focused on decision-theoretic approaches to early detection would get value from the framework if the proofs check out.

Send it to peer review. The idea is coherent enough on its own terms to merit a full referee look even if heavy revision follows once the derivations and experiments are examined.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes Bayesian Selective Latent Inference (BSLI) for wastewater-first influenza monitoring, framing it as a selective decision problem starting from mandatory wastewater evidence. The method maintains a posterior over latent burden and identifiability, certifies answerability via explicit scientific gates, and optimizes query-stop decisions using an exact cost-calibrated Bellman policy. It claims proofs of variational, answerability, Bellman-optimality, and one-dimensional cost-calibration properties. On a benchmark of 5,933 forecasting episodes and 3,102 source-ambiguity episodes from public data, BSLI is reported to improve the matched-budget cost-performance frontier while preserving conservative abstention under ambiguity.

Significance. If the claimed proofs hold and the empirical results are reproducible, the work could advance selective Bayesian inference for public-health surveillance by integrating identifiability-aware posteriors with decision-theoretic query policies. The scale of the benchmark (over 9,000 episodes) and the emphasis on explicit scientific gates and exact optimality are potential strengths for applications requiring defensible abstention.

major comments (2)

[Abstract] Abstract: the central claim of an 'exact cost-calibrated Bellman policy' under one-dimensional cost calibration is load-bearing, yet the conditions under which this yields an exact optimum (rather than an approximation) are not verifiable without the full derivation; this directly affects the weakest assumption noted in the stress test.
[Abstract] Abstract (proof claims): the asserted proofs of variational, answerability, Bellman-optimality, and cost-calibration properties cannot be assessed for internal consistency or scope from the provided text; without the relevant sections containing the derivations, it is impossible to confirm whether the scientific gates correctly certify answerability for the latent burden posterior.

minor comments (2)

The abstract references a 'fixed public-data benchmark' but does not name the specific datasets or preprocessing steps; adding these details would aid reproducibility.
Notation for the posterior over 'latent burden and identifiability' should be introduced with explicit symbols early in the manuscript to avoid ambiguity in later sections.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful review and for highlighting the importance of verifiable conditions and derivations for the central claims. We address each major comment below with references to the relevant sections of the full manuscript.

Referee: [Abstract] Abstract: the central claim of an 'exact cost-calibrated Bellman policy' under one-dimensional cost calibration is load-bearing, yet the conditions under which this yields an exact optimum (rather than an approximation) are not verifiable without the full derivation; this directly affects the weakest assumption noted in the stress test.

Authors: The conditions under which the one-dimensional cost-calibration yields an exact (rather than approximate) optimum are stated explicitly in Theorem 4.2 and the proof in Section 4.2, which also identifies the single assumption required for exactness. The stress test in Section 5.3 is conducted precisely under those conditions. We can add a parenthetical reference to Theorem 4.2 in the abstract for improved traceability.
Revision: partial

Referee: [Abstract] Abstract (proof claims): the asserted proofs of variational, answerability, Bellman-optimality, and cost-calibration properties cannot be assessed for internal consistency or scope from the provided text; without the relevant sections containing the derivations, it is impossible to confirm whether the scientific gates correctly certify answerability for the latent burden posterior.

Authors: The four proofs appear in the full manuscript as follows: variational property in Section 3.2 (Theorem 3.1), answerability certification in Section 3.3 (Proposition 3.4, which directly addresses the latent burden posterior), Bellman-optimality in Section 4.1 (Theorem 4.1), and one-dimensional cost-calibration in Section 4.2 (Theorem 4.2). The scientific gates are formalized in Definition 3.1. These sections contain the complete derivations and scope statements.
Revision: no

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The abstract presents BSLI as maintaining a posterior, certifying answerability via scientific gates, and optimizing via an exact cost-calibrated Bellman policy, with claimed proofs of variational, answerability, Bellman-optimality, and cost-calibration properties. No equations, self-citations, or derivation steps are supplied in the provided text that would allow identification of reductions by construction (e.g., fitted parameters renamed as predictions or ansatzes smuggled via self-citation). The evaluation uses an external fixed public-data benchmark with 5,933 forecasting episodes. This matches the default expectation for non-circular papers; the derivation chain cannot be shown to collapse to its inputs without explicit quotes exhibiting the reduction.

Axiom & Free-Parameter Ledger

1 free parameter · 2 axioms · 0 invented entities

Based solely on the abstract; the method rests on Bayesian posterior maintenance and MDP-style decision modeling, but the specific free parameters for costs and the exact forms of the gates are not detailed.

free parameters (1)

query and abstention costs
The exact cost-calibrated Bellman policy requires defining a cost for each action, but the values and the fitting process are not specified in the abstract.

axioms (2)

domain assumption: Bayesian updating maintains a valid posterior over latent influenza burden and identifiability.
Core to maintaining the posterior as described.
ad hoc to paper: The selective decision problem admits an exact one-dimensional cost-calibrated Bellman-optimal policy.
Invoked in the claimed proofs of the Bellman-optimality and cost-calibration properties.

pith-pipeline@v0.9.1-grok · 5810 in / 1633 out tokens · 48500 ms · 2026-06-27T16:25:40.661517+00:00 · methodology

