arxiv: 2605.04265 · v1 · submitted 2026-05-05 · 🧬 q-bio.BM · q-bio.QM

Recognition: unknown

Benchmarking open-source tools for in silico antiviral drug discovery

Daniel C. Elton, Preston W. Estep

Pith reviewed 2026-05-08 17:00 UTC · model grok-4.3


keywords antiviral drug discovery · binding affinity prediction · machine learning tools · molecular docking · benchmarking · DrugFormDTA · Boltz-2 · GNINA


The pith

Benchmarking shows Boltz-2 and fine-tuned DrugFormDTA provide the strongest predictions of antiviral binding affinities among 15 open-source tools.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper argues that computational tools can speed up antiviral discovery during outbreaks, given that most viral families with pandemic potential have no FDA-approved antiviral. It curates a dataset of 43,005 viral protein-ligand measurements from BindingDB and other sources, finding that 31 percent of the viral protein binding data in BindingDB needed careful splitting of polyprotein sequences to be usable for machine learning. The authors then benchmarked 15 open-source tools on a test set of 853 antiviral compounds across 16 protein targets from 10 virus species. Results identify Boltz-2 and DrugFormDTA as top machine-learning performers and GNINA as the best docking method, with fine-tuning raising DrugFormDTA's Pearson correlation from r = 0.5 to r = 0.7. The work also supplies a public library of approved drugs and a list of investigational and approved antivirals as a starting point for faster repurposing and combination design.

Core claim

After curating 43,005 viral protein-ligand binding measurements and splitting polyprotein sequences where needed, the authors benchmarked 15 open-source binding affinity tools on 853 antiviral compounds spanning 16 targets from 10 virus species. Boltz-2 and DrugFormDTA ranked highest among machine-learning approaches while GNINA led among docking tools, with clear performance differences across individual viral proteins. Fine-tuning DrugFormDTA on the cleaned antiviral dataset raised its Pearson correlation from 0.5 to 0.7.

What carries the argument

A custom-curated dataset of 43,005 binding measurements used to fine-tune DrugFormDTA and evaluate 15 tools on 853 antiviral compounds across 16 viral protein targets.

Load-bearing premise

The curated dataset of 43,005 measurements accurately reflects true binding affinities after polyprotein splitting, and the 853-compound test set has no leakage or biases that would inflate the reported correlations.

What would settle it

Running the top tools on a fresh independent set of measured antiviral binding affinities for the same viral proteins and verifying whether correlations near 0.7 still hold.

Original abstract

Antivirals are uniquely positioned to be deployed quickly during a new outbreak, especially when repurposed from approved drugs. Yet there are no FDA-approved antivirals for the majority of viral families with pandemic potential. Here we lay out the case for investing in technologies and techniques for antiviral drug discovery and designing antiviral combinations. We present a survey of open source datasets and computational tools for in silico antiviral drug discovery, with a particular focus on the latest AI-based systems and docking tools. We then present our custom dataset of 43,005 viral protein-ligand binding measurements that we curated from BindingDB and other sources. Importantly, we found that 31% of viral protein binding data in BindingDB required polyprotein sequences to be carefully split before the data were suitable for training or testing ML models. Using our custom dataset we fine-tuned the DrugFormDTA binding affinity prediction model (Khokhlov et al. 2025). We then benchmarked 15 open-source binding affinity prediction tools on a custom test set of 853 antiviral compounds spread across 16 different protein targets from 10 virus species. Models tested include Boltz-2, GNINA, FlowDock, Interformer, AutoDock-GPU, and others. We found that Boltz-2 and DrugFormDTA ranked highest overall among ML-based approaches, and GNINA did best among docking approaches, with notable variance across specific viral proteins. Fine-tuning DrugFormDTA on our custom cleaned antiviral dataset boosted performance from $r=0.5$ to $r=0.7$. As part of this work we also compiled a library of approved drugs and a comprehensive list of investigational and approved antiviral drugs that can be viewed at https://antivirals-database.radvac.org. Together, this work provides a foundation for future work towards new tools and platforms for rapid drug repurposing and rapid design of antiviral combinations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note

private letter to a colleague

This is a practical data-curation and benchmarking paper that gives usable rankings for antiviral binding tools plus a cleaned dataset, but the reported fine-tuning boost rests on unstated split details that could hide leakage.


The main takeaway is that the authors built a cleaned set of 43k viral protein-ligand measurements, noted that 31% of BindingDB entries needed polyprotein splitting, fine-tuned DrugFormDTA on it, and then ranked 15 tools on an 853-compound test set across 16 targets from 10 viruses. Boltz-2 and the fine-tuned model came out on top for ML methods, GNINA for docking, with clear variance by protein. They also released a public antiviral drug list at the linked site.

Referee Report

3 major / 1 minor

Summary. The manuscript surveys open-source datasets and computational tools for in silico antiviral drug discovery. It introduces a curated dataset of 43,005 viral protein-ligand binding measurements from BindingDB and other sources, noting that 31% of the viral protein binding data in BindingDB required splitting of polyprotein sequences. The authors fine-tune the DrugFormDTA model on this dataset and benchmark 15 open-source tools, including ML-based (Boltz-2, DrugFormDTA) and docking (GNINA) approaches, on a test set of 853 antiviral compounds across 16 protein targets from 10 virus species. They report that Boltz-2 and DrugFormDTA perform best among ML methods and GNINA among docking methods, with fine-tuning improving Pearson r from 0.5 to 0.7. They also provide a library of approved drugs and a list of investigational and approved antiviral drugs.

Significance. If the reported performance improvements and rankings hold under rigorous validation, this work provides a valuable benchmark and curated resources for antiviral drug discovery, particularly useful for rapid repurposing during outbreaks. The custom dataset and fine-tuning demonstration highlight the potential of domain-specific data curation to enhance ML models for binding affinity prediction.

major comments (3)

[Dataset curation and fine-tuning description] The description of the custom dataset curation and fine-tuning (abstract and methods) lacks any explicit statement of the train-test split protocol, including compound ID or SMILES deduplication steps to ensure the 853-compound test set is strictly disjoint from the 43,005 training measurements. This detail is load-bearing for the central claim that fine-tuning boosts Pearson r from 0.5 to 0.7.
[Dataset curation section] The paper notes that 31% of BindingDB viral entries required polyprotein splitting before use, but provides no validation, controls, or discussion confirming that the resulting fragment labels still reflect true experimental binding affinities rather than sequence artifacts (abstract and dataset section).
[Results and benchmarking section] No error bars, confidence intervals, or statistical tests are reported for the correlation values or tool rankings across the 16 viral targets, weakening the strength of the performance claims and variance observations.
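
The disjointness check asked for in the first major comment can be sketched in a few lines. This is an illustrative sketch only, not the authors' procedure: the function names, the toy records, and the target labels are made up here, and `normalize_smiles` is a placeholder that merely trims whitespace. A real pipeline would plug in a cheminformatics toolkit's canonical SMILES routine, because one molecule can be written as many different SMILES strings.

```python
from typing import Callable, Iterable, List, Set, Tuple

Row = Tuple[str, str, float]  # (smiles, target name, affinity); layout is an assumption


def normalize_smiles(smiles: str) -> str:
    """Placeholder canonicalizer: trims whitespace only (assumption, see lead-in)."""
    return smiles.strip()


def find_leaked_compounds(
    train_smiles: Iterable[str],
    test_smiles: Iterable[str],
    canonicalize: Callable[[str], str] = normalize_smiles,
) -> Set[str]:
    """Return canonical forms of compounds present in both training and test sets."""
    train_keys = {canonicalize(s) for s in train_smiles}
    test_keys = {canonicalize(s) for s in test_smiles}
    return train_keys & test_keys


def drop_leaked(
    train_rows: List[Row],
    test_smiles: Iterable[str],
    canonicalize: Callable[[str], str] = normalize_smiles,
) -> List[Row]:
    """Keep only training rows whose compound does not appear in the test set."""
    test_keys = {canonicalize(s) for s in test_smiles}
    return [row for row in train_rows if canonicalize(row[0]) not in test_keys]


if __name__ == "__main__":
    # Toy records with hypothetical target labels, for illustration only.
    train = [("CCO", "target_A", 5.1), ("c1ccccc1", "target_A", 4.2), (" CCN", "target_B", 6.0)]
    test = ["CCN", "CC(=O)O"]
    print(find_leaked_compounds((row[0] for row in train), test))  # {'CCN'}
    print(len(drop_leaked(train, test)))  # 2
```

Exact-match deduplication of this kind only rules out identical compounds. Close analogues of test compounds can still sit in the training set, so a stricter audit would also look at structural similarity.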

minor comments (1)

[Abstract] The abstract and text could more clearly distinguish between the full curated set used for fine-tuning and any held-out validation during fine-tuning itself.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their detailed and constructive review. The comments identify important areas for clarification and strengthening of the manuscript. We address each major comment below and will revise the manuscript accordingly to improve transparency and rigor.


Referee: The description of the custom dataset curation and fine-tuning (abstract and methods) lacks any explicit statement of the train-test split protocol, including compound ID or SMILES deduplication steps to ensure the 853-compound test set is strictly disjoint from the 43,005 training measurements. This detail is load-bearing for the central claim that fine-tuning boosts Pearson r from 0.5 to 0.7.

Authors: We agree that an explicit description of the split protocol is essential for validating the fine-tuning results. In the revised manuscript, we will add a dedicated subsection in Methods detailing the train-test split. The 853-compound test set was constructed by first identifying all antiviral compounds from the curated sources, then removing any entries sharing identical compound IDs or canonical SMILES strings with the remaining 43,005 training measurements. We will include the exact deduplication procedure, the number of compounds removed during this step, and a summary table showing overlap statistics before and after filtering. revision: yes
Referee: The paper notes that 31% of BindingDB viral entries required polyprotein splitting before use, but provides no validation, controls, or discussion confirming that the resulting fragment labels still reflect true experimental binding affinities rather than sequence artifacts (abstract and dataset section).

Authors: We acknowledge the need for greater transparency on this curation step. In the revised dataset section, we will expand the description of the polyprotein splitting procedure, including the criteria used to identify cleavage sites and the rationale that binding measurements in BindingDB are typically reported against specific domains or fragments. We will add a limitations paragraph discussing the possibility of sequence artifacts and note that, where possible, we cross-checked a subset of split entries against literature-reported affinities for the isolated domains. Full validation against orthogonal experimental data is beyond the scope of the current work but will be flagged as an area for future improvement. revision: partial
Referee: No error bars, confidence intervals, or statistical tests are reported for the correlation values or tool rankings across the 16 viral targets, weakening the strength of the performance claims and variance observations.

Authors: We agree that quantitative uncertainty estimates and statistical comparisons would strengthen the benchmarking results. In the revised Results section, we will recompute all Pearson r values with bootstrap-derived 95% confidence intervals (1,000 resamples per target) and report them alongside the point estimates. We will also add pairwise statistical tests (e.g., Steiger’s test for dependent correlations or Wilcoxon signed-rank tests on per-target performance) to evaluate whether observed differences between tools are significant, with p-values corrected for multiple comparisons. revision: yes
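
The bootstrap confidence intervals mentioned in the last response could look roughly like the sketch below. It is a minimal percentile bootstrap written with the standard library only. The function names, the default seed, and the choice of the percentile method are assumptions made for illustration, not details taken from the paper or the rebuttal.

```python
import math
import random
from typing import List, Sequence, Tuple


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns NaN when either variable has zero variance."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return float("nan")
    return cov / math.sqrt(var_x * var_y)


def bootstrap_ci(
    x: Sequence[float],
    y: Sequence[float],
    n_resamples: int = 1000,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float, float]:
    """Return (point estimate, lower bound, upper bound) via a percentile bootstrap."""
    rng = random.Random(seed)
    n = len(x)
    stats: List[float] = []
    for _ in range(n_resamples):
        # Resample (prediction, measurement) pairs with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        r = pearson_r([x[i] for i in idx], [y[i] for i in idx])
        if not math.isnan(r):  # skip degenerate resamples with zero variance
            stats.append(r)
    if not stats:
        raise ValueError("all bootstrap resamples were degenerate")
    stats.sort()
    lower = stats[int((alpha / 2) * (len(stats) - 1))]
    upper = stats[int((1 - alpha / 2) * (len(stats) - 1))]
    return pearson_r(x, y), lower, upper
```

Calling `bootstrap_ci(predicted, measured)` once per viral target would give one interval per target. With small per-target sample sizes the intervals can be wide, which is one reason the referee's request matters for interpreting tool rankings.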

Circularity Check

0 steps flagged

No circularity: empirical benchmarking on external held-out data


The paper performs direct empirical benchmarking of existing tools (Boltz-2, GNINA, DrugFormDTA etc.) against BindingDB-derived measurements and a stated custom test set of 853 compounds. Fine-tuning DrugFormDTA is an explicit training step on the 43k set followed by separate evaluation; no equations, predictions, or uniqueness claims reduce by construction to author-defined quantities or self-citations. The central results are falsifiable correlations on external data, not self-referential derivations.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 0 invented entities

The central claims rest on the reliability of public binding databases and the correctness of the authors' polyprotein-splitting procedure. No free parameters are introduced beyond standard ML training; no new physical entities are postulated.

axioms (1)

domain assumption: Binding measurements in BindingDB and other sources accurately reflect experimental affinities after polyprotein splitting
This underpins the entire curated dataset of 43,005 measurements and the subsequent fine-tuning and benchmarking.
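
To make the polyprotein-splitting step concrete, here is a minimal sketch of cutting one long sequence into named mature chains. The paper's actual procedure and cleavage-site criteria are not described in this summary, so everything below is an assumption: the function name, the `(name, start, end)` boundary format with 1-based inclusive positions, and the toy sequence.

```python
from typing import Dict, List, Tuple


def split_polyprotein(
    sequence: str,
    boundaries: List[Tuple[str, int, int]],
) -> Dict[str, str]:
    """Cut a polyprotein into named chains.

    Each boundary is (name, start, end) with 1-based inclusive residue
    positions (an assumed convention for this sketch).
    """
    chains: Dict[str, str] = {}
    for name, start, end in boundaries:
        if not (1 <= start <= end <= len(sequence)):
            raise ValueError(f"{name}: range {start}-{end} is outside 1-{len(sequence)}")
        # Convert 1-based inclusive coordinates to a Python slice.
        chains[name] = sequence[start - 1:end]
    return chains


if __name__ == "__main__":
    toy = "AAAAACCCCCGGGGG"  # made-up sequence, not a real viral polyprotein
    parts = split_polyprotein(toy, [("chain_1", 1, 5), ("chain_2", 6, 10), ("chain_3", 11, 15)])
    print(parts)  # {'chain_1': 'AAAAA', 'chain_2': 'CCCCC', 'chain_3': 'GGGGG'}
```

The hard part in practice is deciding which chain a given binding measurement refers to, and that is exactly the assumption recorded in the axiom above. The slicing itself is trivial once the boundaries are trusted.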

pith-pipeline@v0.9.0 · 5655 in / 1419 out tokens · 47098 ms · 2026-05-08T17:00:40.496358+00:00 · methodology


Reference graph

Works this paper leans on

199 extracted references · 188 canonical work pages

[1]

Journal of The Royal Society Interface 19(190) (2022) https://doi.org/10.1098/rsif.2022.0275

Barbosa Libotte, G., Anjos, L., Almeida, R., Mara Cardoso Malta, S., Andrade Medronho, R.: Impacts of a delayed and slow-paced vaccination on cases and deaths during the COVID-19 pandemic: a modelling study. Journal of The Royal Society Interface 19(190) (2022) https://doi.org/10.1098/rsif.2022.0275

work page doi:10.1098/rsif.2022.0275 2022
[2]

Theoretical Biology and Medical Modelling 18(1) (2021) https://doi.org/10.1186/s12976-021-00143-0

Amaku, M., Covas, D.T., Coutinho, F.A.B., Azevedo, R.S., Massad, E.: Modelling the impact of delaying vaccination against SARS-CoV-2 assuming unlimited vaccine supply. Theoretical Biology and Medical Modelling 18(1) (2021) https://doi.org/10.1186/s12976-021-00143-0

work page doi:10.1186/s12976-021-00143-0 2021
[3]

Chaos: An Interdisciplinary Journal of Nonlinear Science 31(4) (2021) https://doi.org/10.1063/5.0050887

Faranda, D., Alberti, T., Arutkin, M., Lembo, V., Lucarini, V.: Interrupting vaccination policies can greatly spread SARS-CoV-2 and enhance mortality from COVID-19 disease: The AstraZeneca case for France and Italy. Chaos: An Interdisciplinary Journal of Nonlinear Science 31(4) (2021) https://doi.org/10.1063/5.0050887

work page doi:10.1063/5.0050887 2021
[4]

Tabarrok, M.: How Many People Are in the Invisible Graveyard? Estimated that a 4-month earlier emergency authorization of Pfizer’s COVID-19 vaccine would have saved between 130,000 and 350,000 lives over the next two years of the pandemic. (2022). https://www.maximum-progress.com/p/how-many-people-are-in-the-invisible-graveyard

2022
[5]

Corbett, K.S., Edwards, D., Leist, S.R., Abiona, O.M., Boyoglu-Barnum, S., Gillespie, R.A., Himansu, S., Schäfer, A., Ziwawo, C.T., DiPiazza, A.T., Dinnon, K.H., Elbashir, S.M., Shaw, C.A., Woods, A., Fritch, E.J., Martinez, D.R., Bock, K.W., Minai, M., Nagata, B.M., Hutchinson, G.B., Bahl, K., Garcia-Dominguez, D., Ma, L., Renzi, I., Kong, W.-P., Sc...

work page doi:10.1101/2020.06.11.145920 2020
[6]

Antiviral Research 232, 106024 (2024) https://doi.org/10.1016/j.antiviral.2024.106024

Chokwassanasakulkit, T., Oti, V.B., Idris, A., McMillan, N.A.: Sirnas as antiviral drugs – current status, therapeutic potential and challenges. Antiviral Research 232, 106024 (2024) https://doi.org/10.1016/j.antiviral.2024.106024

work page doi:10.1016/j.antiviral.2024.106024 2024
[7]

Zhu, C., Lee, J.Y., Woo, J.Z., Xu, L., Wrynla, X.H., Yamashiro, L.H., Ji, F., Biering, S.B., VanDis, E., Gonzalez, F., et al.: An intranasal ASO therapeutic targeting SARS-CoV-2. Nature Communications 13(1), 4503 (2022)

2022
[8]

Naughton, B.: How to design antibodies (2026) https://doi.org/10.62211/58wh-12qp

work page doi:10.62211/58wh-12qp 2026
[9]

Pacesa, M., Nickel, L., Schellhaas, C., Schmidt, J., Pyatova, E., Kissling, L., Barendse, P., Choudhury, J., Kapoor, S., Alcaraz-Serna, A., Cho, Y., Ghamary, K.H., Vinué, L., Yachnin, B.J., Wollacott, A.M., Buckley, S., Westphal, A.H., Lindhoud, S., Georgeon, S., Goverde, C.A., Hatzopoulos, G.N., Gönczy, P., Muller, Y.D., Schwank, G., Swarts, D.C., Vecchi...

work page doi:10.1038/s41586-025-09429-6 2025
[10]

URL https://www.biorxiv.org/content/early/2025/11/24/2025.11.20.689494

Stark, H., Faltings, F., Choi, M., Xie, Y., Hur, E., O’Donnell, T., Bushuiev, A., Uçar, T., Passaro, S., Mao, W., Reveiz, M., Bushuiev, R., Pluskal, T., Sivic, J., Kreis, K., Vahdat, A., Ray, S., Goldstein, J.T., Savinov, A., Hambalek, J.A., Gupta, A., Taquiri-Diaz, D.A., Zhang, Y., Hatstat, A.K., Arada, A., Kim, N.H., Tackie-Yarboi, E., Boselli, D., Schn...

work page doi:10.1101/2025.11.20.689494 2025
[11]

Nature Reviews Microbiology 21(2), 112–124 (2022) https://doi.org/10.1038/s41579-022-00809-7

Cox, M., Peacock, T.P., Harvey, W.T., Hughes, J., Wright, D.W., Willett, B.J., Thomson, E., Gupta, R.K., Peacock, S.J., Robertson, D.L., Carabelli, A.M.: SARS-CoV-2 variant evasion of monoclonal antibodies based on in vitro studies. Nature Reviews Microbiology 21(2), 112–124 (2022) https://doi.org/10.1038/s41579-022-00809-7

work page doi:10.1038/s41579-022-00809-7 2022
[12]

Nature Reviews Drug Discovery 3(8), 673–683 (2004) https://doi.org/10.1038/nrd1468

Ashburn, T.T., Thor, K.B.: Drug repositioning: identifying and developing new uses for existing drugs. Nature Reviews Drug Discovery 3(8), 673–683 (2004) https://doi.org/10.1038/nrd1468

work page doi:10.1038/nrd1468 2004
[13]

BMJ, 1038 (2021) https://doi.org/10.1136/bmj.n1038

Prats-Uribe, A., Sena, A.G., Lai, L.Y.H., Ahmed, W.-U.-R., Alghoul, H., Alser, O., Alshammari, T.M., Areia, C., Carter, W., Casajust, P., Dawoud, D., Golozar, A., Jonnagaddala, J., Mehta, P.P., Gong, M., Morales, D.R., Nyberg, F., Posada, J.D., Recalde, M., Roel, E., Shah, K., Shah, N.H., Schilling, L.M., Subbian, V., Vizcaya, D., Zhang, L., Zhang, Y., ...

work page doi:10.1136/bmj.n1038 2021
[14]

Cell Reports 35(1), 108959 (2021) https://doi.org/10.1016/j.celrep.2021.108959

Dittmar, M., Lee, J.S., Whig, K., Segrist, E., Li, M., Kamalia, B., Castellana, L., Ayyanathan, K., Cardenas-Diaz, F.L., Morrisey, E.E., Truitt, R., Yang, W., Jurado, K., Samby, K., Ramage, H., Schultz, D.C., Cherry, S.: Drug repurposing screens reveal cell-type-specific entry pathways and FDA-approved drugs active against SARS-CoV-2. Cell Reports 35(1)...

work page doi:10.1016/j.celrep.2021.108959 2021
[15]

Journal of Medicinal Chemistry 67(12), 10263–10274 (2024) https://doi.org/10.1021/acs.jmedchem.4c00597

Glenn, I.S., Hall, L.N., Khalid, M.M., Ott, M., Shoichet, B.K.: Colloidal aggregation confounds cell-based Covid-19 antiviral screens. Journal of Medicinal Chemistry 67(12), 10263–10274 (2024) https://doi.org/10.1021/acs.jmedchem.4c00597

work page doi:10.1021/acs.jmedchem.4c00597 2024
[16]

Science 373(6554), 541–547 (2021) https://doi.org/10.1126/science.abi4708

Tummino, T.A., Rezelj, V.V., Fischer, B., Fischer, A., O’Meara, M.J., Monel, B., Vallet, T., White, K.M., Zhang, Z., Alon, A., Schadt, H., O’Donnell, H.R., Lyu, J., Rosales, R., McGovern, B.L., Rathnasinghe, R., Jangra, S., Schotsaert, M., Galarneau, J.-R., Krogan, N.J., Urban, L., Shokat, K.M., Kruse, A.C., García-Sastre, A., Schwartz, O., Moretti, F., Vignuzzi, M., P...

work page doi:10.1126/science.abi4708 2021
[17]

Frontiers in Pharmacology 12 (2021) https://doi.org/10.3389/fphar.2021.660710

Li, X., Peng, T.: Strategy, progress, and challenges of drug repurposing for efficient antiviral discovery. Frontiers in Pharmacology 12 (2021) https://doi.org/10.3389/fphar.2021.660710

work page doi:10.3389/fphar.2021.660710 2021
[18]

Kim, Y.-S., Jeon, S.-H., Kim, J., Koh, J.H., Ra, S.W., Kim, J.W., Kim, Y., Kim, C.K., Shin, Y.C., Kang, B.D., Kang, S.j., Park, C.H., Lee, B., Lee, J.Y., Lee, C.H., Choi, J.-p., Kim, J.Y., Yu, S.N., Peck, K.R., Kim, S.-H., Heo, J.Y., Kim, H.a., Park, H.-j., Choi, J., Han, J., Kim, J., Kim, H.j., Han, S.H., Yoon, A., Park, M., Park, S., Kim, Y., Jung, M., ...
[19]

Antimicrobial Agents and Chemotherapy 67(1) (2023) https://doi.org/10.1128/aac.00452-22

work page doi:10.1128/aac.00452-22 2023
[20]

Virus Research 264, 22–31 (2019) https://doi.org/10.1016/j.virusres.2019.02.011

García-Serradilla, M., Risco, C., Pacheco, B.: Drug repurposing for new, efficient, broad spectrum antivirals. Virus Research 264, 22–31 (2019) https://doi.org/10.1016/j.virusres.2019.02.011

work page doi:10.1016/j.virusres.2019.02.011 2019
[21]

Scientific Reports 16(1) (2026) https://doi.org/10.1038/s41598-026-35292-0

Zidan, A.A., Gad, A.Y.S., Zakaria, N.H., El-Hariri, H.M., Elsharnouby, N.M., Helmy, M.W., El-Setouhy, M.: Effectiveness and safety of cyclosporine a in moderate to severe COVID-19: a randomized, open-label trial. Scientific Reports 16(1) (2026) https://doi.org/10.1038/s41598-026-35292-0

work page doi:10.1038/s41598-026-35292-0 2026
[22]

International Journal of Molecular Sciences 26(16), 7900 (2025) https://doi.org/10.3390/ijms26167900

Elhabyan, A., Khan, M.U.S., Elhabyan, A., Abukhatwa, R., Uzair, H., Jimenez, C., Elhabyan, A., Chan, Y.L., Shabana, B.: Broad-spectrum antiviral activity of cyclophilin inhibitors against coronaviruses: A systematic review. International Journal of Molecular Sciences 26(16), 7900 (2025) https://doi.org/10.3390/ijms26167900

work page doi:10.3390/ijms26167900 2025
[23]

Antiviral Research 110, 94–103 (2014) https://doi.org/10.1016/j.antiviral.2014.07.014

Rossignol, J.-F.: Nitazoxanide: A first-in-class broad-spectrum antiviral agent. Antiviral Research 110, 94–103 (2014) https://doi.org/10.1016/j.antiviral.2014.07.014

work page doi:10.1016/j.antiviral.2014.07.014 2014
[24]

iScience 19, 1279–1290 (2019) https://doi.org/10.1016/j.isci.2019.07.003

Jasenosky, L.D., Cadena, C., Mire, C.E., Borisevich, V., Haridas, V., Ranjbar, S., Nambu, A., Bavari, S., Soloveva, V., Sadukhan, S., Cassell, G.H., Geisbert, T.W., Hur, S., Goldfeld, A.E.: The FDA-approved oral drug nitazoxanide amplifies host antiviral responses and inhibits Ebola virus. iScience 19, 1279–1290 (2019) https://doi.org/10.1016/j.isci.2019.07.003

work page doi:10.1016/j.isci.2019.07.003 2019
[25]

The Lancet Infectious Diseases 14(7), 609–618 (2014) https://doi.org/10.1016/s1473-3099(14)70717-0

Haffizulla, J., Hartman, A., Hoppers, M., Resnick, H., Samudrala, S., Ginocchio, C., Bardin, M., Rossignol, J.-F.: Effect of nitazoxanide in adults and adolescents with acute uncomplicated influenza: a double-blind, randomised, placebo-controlled, phase 2b/3 trial. The Lancet Infectious Diseases 14(7), 609–618 (2014) https://doi.org/10.1016/s1473-3099(1...

work page doi:10.1016/s1473-3099(14)70717-0 2014
[26]

The Lancet 368(9530), 124–129 (2006) https://doi.org/10.1016/s0140-6736(06)68852-1

Rossignol, J.-F., Abu-Zekry, M., Hussein, A., Santoro, M.G.: Effect of nitazoxanide for treatment of severe rotavirus diarrhoea: randomised double-blind placebo-controlled trial. The Lancet 368(9530), 124–129 (2006) https://doi.org/10.1016/s0140-6736(06)68852-1

work page doi:10.1016/s0140-6736(06)68852-1 2006
[27]

Rossignol, J.-F., Bardin, M.C., Fulgencio, J., Mogelnicki, D., Bréchot, C.: A randomized double-blind placebo-controlled clinical trial of nitazoxanide for treatment of mild or moderate COVID-19. eClinicalMedicine 45, 101310 (2022) https://doi.org/10.1016/j.eclinm.2022.101310

work page doi:10.1016/j.eclinm.2022.101310 2022
[28]

Proceedings of the National Academy of Sciences 121(18) (2024) https://doi.org/10.1073/pnas.2319566121

Mao, T., Kim, J., Peña-Hernández, M.A., Valle, G., Moriyama, M., Luyten, S., Ott, I.M., Gomez-Calvo, M.L., Gehlhausen, J.R., Baker, E., Israelow, B., Slade, M., Sharma, L., Liu, W., Ryu, C., Korde, A., Lee, C.J., Silva Monteiro, V., Lucas, C., Dong, H., Yang, Y., Gopinath, S., Wilen, C.B., Palm, N., Dela Cruz, C.S., Iwasaki, A., Vogels, C.B.F., Hahn, A...

work page doi:10.1073/pnas.2319566121 2024
[29]

Accessed: 2026-04-03 (2024)

Elton, D.C.: Who should be our next FDA Commissioner? More is Different (Substack). Accessed: 2026-04-03 (2024). https://moreisdifferent.blog/p/who-should-be-our-next-fda-commissioner

2026
[30]

Asimov Press (2024) https://doi.org/10.62211/72pr-26gf

Wang, B.: Day zero antivirals for future pandemics. Asimov Press (2024) https://doi.org/10.62211/72pr-26gf . Accessed 2026-04-03

work page doi:10.62211/72pr-26gf 2024
[31]

Science Advances 11(35) (2025) https://doi.org/10.1126/sciadv.ady3554

Ezzatpour, S., Thakur, K., Erzoah Ndede, K., Buchholz, D.W., Choi, A., Imbiakha, B., Carter, J., Onofrei, D., Eaton, B., Postnikova, E., Murphy, M., Tapia, B.C., Bello, D., Pasari, S., Russo, A., Babayev, M., Holland, G.P., Holbrook, M.R., Caddy, S.L., Moran, S.J., Davachi, S.M., Monreal, I.A., Sahler, J., Ortega, V., Miranda, J.M., Whittaker, G.R., Jager...

work page doi:10.1126/sciadv.ady3554 2025
[33]

ACS Medicinal Chemistry Letters 11(12), 2526–2533 (2020) https://doi.org/10.1021/acsmedchemlett.0c00521

Ghahremanpour, M.M., Tirado-Rives, J., Deshmukh, M., Ippolito, J.A., Zhang, C.-H., Vaca, I., Liosi, M.-E., Anderson, K.S., Jorgensen, W.L.: Identification of 14 known drugs as inhibitors of the main protease of SARS-CoV-2. ACS Medicinal Chemistry Letters 11(12), 2526–2533 (2020) https://doi.org/10.1021/acsmedchemlett.0c00521

work page doi:10.1021/acsmedchemlett.0c00521 2020
[34]

Precision Clinical Medicine 4(1), 1–16 (2021) https://doi.org/10.1093/pcmedi/pbab001

Hosseini, M., Chen, W., Xiao, D., Wang, C.: Computational molecular docking and virtual screening revealed promising SARS-CoV-2 drugs. Precision Clinical Medicine 4(1), 1–16 (2021) https://doi.org/10.1093/pcmedi/pbab001

work page doi:10.1093/pcmedi/pbab001 2021
[35]

Medical Hypotheses 173, 111047 (2023) https://doi.org/10.1016/j.mehy.2023.111047

Buck, C.B.: The mint versus Covid hypothesis. Medical Hypotheses 173, 111047 (2023) https://doi.org/10.1016/j.mehy.2023.111047

work page doi:10.1016/j.mehy.2023.111047 2023
[36]

Journal of Research in Pharmacy Practice 12(4), 141–147 (2023)

Alikiaie, B., Shalamzari, S.M.H., Soltani, R., Yegdaneh, A., Mousavi, S.: Efficacy of licorice as adjunctive therapy in critically ill patients with COVID-19: A randomized, placebo-controlled, double-blind clinical trial. Journal of Research in Pharmacy Practice 12(4), 141–147 (2023)

2023
[37]

Antiviral Research 234, 106079 (2025) https://doi.org/10.1016/j.antiviral.2025.106079

Kainov, D.E., Ravlo, E., Ianevski, A.: Seeking innovative concepts in development of antiviral drug combinations. Antiviral Research 234, 106079 (2025) https://doi.org/10.1016/j.antiviral.2025.106079

work page doi:10.1016/j.antiviral.2025.106079 2025
[38]

arXiv (2025)

Huang, Y., Su, X., Ullanat, V., Moon, I., Liang, I., Clegg, L., Olabode, D., Johnson, R., Ho, N., Gibbs, M., Gibbs, M., Gusev, A., John, B., Zitnik, M.: Multimodal AI predicts clinical outcomes of drug combinations from preclinical data. arXiv (2025). https://doi.org/10.48550/ARXIV.2503.02781

work page doi:10.48550/arxiv.2503.02781 2025
[39]

Molecular Therapy 29(2), 873–885 (2021) https://doi.org/10.1016/j.ymthe.2020.12.016

Bobrowski, T., Chen, L., Eastman, R.T., Itkin, Z., Shinn, P., Chen, C.Z., Guo, H., Zheng, W., Michael, S., Simeonov, A., Hall, M.D., Zakharov, A.V., Muratov, E.N.: Synergistic and antagonistic drug combinations against SARS-CoV-2. Molecular Therapy 29(2), 873–885 (2021) https://doi.org/10.1016/j.ymthe.2020.12.016

work page doi:10.1016/j.ymthe.2020.12.016 2021
[40]

Viruses 15(7), 1577 (2023) https://doi.org/10.3390/v15071577

Gidari, A., Sabbatini, S., Schiaroli, E., Bastianelli, S., Pierucci, S., Busti, C., Saraca, L.M., Capogrossi, L., Pasticci, M.B., Francisci, D.: Synergistic activity of Remdesivir–Nirmatrelvir combination on a SARS-CoV-2 in vitro model and a case report. Viruses 15(7), 1577 (2023) https://doi.org/10.3390/v15071577

work page doi:10.3390/v15071577 2023
[41]

The Lancet Infectious Diseases 24(11), 1213–1224 (2024) https://doi.org/10.1016/s1473-3099(24)00353-0

Choi, M.H., Wan, E.Y.F., Wong, I.C.K., Chan, E.W.Y., Chu, W.M., Tam, A.R., Yuen, K.Y., Hung, I.F.N.: Comparative effectiveness of combination therapy with Nirmatrelvir–Ritonavir and Remdesivir versus monotherapy with Remdesivir or Nirmatrelvir–Ritonavir in patients hospitalised with COVID-19: a target trial emulation study. The Lancet Infectious Diseases ...

work page doi:10.1016/s1473-3099(24)00353-0 2024
[42]

Drug Discovery Today 26(10), 2367–2376 (2021) https://doi.org/10.1016/j.drudis.2021.05.008

Shyr, Z.A., Cheng, Y.-S., Lo, D.C., Zheng, W.: Drug combination therapy for emerging viral diseases. Drug Discovery Today 26(10), 2367–2376 (2021) https://doi.org/10.1016/j.drudis.2021.05.008

work page doi:10.1016/j.drudis.2021.05.008 2021
[43]

arXiv preprint arXiv:2504.06196 (2025)

Wang, E., Schmidgall, S., Jaeger, P.F., Zhang, F., Pilgrim, R., Matias, Y., Barral, J., Fleet, D., Azizi, S.: TxGemma: Efficient and Agentic LLMs for Therapeutics. arXiv (2025). https://doi.org/10.48550/ARXIV.2504.06196

work page doi:10.48550/arxiv.2504.06196 2025
[44]

Cell Host & Microbe 31(6), 856–860 (2023) https://doi.org/10.1016/j.chom.2023.05.012

Jochmans, D., Laporte, M., Neyts, J.: Antiviral strategies for epidemic and pandemic preparedness. Cell Host & Microbe 31(6), 856–860 (2023) https://doi.org/10.1016/j.chom.2023.05.012

work page doi:10.1016/j.chom.2023.05.012 2023
[45]

Molecular Systems Design & Engineering 4(4), 828–849 (2019) https://doi.org/10.1039/c9me00039a

Elton, D.C., Boukouvalas, Z., Fuge, M.D., Chung, P.W.: Deep learning for molecular design—a review of the state of the art. Molecular Systems Design & Engineering 4(4), 828–849 (2019) https://doi.org/10.1039/c9me00039a

work page doi:10.1039/c9me00039a 2019
[47]

Journal of Computer-Aided Molecular Design 22(3–4), 147–159 (2007) https://doi.org/10.1007/s10822-007-9150-y

Cleves, A.E., Jain, A.N.: Effects of inductive bias on computational evaluations of ligand-based modeling and on drug discovery. Journal of Computer-Aided Molecular Design 22(3–4), 147–159 (2007) https://doi.org/10.1007/s10822-007-9150-y

work page doi:10.1007/s10822-007-9150-y 2007
[48]

Bioinformatics 35(24), 5191–5198 (2019) https://doi.org/10.1093/bioinformatics/btz418

Zeng, X., Zhu, S., Liu, X., Zhou, Y., Nussinov, R., Cheng, F.: deepdr: a network-based deep learning approach to in-silico drug repositioning. Bioinformatics 35(24), 5191–5198 (2019) https://doi.org/10.1093/bioinformatics/btz418

work page doi:10.1093/bioinformatics/btz418 2019
[49]

Journal of Proteome Research 19(11), 4624–4636 (2020) https://doi.org/10.1021/acs.jproteome.0c00316

Zeng, X., Song, X., Ma, T., Pan, X., Zhou, Y., Hou, Y., Zhang, Z., Li, K., Karypis, G., Cheng, F.: Repurpose open data to discover therapeutics for COVID-19 using deep learning. Journal of Proteome Research 19(11), 4624–4636 (2020) https://doi.org/10.1021/acs.jproteome.0c00316

work page doi:10.1021/acs.jproteome.0c00316 2020
[50]

Cell Discovery 6(1) (2020) https://doi.org/10.1038/s41421-020-0153-3

Zhou, Y., Hou, Y., Shen, J., Huang, Y., Martin, W., Cheng, F.: Network-based drug repurposing for novel coronavirus 2019-nCoV/SARS-CoV-2. Cell Discovery 6(1) (2020) https://doi.org/10.1038/s41421-020-0153-3

work page doi:10.1038/s41421-020-0153-3 2019
[51]

Heliyon 9(3), 14059 (2023) https://doi.org/10.1016/j.heliyon.2023.e14059

Wang, X., Wang, H., Yin, G., Zhang, Y.D.: Network-based drug repurposing for the treatment of COVID-19 patients in different clinical stages. Heliyon 9(3), 14059 (2023) https://doi.org/10.1016/j.heliyon.2023.e14059

work page doi:10.1016/j.heliyon.2023.e14059 2023
[52]

Nature Medicine 30(12), 3601–3613 (2024) https://doi.org/10.1038/s41591-024-03233-x

Huang, K., Chandak, P., Wang, Q., Havaldar, S., Vaid, A., Leskovec, J., Nadkarni, G.N., Glicksberg, B.S., Gehlenborg, N., Zitnik, M.: A foundation model for clinician-centered drug repurposing. Nature Medicine 30(12), 3601–3613 (2024) https://doi.org/10.1038/s41591-024-03233-x

work page doi:10.1038/s41591-024-03233-x 2024
[53]

Morselli Gysi, D., Valle, I., Zitnik, M., Ameli, A., Gan, X., Varol, O., Ghiassian, S.D., Patten, J.J., Davey, R.A., Loscalzo, J., Barabási, A.-L.: Network medicine framework for identifying drug-repurposing opportunities for COVID-19. Proceedings of the National Academy of Sciences 118(19) (2021) https://doi.org/10.1073/pnas.2025581118

work page doi:10.1073/pnas.2025581118 2021
[54]

Korula, P., Alexander, H., John, J.S., Kirubakaran, R., Singh, B., Tharyan, P., Rupali, P.: Favipiravir for treating COVID-19. Cochrane Database of Systematic Reviews (5) (2024) https://doi.org/10.1002/14651858.CD015219.pub2

work page doi:10.1002/14651858.cd015219.pub2 2024
[55]

Guterres, H., Park, S.-J., Jiang, W., Im, W.: Ligand-binding-site refinement to generate reliable holo protein structure conformations from apo structures. Journal of Chemical Information and Modeling 61(1), 535–546 (2020) https://doi.org/10.1021/acs.jcim.0c01354

work page doi:10.1021/acs.jcim.0c01354 2020
[56]

Khokhlov, I., Tashchilova, A., Bugaev-Makarovskiy, N., Glushkova, O., Yudin, V., Keskinov, A., Yudin, S., Svetlichnyy, D., Skvortsova, V.: DrugForm-DTA: Towards real-world drug-target binding affinity model. Computational and Structural Biotechnology Journal 27, 4106–4120 (2025) https://doi.org/10.1016/j.csbj.2025.09.023

work page doi:10.1016/j.csbj.2025.09.023 2025
[57]

Rogers, D.M., Agarwal, R., Vermaas, J.V., Smith, M.D., Rajeshwar, R.T., Cooper, C., Sedova, A., Boehm, S., Baker, M., Glaser, J., Smith, J.C.: SARS-CoV-2 billion-compound docking. Scientific Data 10(1) (2023) https://doi.org/10.1038/s41597-023-01984-9

work page doi:10.1038/s41597-023-01984-9 2023
[58]

Öztürk, H., Özgür, A., Ozkirimli, E.: DeepDTA: deep drug–target binding affinity prediction. Bioinformatics 34(17), 821–829 (2018) https://doi.org/10.1093/bioinformatics/bty593

work page doi:10.1093/bioinformatics/bty593 2018
[59]

Tang, J., Szwajda, A., Shakyawar, S., Xu, T., Hintsanen, P., Wennerberg, K., Aittokallio, T.: Making sense of large-scale kinase inhibitor bioactivity data sets: A comparative and integrative analysis. Journal of Chemical Information and Modeling 54(3), 735–743 (2014) https://doi.org/10.1021/ci400709d

work page doi:10.1021/ci400709d 2014
[60]

Landrum, G.A., Riniker, S.: Combining IC50 or Ki values from different sources is a source of significant noise. Journal of Chemical Information and Modeling 64(5), 1560–1567 (2024) https://doi.org/10.1021/acs.jcim.4c00049

work page doi:10.1021/acs.jcim.4c00049 2024
[61]

Martin, H.-J., Melo-Filho, C.C., Korn, D., Eastman, R.T., Rai, G., Simeonov, A., Zakharov, A.V., Muratov, E., Tropsha, A.: Small molecule antiviral compound collection (SMACC): A comprehensive, highly curated database to support the discovery of broad-spectrum antiviral drug molecules. Antiviral Research 217, 105620 (2023) https://doi.org/10.1016/j.an...

work page doi:10.1016/j.antiviral.2023.105620 2023
[62]

Liu, T., Lin, Y., Wen, X., Jorissen, R.N., Gilson, M.K.: BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Research 35(Database), 198–201 (2007) https://doi.org/10.1093/nar/gkl999

work page doi:10.1093/nar/gkl999 2007
[63]

Gilson, M.K., Liu, T., Baitaluk, M., Nicola, G., Hwang, L., Chong, J.: BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Research 44(D1), 1045–1053 (2015) https://doi.org/10.1093/nar/gkv1072

work page doi:10.1093/nar/gkv1072 2015
[64]

Wang, R., Fang, X., Lu, Y., Wang, S.: The PDBbind database: Collection of binding affinities for protein-ligand complexes with known three-dimensional structures. Journal of Medicinal Chemistry 44(6), 2114–2125 (2004) https://doi.org/10.1021/ci034244e

work page doi:10.1021/ci034244e 2004
[65]

Wang, R., Fang, X., Lu, Y., Yang, C.-Y., Wang, S.: The PDBbind database: Methodologies and updates. Journal of Medicinal Chemistry 48(12), 4111–4119 (2005) https://doi.org/10.1021/jm048957q

work page doi:10.1021/jm048957q 2005
[66]

Mysinger, M.M., Carchia, M., Irwin, J.J., Shoichet, B.K.: Directory of useful decoys, enhanced (DUD-E): Better ligands and decoys for better benchmarking. Journal of Medicinal Chemistry 55(14), 6582–6594 (2012) https://doi.org/10.1021/jm300687e

work page doi:10.1021/jm300687e 2012
[67]

Tran-Nguyen, V.-K., Jacquemard, C., Rognan, D.: LIT-PCBA: An unbiased data set for machine learning and virtual screening. Journal of Chemical Information and Modeling 60(9), 4263–4273 (2020) https://doi.org/10.1021/acs.jcim.0c00155

work page doi:10.1021/acs.jcim.0c00155 2020
[68]

Huang, A., Knight, I.S., Naprienko, S.: Data Leakage and Redundancy in the LIT-PCBA Benchmark. arXiv (2025). https://doi.org/10.48550/ARXIV.2507.21404

work page doi:10.48550/arxiv.2507.21404 2025
[69]

Korlepara, D.B., Vasavi, C.S., Jeurkar, S., Pal, P.K., Roy, S., Mehta, S., Sharma, S., Kumar, V., Muvva, C., Sridharan, B., Garg, A., Modee, R., Bhati, A.P., Nayar, D., Priyakumar, U.D.: PLAS-5k: Dataset of protein-ligand affinities from molecular dynamics for machine learning applications. Scientific Data 9(1) (2022) https://doi.org/10.1038/s41597-022-01631-9

work page doi:10.1038/s41597-022-01631-9 2022
[70]

In: ICML’24 Workshop ML for Life and Material Science: From Theory to Industry Applications, Vienna, Austria (2024)

Durairaj, J., Adeshina, Y., Cao, Z., Zhang, X., Oleinikovas, V., Duignan, T., McClure, Z., Robin, X., Studer, G., Kovtun, D., Rossi, E., Zhou, G., Veccham, S., Isert, C., Peng, Y., Sundareson, P., Akdel, M., Corso, G., Stärk, H., Tauriello, G., Carpenter, Z., Bronstein, M., Kucukbenli, E., Schwede, T., Naef, L.: PLINDER: The protein-ligand interactions da...

2024
[71]

Zhang, X., Zhang, O., Shen, C., Qu, W., Chen, S., Cao, H., Kang, Y., Wang, Z., Wang, E., Zhang, J., Deng, Y., Liu, F., Wang, T., Du, H., Wang, L., Pan, P., Chen, G., Hsieh, C.-Y., Hou, T.: Efficient and accurate large library ligand docking with KarmaDock. Nature Computational Science 3(9), 789–804 (2023) https://doi.org/10.1038/s43588-023-00511-5

work page doi:10.1038/s43588-023-00511-5 2023
[72]

Su, M., Yang, Q., Du, Y., Feng, G., Liu, Z., Li, Y., Wang, R.: Comparative assessment of scoring functions: The CASF-2016 update. Journal of Chemical Information and Modeling 59(2), 895–913 (2018) https://doi.org/10.1021/acs.jcim.8b00545

work page doi:10.1021/acs.jcim.8b00545 2016
[73]

Graber, D., Stockinger, P., Meyer, F., Mishra, S., Horn, C., Buller, R.: Resolving data bias improves generalization in binding affinity prediction. Nature Machine Intelligence 7(10), 1713–1725 (2025) https://doi.org/10.1038/s42256-025-01124-5

work page doi:10.1038/s42256-025-01124-5 2025
[74]

Davis, M.I., Hunt, J.P., Herrgard, S., Ciceri, P., Wodicka, L.M., Pallares, G., Hocker, M., Treiber, D.K., Zarrinkar, P.P.: Comprehensive analysis of kinase inhibitor selectivity. Nature Biotechnology 29(11), 1046–1051 (2011) https://doi.org/10.1038/nbt.1990

work page doi:10.1038/nbt.1990 2011
[75]

Ianevski, A., Ahmad, S., Anunnitipat, K., Oksenych, V., Zusinaite, E., Tenson, T., Bjørås, M., Kainov, D.E.: Seven classes of antiviral agents. Cellular and Molecular Life Sciences 79(12) (2022) https://doi.org/10.1007/s00018-022-04635-1

work page doi:10.1007/s00018-022-04635-1 2022
[76]

Huang, J., Song, Q., Zhang, P., Deng, L., Gao, F., Deng, Y., Krol, E., Růžek, D., Khouri, R., De Clercq, E., Li, G.: Antiviraldb: an expert-curated database of antiviral agents against human infectious diseases. mBio (2025) https://doi.org/10.1128/mbio.02013-25

work page doi:10.1128/mbio.02013-25 2025
[77]

Martin, E., Tropsha, A., Muratov, E.N.: Heli-SMACC: a large bioactivity dataset for helicase inhibitors from ChEMBL. Journal of Cheminformatics 16(1) (2024) https://doi.org/10.1186/s13321-024-00864-9

work page doi:10.1186/s13321-024-00864-9 2024
[78]

Knox, C., Wilson, M., Klinger, C.M., Franklin, M., Ober, E., Doshi, A., Wishart, D.S.: DrugBank 6.0: the DrugBank knowledgebase for 2024. Nucleic Acids Research 52(D1), 1265–1275 (2024) https://doi.org/10.1093/nar/gkad976

work page doi:10.1093/nar/gkad976 2024
[79]

Zhou, Y., Zhang, Y., Zhao, D., Yu, X., Shen, X., Zhou, Y., Wang, S., Qiu, Y., Chen, Y., Zhu, F.: TTD: Therapeutic Target Database describing target druggability information. Nucleic Acids Research 52(D1), 1465–1477 (2024) https://doi.org/10.1093/nar/gkad751

work page doi:10.1093/nar/gkad751 2024
[80]

Majidifar, S., Zabihian, A., Hooshmand, M.: Combination therapy synergism prediction for virus treatment using machine learning models. PLOS ONE 19(9), 0309733 (2024) https://doi.org/10.1371/journal.pone.0309733

work page doi:10.1371/journal.pone.0309733 2024
[81]

Jin, W., Stokes, J.M., Eastman, R.T., Itkin, Z., Zakharov, A.V., Collins, J.J., Jaakkola, T.S., Barzilay, R.: Deep learning identifies synergistic drug combinations for treating COVID-19. Proceedings of the National Academy of Sciences 118(39) (2021) https://doi.org/10.1073/pnas.2105070118

work page doi:10.1073/pnas.2105070118 2021
[82]

Klamt, A., Schüürmann, G.: COSMO: a new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient. J. Chem. Soc., Perkin Trans. 2 (5), 799–805 (1993) https://doi.org/10.1039/p29930000799

work page doi:10.1039/p29930000799 1993
