Hunting Structural Demons in Digital Reticular Chemistry: Lessons from Metal-Organic Frameworks
Pith reviewed 2026-05-14 22:46 UTC · model grok-4.3
The pith
More than half of the top-performing candidates in major computational screenings of metal-organic frameworks are chemically invalid due to structural errors.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that structural errors in both experimental and hypothetical MOF databases generate chemically invalid structures that dominate the top results of computational screening campaigns, and that these errors can be reduced by preserving raw diffraction data alongside synthesis information, applying consistent curation rules at database entry, and filtering topology choices prior to structure generation.
What carries the argument
Structural demons: erroneous crystal structure models that arise either from converting disordered experimental diffraction data into fully specified simulation inputs or from encoding chemically implausible features in hypothetical structures, which then propagate invalid high-ranking candidates through screening workflows.
If this is right
- Integrating raw diffraction data with synthesis details from the start prevents many conversion errors from entering experimental databases.
- Consistent curation protocols applied when structures are added to a database reduce the number of implausible entries that reach screening.
- Filtering topology choices before generating hypothetical structures keeps chemically unreasonable models from being created in the first place.
- Downstream databases receive fewer bad structures, lowering the amount of post-hoc correction needed for reliable screening results.
Where Pith is reading between the lines
- If structural demons are widespread, many previously published computational rankings of MOFs for gas separation or catalysis may require re-evaluation with corrected inputs.
- The same error patterns could appear in related classes of reticular materials such as covalent organic frameworks once large hypothetical databases are built for them.
- Automated validation checks for oxidation-state balance and coordination geometry could be inserted early in database pipelines to catch demons before screening begins.
Load-bearing premise
The majority of chemically invalid top candidates come from structural model errors rather than from force-field inaccuracies or limitations in the screening methodology itself.
What would settle it
Re-run one of the major published MOF screening campaigns on a version of the database in which every structure has been manually validated and corrected for oxidation states, coordination environments, and charge balance, then check whether the fraction of invalid top performers drops substantially below 50 percent.
read the original abstract
Digital reticular chemistry relies on accurate crystal structures to power computational screening, data-driven discovery, and structure-property analysis, yet recent studies reveal that more than half of the top-performing candidates in major computational screening campaigns are chemically invalid. In experimental MOF databases, structural errors arise when disordered or incomplete structural models are incorrectly converted into fully specified simulation inputs. In hypothetical MOF database, structures are complete by construction but may encode chemically implausible oxidation states, coordination environments, or charge distributions. We term these erroneous structural models "structural demons." This mini-review asks three questions: where these errors enter, how we find them, and how we prevent them. On the prevention side, the key steps are keeping diffraction data and synthesis details together from the start, using consistent curation when structures enter a database, and filtering topology choices before structure generation. Connecting these steps can keep many bad structures out of downstream databases and reduce the need to fix them later.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. This mini-review examines structural errors in MOF databases for computational reticular chemistry, claiming that recent studies show more than half of top-performing candidates in major screening campaigns are chemically invalid. It introduces the term 'structural demons' for erroneous models arising from disordered experimental structures or implausible hypothetical ones (e.g., incorrect oxidation states or charge distributions). The paper addresses error origins, detection methods, and prevention via keeping diffraction data with synthesis details, consistent curation, and pre-filtering topologies before structure generation.
Significance. If the prevalence claim holds, the work identifies a key source of unreliability in high-throughput MOF screening and data-driven discovery, potentially explaining poor experiment-computation agreement. The practical prevention steps could improve database quality and reduce downstream errors, offering guidance for curators and modelers in the field. As a synthesis of existing issues rather than new data, its value lies in framing actionable curation practices.
major comments (2)
- [Abstract] Abstract: The central claim that 'more than half of the top-performing candidates in major computational screening campaigns are chemically invalid' is attributed to unspecified 'recent studies' with no citations, re-analysis, or breakdown provided in the manuscript. This leaves unaddressed whether structural errors (oxidation states, coordination, charge balance) are isolated as the dominant factor versus force-field inaccuracies or screening protocols, weakening the causal link to 'structural demons' as the primary issue.
- [Prevention] Prevention discussion: The suggestion to 'filter topology choices before structure generation' is presented as a key step but without concrete implementation details, validation against known valid structures, or reference to an existing method or tool. This makes the recommendation difficult to operationalize and assess for effectiveness in reducing invalid candidates.
minor comments (2)
- The informal term 'structural demons' is used repeatedly; a brief formal definition or comparison to standard terminology in MOF curation literature (e.g., 'invalid structures' or 'model errors') would improve clarity for a broad audience.
- Ensure all references to 'recent studies' and specific databases or campaigns are accompanied by full citations in the bibliography to allow readers to verify the >50% invalidity rate.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our mini-review. We address each major point below and have revised the manuscript to incorporate the suggested improvements where appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim that 'more than half of the top-performing candidates in major computational screening campaigns are chemically invalid' is attributed to unspecified 'recent studies' with no citations, re-analysis, or breakdown provided in the manuscript. This leaves unaddressed whether structural errors (oxidation states, coordination, charge balance) are isolated as the dominant factor versus force-field inaccuracies or screening protocols, weakening the causal link to 'structural demons' as the primary issue.
Authors: We agree that explicit citations are required. The claim synthesizes findings from specific recent studies that quantified invalid structures in MOF screening campaigns; these will now be cited directly in the revised abstract and main text. While this is a mini-review and performs no new re-analysis, we have added a brief breakdown noting that the cited works isolate structural errors (e.g., oxidation states and charge balance) as a dominant contributor separate from force-field or protocol issues, thereby strengthening the causal framing of structural demons. revision: yes
-
Referee: [Prevention] Prevention discussion: The suggestion to 'filter topology choices before structure generation' is presented as a key step but without concrete implementation details, validation against known valid structures, or reference to an existing method or tool. This makes the recommendation difficult to operationalize and assess for effectiveness in reducing invalid candidates.
Authors: We accept that additional operational details are needed. In the revised manuscript we expand this section with references to established topology-filtering approaches (e.g., those used in CoRE MOF curation and RCSR-based pre-screening pipelines). We now outline a concrete workflow: restrict generation to topologies that appear in experimentally validated structures, then cross-check generated candidates against a curated set of known valid MOFs for coordination and charge consistency. This provides a practical implementation path. revision: yes
Circularity Check
No circularity: purely descriptive review with no derivations or fitted predictions
full rationale
The manuscript is a mini-review containing no equations, no fitted parameters, no predictions, and no derivation chain. Its central assertion about >50% invalid candidates is attributed to external recent studies rather than derived from the paper's own data or self-citations. No self-definitional loops, fitted-input predictions, or ansatz smuggling occur because there are no quantitative steps to reduce to inputs. The paper is self-contained as descriptive analysis.
Axiom & Free-Parameter Ledger
invented entities (1)
-
structural demons
no independent evidence
Reference graph
Works this paper leans on
-
[1]
O. M. Yaghi, M. J. Kalmutzki, C. S. Diercks, Introduction to reticular chemistry : metal-organic frameworks and covalent organic frameworks, Wiley- VCH Verlag GmbH & Co., Weinheim, Germany, 2019
work page 2019
-
[2]
aO. M. Yaghi, G. Li, H. Li, Nature 1995, 378, 703-706; bH. Li, M. Eddaoudi, T. L. Groy, O. Yaghi, Journal of the American Chemical Society 1998, 120, 8571-8572; cH. Li, M. Eddaoudi, M. O'Keeffe, O. M. Yaghi, Nature 1999, 402, 276-279
work page 1995
-
[3]
O. M. Yaghi, M. O'Keeffe, N. W. Ockwig, H. K. Chae, M. Eddaoudi, J. Kim, Nature 2003, 423, 705-714
work page 2003
-
[4]
aC. R. Groom, F. H. Allen, Angewandte Chemie International Edition 2014, 53, 662-671; bC. R. Groom, I. J. Bruno, M. P. Lightfoot, S. C. Ward, Acta Crystallogr. B 2016, 72, 171-179
work page 2014
-
[5]
aY. G. Chung, J. Camp, M. Haranczyk, B. J. Sikora, W. Bury, V. Krungleviciute, T. Yildirim, O. K. Farha, D. S. Sholl, R. Q. Snurr, Chem Mater 2014, 26, 6185-6192; bP. Z. Moghadam, A. Li, S. B. Wiggin, A. Tao, A. G. P. Maloney, P. A. Wood, S. C. Ward, D. Fairen-Jimenez, Chem Mater 2017, 29, 2618-2625; cY. G. Chung, E. Haldoupis, B. J. Bucior, M. Haranczyk,...
work page 2014
-
[6]
aC. E. Wilmer, M. Leaf, C. Y. Lee, O. K. Farha, B. G. Hauser, J. T. Hupp, R. Q. Snurr, Nat Chem 2012, 4, 83- 89; bY. J. Colón, D. A. Gómez-Gualdrón, R. Q. Snurr, Cryst Growth Des 2017, 17, 5801-5810; cR. Anderson, D. A. Gómez-Gualdrón, Crystengcomm 2019, 21, 1653- 1665; dS. Lee, B. Kim, H. Cho, H. Lee, S. Y. Lee, E. S. Cho, J. Kim, Acs Appl Mater Inter 20...
work page 2012
- [7]
-
[8]
aC. E. Wilmer, O. K. Farha, Y.-S. Bae, J. T. Hupp, R. Q. Snurr, Energy & Environmental Science 2012, 5, 9849- 9856; bS. Li, Y. G. Chung, R. Q. Snurr, Langmuir 2016, 32, 10368-10376; cY. G. Chung, D. A. Gomez- Gualdron, P. Li, K. T. Leperi, P. Deria, H. D. Zhang, N. A. Vermeulen, J. F. Stoddart, F. Q. You, J. T. Hupp, O. K. Farha, R. Q. Snurr, Sci Adv 2016...
work page 2012
-
[9]
aD. A. Gómez-Gualdrón, C. E. Wilmer, O. K. Farha, J. T. Hupp, R. Q. Snurr, The Journal of Physical Chemistry C 2014, 118, 6941-6951; bC. M. Simon, J. Kim, D. A. Gomez-Gualdron, J. S. Camp, Y. G. Chung, R. L. Martin, R. Mercado, M. W. Deem, D. Gunter, M. Haranczyk, Energy & Environmental Science 2015, 8, 1190-1199; cD. A. Gómez-Gualdrón, Y. J. Colón, X. Zh...
work page 2014
-
[10]
aY.-M. Wang, A. Datar, Z.-X. Xu, L.-C. Lin, The Journal of Physical Chemistry C 2024, 128, 384-395; bS.-M. Shih, L.-C. Lin, Journal of the American Chemical Society 2025, 147, 34791-34803
work page 2024
-
[11]
aA. S. Rosen, J. M. Notestein, R. Q. Snurr, Journal of computational chemistry 2019, 40, 1305-1318; bH. Adamji, A. Nandy, I. Kevlishvili, Y. Román-Leshkov, H. J. Kulik, Journal of the American Chemical Society 2023, 145, 14365-14378; cH. Kum, J. Kim, ACS Catalysis 2025, 15, 19756-19767
work page 2019
-
[12]
aT. Y. Chen, T. A. Manz, Rsc Adv 2020, 10, 26944- 26951; bA. J. White, M. Gibaldi, J. Burner, R. A. Mayo, T. K. Woo, Journal of the American Chemical Society 2025, 147, 17579-17583; cX. Jin, K. M. Jablonka, E. Moubarak, Y. T. Li, B. Smit, Digit Discov 2025, 4, 1560- 1569; dG. B. Zhao, P. Y. Zhao, Y. G. Chung, Journal of the American Chemical Society 2025,...
work page 2020
-
[13]
M. Gibaldi, J. Luo, A. J. White, R. A. Mayo, C. Pereira, T. K. Woo, Journal of Materials Chemistry A 2025, 13, 32255-32270
work page 2025
-
[14]
J. Goldsmith, A. G. Wong-Foy, M. J. Cafarella, D. J. Siegel, Chem Mater 2013, 25, 3373-3382
work page 2013
-
[15]
aT. A. Manz, D. S. Sholl, Journal of chemical theory and computation 2012, 8, 2844-2867; bT. A. Manz, N. G. Limas, Rsc Adv 2016, 6, 47771-47801; cN. G. Limas, T. A. Manz, Rsc Adv 2016, 6, 45727-45747; dT. A. Manz, Rsc Adv 2017, 7, 45552-45581; eG. Zhao, Y. G. Chung, Journal of chemical theory and computation 2024, 20, 5368-5380
work page 2012
-
[16]
D. Nazarian, J. S. Camp, Y. G. Chung, R. Q. Snurr, D. S. Sholl, Chem Mater 2017, 29, 2521-2528
work page 2017
- [17]
-
[18]
aT. F. Willems, C. H. Rycroft, M. Kazi, J. C. Meza, M. Haranczyk, Microporous and Mesoporous Materials 2012, 149, 134-141; bR. L. Martin, B. Smit, M. Haranczyk, Journal of chemical information and modeling 2012, 52, 308-318
work page 2012
-
[19]
M. O’keeffe, M. A. Peskov, S. J. Ramsden, O. M. Yaghi, Accounts of chemical research 2008, 41, 1782-1789
work page 2008
-
[20]
B. J. Bucior, A. S. Rosen, M. Haranczyk, Z. Yao, M. E. Ziebel, O. K. Farha, J. T. Hupp, J. I. Siepmann, A. Hunting Structural Demons in Digital Reticular Chemistry 12 Aspuru-Guzik, R. Q. Snurr, Cryst Growth Des 2019, 19, 6682-6697
work page 2019
- [21]
-
[22]
aK. T. Leperi, Y. G. Chung, F. You, R. Q. Snurr, ACS sustainable chemistry & engineering 2019, 7, 11529- 11539; bT. D. Burns, K. N. Pai, S. G. Subraveti, S. P. Collins, M. Krykunov, A. Rajendran, T. K. Woo, Environmental science & technology 2020, 54, 4536- 4544; cC. Charalambous, E. Moubarak, J. Schilling, E. Sanchez Fernandez, J.-Y. Wang, L. Herraiz, F....
work page 2019
- [23]
-
[24]
C. Campañá, B. Mussard, T. K. Woo, Journal of Chemical Theory and Computation 2009, 5, 2866-2878
work page 2009
-
[25]
A. S. Rosen, S. M. Iyer, D. Ray, Z. Yao, A. Aspuru- Guzik, L. Gagliardi, J. M. Notestein, R. Q. Snurr, Matter-Us 2021, 4, 1578-1597
work page 2021
-
[26]
B. J. Sikora, R. Winnegar, D. M. Proserpio, R. Q. Snurr, Microporous and Mesoporous Materials 2014, 186, 207- 213
work page 2014
-
[27]
S. Bureekaew, V. Balwani, S. Amirjalayer, R. Schmid, Crystengcomm 2015, 17, 344-352
work page 2015
-
[28]
P. G. Boyd, T. K. Woo, Crystengcomm 2016, 18, 3777- 3792
work page 2016
- [29]
-
[30]
P. G. Boyd, A. Chidambaram, E. García-Díez, C. P. Ireland, T. D. Daff, R. Bounds, A. Gladysiak, P. Schouwink, S. M. Moosavi, M. M. Maroto-Valer, J. A. Reimer, J. A. R. Navarro, T. K. Woo, S. Garcia, K. C. Stylianou, B. Smit, Nature 2019, 576, 253-+
work page 2019
-
[31]
A. Daru, J. S. Anderson, D. M. Proserpio, L. Gagliardi, Adv Mater 2025, 37, 2414617
work page 2025
-
[32]
M. Gibaldi, O. Kwon, A. White, J. Burner, T. K. Woo, Acs Appl Mater Inter 2022, 14, 43372-43386
work page 2022
-
[33]
J. G. Eon, Acta Crystallogr A 2011, 67, 68-86
work page 2011
-
[34]
aP. Z. Moghadam, S. M. Rogge, A. Li, C.-M. Chow, J. Wieme, N. Moharrami, M. Aragones-Anglada, G. Conduit, D. A. Gomez-Gualdron, V. Van Speybroeck, Matter-Us 2019, 1, 219-234; bR. Anderson, D. A. Gómez-Gualdrón, The Journal of Chemical Physics 2021, 154; cZ. Cao, R. Magar, Y. Wang, A. Barati Farimani, Journal of the American Chemical Society 2023, 145, 295...
work page 2019
-
[35]
aN. Kim, S. Kim, M. Kim, J. Park, S. Ahn, arXiv preprint arXiv:2410.17270 2024; bZ. P. Yao, B. Sánchez-Lengeling, N. S. Bobbitt, B. J. Bucior, S. G. H. Kumar, S. P. Collins, T. Burns, T. K. Woo, O. K. Farha, R. Q. Snurr, A. Aspuru-Guzik, Nature Machine Intelligence 2021, 3, 76-86
- [36]
-
[37]
M. Gibaldi, A. Kapeliukha, A. White, T. K. Woo, Journal of Chemical Information and Modeling 2024, 65, 275-287
work page 2024
-
[38]
W. Ouellette, K. Darling, A. Prosvirin, K. Whitenack, K. R. Dunbar, J. Zubieta, Dalton Transactions 2011, 40, 12288-12300
work page 2011
-
[39]
T. M. Pruyn, A. Aswad, S. T. Khan, J. Huang, R. Black, S. M. Moosavi, Journal of the American Chemical Society 2025, 147, 43474-43486
work page 2025
-
[40]
H. Kim, D. Kim, J. Kim, arXiv preprint arXiv:2512.01693 2025
work page internal anchor Pith review Pith/arXiv arXiv 2025
- [41]
-
[42]
K. Y. Wang, D. D. Ahn, N. Rampal, J. Thomassian, N. S. Sabeva, O. Kannan, C. Borgs, J. T. Chayes, O. M. Yaghi, Journal of the American Chemical Society 2026
work page 2026
-
[43]
K. M. Jablonka, A. S. Rosen, A. S. Krishnapriyan, B. Smit, Acs Central Sci 2023, 9, 563-581. Received: ((will be filled in by the editorial staff)) Accepted: ((will be filled in by the editorial staff)) Published online: ((will be filled in by the editorial staff)) Biographical Sketch Yongchul G. Chung is a Professor of Chemical Engineering at Pusan Natio...
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.