pith. sign in

arxiv: 2603.26295 · v4 · submitted 2026-03-27 · ❄️ cond-mat.mtrl-sci · physics.chem-ph

Hunting Structural Demons in Digital Reticular Chemistry: Lessons from Metal-Organic Frameworks

Pith reviewed 2026-05-14 22:46 UTC · model grok-4.3

classification ❄️ cond-mat.mtrl-sci physics.chem-ph
keywords structural demonsmetal-organic frameworksdigital reticular chemistrycomputational screeningstructural errorsMOF databaseshypothetical structuresdata curation
0
0 comments X

The pith

More than half of the top-performing candidates in major computational screenings of metal-organic frameworks are chemically invalid due to structural errors.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper shows that digital reticular chemistry depends on accurate crystal structures for screening and discovery, yet errors in those structures produce many chemically invalid models. In experimental MOF databases, disordered or incomplete diffraction data get turned into complete simulation inputs that do not reflect real chemistry. In hypothetical databases, structures are fully specified but often contain implausible oxidation states, coordination geometries, or charge distributions. The authors call these bad models structural demons and trace how they enter databases, how they can be detected, and how curation steps can keep them out before they reach screening pipelines.

Core claim

The central claim is that structural errors in both experimental and hypothetical MOF databases generate chemically invalid structures that dominate the top results of computational screening campaigns, and that these errors can be reduced by preserving raw diffraction data alongside synthesis information, applying consistent curation rules at database entry, and filtering topology choices prior to structure generation.

What carries the argument

Structural demons: erroneous crystal structure models that arise either from converting disordered experimental diffraction data into fully specified simulation inputs or from encoding chemically implausible features in hypothetical structures, which then propagate invalid high-ranking candidates through screening workflows.

If this is right

  • Integrating raw diffraction data with synthesis details from the start prevents many conversion errors from entering experimental databases.
  • Consistent curation protocols applied when structures are added to a database reduce the number of implausible entries that reach screening.
  • Filtering topology choices before generating hypothetical structures keeps chemically unreasonable models from being created in the first place.
  • Downstream databases receive fewer bad structures, lowering the amount of post-hoc correction needed for reliable screening results.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If structural demons are widespread, many previously published computational rankings of MOFs for gas separation or catalysis may require re-evaluation with corrected inputs.
  • The same error patterns could appear in related classes of reticular materials such as covalent organic frameworks once large hypothetical databases are built for them.
  • Automated validation checks for oxidation-state balance and coordination geometry could be inserted early in database pipelines to catch demons before screening begins.

Load-bearing premise

The majority of chemically invalid top candidates come from structural model errors rather than from force-field inaccuracies or limitations in the screening methodology itself.

What would settle it

Re-run one of the major published MOF screening campaigns on a version of the database in which every structure has been manually validated and corrected for oxidation states, coordination environments, and charge balance, then check whether the fraction of invalid top performers drops substantially below 50 percent.

read the original abstract

Digital reticular chemistry relies on accurate crystal structures to power computational screening, data-driven discovery, and structure-property analysis, yet recent studies reveal that more than half of the top-performing candidates in major computational screening campaigns are chemically invalid. In experimental MOF databases, structural errors arise when disordered or incomplete structural models are incorrectly converted into fully specified simulation inputs. In hypothetical MOF database, structures are complete by construction but may encode chemically implausible oxidation states, coordination environments, or charge distributions. We term these erroneous structural models "structural demons." This mini-review asks three questions: where these errors enter, how we find them, and how we prevent them. On the prevention side, the key steps are keeping diffraction data and synthesis details together from the start, using consistent curation when structures enter a database, and filtering topology choices before structure generation. Connecting these steps can keep many bad structures out of downstream databases and reduce the need to fix them later.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. This mini-review examines structural errors in MOF databases for computational reticular chemistry, claiming that recent studies show more than half of top-performing candidates in major screening campaigns are chemically invalid. It introduces the term 'structural demons' for erroneous models arising from disordered experimental structures or implausible hypothetical ones (e.g., incorrect oxidation states or charge distributions). The paper addresses error origins, detection methods, and prevention via keeping diffraction data with synthesis details, consistent curation, and pre-filtering topologies before structure generation.

Significance. If the prevalence claim holds, the work identifies a key source of unreliability in high-throughput MOF screening and data-driven discovery, potentially explaining poor experiment-computation agreement. The practical prevention steps could improve database quality and reduce downstream errors, offering guidance for curators and modelers in the field. As a synthesis of existing issues rather than new data, its value lies in framing actionable curation practices.

major comments (2)
  1. [Abstract] Abstract: The central claim that 'more than half of the top-performing candidates in major computational screening campaigns are chemically invalid' is attributed to unspecified 'recent studies' with no citations, re-analysis, or breakdown provided in the manuscript. This leaves unaddressed whether structural errors (oxidation states, coordination, charge balance) are isolated as the dominant factor versus force-field inaccuracies or screening protocols, weakening the causal link to 'structural demons' as the primary issue.
  2. [Prevention] Prevention discussion: The suggestion to 'filter topology choices before structure generation' is presented as a key step but without concrete implementation details, validation against known valid structures, or reference to an existing method or tool. This makes the recommendation difficult to operationalize and assess for effectiveness in reducing invalid candidates.
minor comments (2)
  1. The informal term 'structural demons' is used repeatedly; a brief formal definition or comparison to standard terminology in MOF curation literature (e.g., 'invalid structures' or 'model errors') would improve clarity for a broad audience.
  2. Ensure all references to 'recent studies' and specific databases or campaigns are accompanied by full citations in the bibliography to allow readers to verify the >50% invalidity rate.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our mini-review. We address each major point below and have revised the manuscript to incorporate the suggested improvements where appropriate.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The central claim that 'more than half of the top-performing candidates in major computational screening campaigns are chemically invalid' is attributed to unspecified 'recent studies' with no citations, re-analysis, or breakdown provided in the manuscript. This leaves unaddressed whether structural errors (oxidation states, coordination, charge balance) are isolated as the dominant factor versus force-field inaccuracies or screening protocols, weakening the causal link to 'structural demons' as the primary issue.

    Authors: We agree that explicit citations are required. The claim synthesizes findings from specific recent studies that quantified invalid structures in MOF screening campaigns; these will now be cited directly in the revised abstract and main text. While this is a mini-review and performs no new re-analysis, we have added a brief breakdown noting that the cited works isolate structural errors (e.g., oxidation states and charge balance) as a dominant contributor separate from force-field or protocol issues, thereby strengthening the causal framing of structural demons. revision: yes

  2. Referee: [Prevention] Prevention discussion: The suggestion to 'filter topology choices before structure generation' is presented as a key step but without concrete implementation details, validation against known valid structures, or reference to an existing method or tool. This makes the recommendation difficult to operationalize and assess for effectiveness in reducing invalid candidates.

    Authors: We accept that additional operational details are needed. In the revised manuscript we expand this section with references to established topology-filtering approaches (e.g., those used in CoRE MOF curation and RCSR-based pre-screening pipelines). We now outline a concrete workflow: restrict generation to topologies that appear in experimentally validated structures, then cross-check generated candidates against a curated set of known valid MOFs for coordination and charge consistency. This provides a practical implementation path. revision: yes

Circularity Check

0 steps flagged

No circularity: purely descriptive review with no derivations or fitted predictions

full rationale

The manuscript is a mini-review containing no equations, no fitted parameters, no predictions, and no derivation chain. Its central assertion about >50% invalid candidates is attributed to external recent studies rather than derived from the paper's own data or self-citations. No self-definitional loops, fitted-input predictions, or ansatz smuggling occur because there are no quantitative steps to reduce to inputs. The paper is self-contained as descriptive analysis.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

The paper is a review and introduces only the descriptive term 'structural demons' without new physical entities, free parameters, or unstated axioms.

invented entities (1)
  • structural demons no independent evidence
    purpose: Label for erroneous structural models in MOF databases
    New term coined to describe the collection of chemically invalid structures discussed in the review.

pith-pipeline@v0.9.0 · 5465 in / 993 out tokens · 37616 ms · 2026-05-14T22:46:10.594420+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages · 1 internal anchor

  1. [1]

    O. M. Yaghi, M. J. Kalmutzki, C. S. Diercks, Introduction to reticular chemistry : metal-organic frameworks and covalent organic frameworks, Wiley- VCH Verlag GmbH & Co., Weinheim, Germany, 2019

  2. [2]

    aO. M. Yaghi, G. Li, H. Li, Nature 1995, 378, 703-706; bH. Li, M. Eddaoudi, T. L. Groy, O. Yaghi, Journal of the American Chemical Society 1998, 120, 8571-8572; cH. Li, M. Eddaoudi, M. O'Keeffe, O. M. Yaghi, Nature 1999, 402, 276-279

  3. [3]

    O. M. Yaghi, M. O'Keeffe, N. W. Ockwig, H. K. Chae, M. Eddaoudi, J. Kim, Nature 2003, 423, 705-714

  4. [4]

    aC. R. Groom, F. H. Allen, Angewandte Chemie International Edition 2014, 53, 662-671; bC. R. Groom, I. J. Bruno, M. P. Lightfoot, S. C. Ward, Acta Crystallogr. B 2016, 72, 171-179

  5. [5]

    aY. G. Chung, J. Camp, M. Haranczyk, B. J. Sikora, W. Bury, V. Krungleviciute, T. Yildirim, O. K. Farha, D. S. Sholl, R. Q. Snurr, Chem Mater 2014, 26, 6185-6192; bP. Z. Moghadam, A. Li, S. B. Wiggin, A. Tao, A. G. P. Maloney, P. A. Wood, S. C. Ward, D. Fairen-Jimenez, Chem Mater 2017, 29, 2618-2625; cY. G. Chung, E. Haldoupis, B. J. Bucior, M. Haranczyk,...

  6. [6]

    aC. E. Wilmer, M. Leaf, C. Y. Lee, O. K. Farha, B. G. Hauser, J. T. Hupp, R. Q. Snurr, Nat Chem 2012, 4, 83- 89; bY. J. Colón, D. A. Gómez-Gualdrón, R. Q. Snurr, Cryst Growth Des 2017, 17, 5801-5810; cR. Anderson, D. A. Gómez-Gualdrón, Crystengcomm 2019, 21, 1653- 1665; dS. Lee, B. Kim, H. Cho, H. Lee, S. Y. Lee, E. S. Cho, J. Kim, Acs Appl Mater Inter 20...

  7. [7]

    Ongari, A

    aD. Ongari, A. V. Yakutovich, L. Talirz, B. Smit, Acs Central Sci 2019, 5, 1663-1675; bJ. S. De Vos, S. Borgmans, P. Van Der Voort, S. M. Rogge, V. Van Speybroeck, Journal of Materials Chemistry A 2023, 11, 7468-7487

  8. [8]

    aC. E. Wilmer, O. K. Farha, Y.-S. Bae, J. T. Hupp, R. Q. Snurr, Energy & Environmental Science 2012, 5, 9849- 9856; bS. Li, Y. G. Chung, R. Q. Snurr, Langmuir 2016, 32, 10368-10376; cY. G. Chung, D. A. Gomez- Gualdron, P. Li, K. T. Leperi, P. Deria, H. D. Zhang, N. A. Vermeulen, J. F. Stoddart, F. Q. You, J. T. Hupp, O. K. Farha, R. Q. Snurr, Sci Adv 2016...

  9. [9]

    aD. A. Gómez-Gualdrón, C. E. Wilmer, O. K. Farha, J. T. Hupp, R. Q. Snurr, The Journal of Physical Chemistry C 2014, 118, 6941-6951; bC. M. Simon, J. Kim, D. A. Gomez-Gualdron, J. S. Camp, Y. G. Chung, R. L. Martin, R. Mercado, M. W. Deem, D. Gunter, M. Haranczyk, Energy & Environmental Science 2015, 8, 1190-1199; cD. A. Gómez-Gualdrón, Y. J. Colón, X. Zh...

  10. [10]

    aY.-M. Wang, A. Datar, Z.-X. Xu, L.-C. Lin, The Journal of Physical Chemistry C 2024, 128, 384-395; bS.-M. Shih, L.-C. Lin, Journal of the American Chemical Society 2025, 147, 34791-34803

  11. [11]

    aA. S. Rosen, J. M. Notestein, R. Q. Snurr, Journal of computational chemistry 2019, 40, 1305-1318; bH. Adamji, A. Nandy, I. Kevlishvili, Y. Román-Leshkov, H. J. Kulik, Journal of the American Chemical Society 2023, 145, 14365-14378; cH. Kum, J. Kim, ACS Catalysis 2025, 15, 19756-19767

  12. [12]

    aT. Y. Chen, T. A. Manz, Rsc Adv 2020, 10, 26944- 26951; bA. J. White, M. Gibaldi, J. Burner, R. A. Mayo, T. K. Woo, Journal of the American Chemical Society 2025, 147, 17579-17583; cX. Jin, K. M. Jablonka, E. Moubarak, Y. T. Li, B. Smit, Digit Discov 2025, 4, 1560- 1569; dG. B. Zhao, P. Y. Zhao, Y. G. Chung, Journal of the American Chemical Society 2025,...

  13. [13]

    Gibaldi, J

    M. Gibaldi, J. Luo, A. J. White, R. A. Mayo, C. Pereira, T. K. Woo, Journal of Materials Chemistry A 2025, 13, 32255-32270

  14. [14]

    Goldsmith, A

    J. Goldsmith, A. G. Wong-Foy, M. J. Cafarella, D. J. Siegel, Chem Mater 2013, 25, 3373-3382

  15. [15]

    aT. A. Manz, D. S. Sholl, Journal of chemical theory and computation 2012, 8, 2844-2867; bT. A. Manz, N. G. Limas, Rsc Adv 2016, 6, 47771-47801; cN. G. Limas, T. A. Manz, Rsc Adv 2016, 6, 45727-45747; dT. A. Manz, Rsc Adv 2017, 7, 45552-45581; eG. Zhao, Y. G. Chung, Journal of chemical theory and computation 2024, 20, 5368-5380

  16. [16]

    Nazarian, J

    D. Nazarian, J. S. Camp, Y. G. Chung, R. Q. Snurr, D. S. Sholl, Chem Mater 2017, 29, 2521-2528

  17. [17]

    Nazarian, J

    D. Nazarian, J. S. Camp, D. S. Sholl, Chem Mater 2016, 28, 785-793

  18. [18]

    aT. F. Willems, C. H. Rycroft, M. Kazi, J. C. Meza, M. Haranczyk, Microporous and Mesoporous Materials 2012, 149, 134-141; bR. L. Martin, B. Smit, M. Haranczyk, Journal of chemical information and modeling 2012, 52, 308-318

  19. [19]

    O’keeffe, M

    M. O’keeffe, M. A. Peskov, S. J. Ramsden, O. M. Yaghi, Accounts of chemical research 2008, 41, 1782-1789

  20. [20]

    B. J. Bucior, A. S. Rosen, M. Haranczyk, Z. Yao, M. E. Ziebel, O. K. Farha, J. T. Hupp, J. I. Siepmann, A. Hunting Structural Demons in Digital Reticular Chemistry 12 Aspuru-Guzik, R. Q. Snurr, Cryst Growth Des 2019, 19, 6682-6697

  21. [21]

    Nandy, C

    aA. Nandy, C. Duan, H. J. Kulik, Journal of the American Chemical Society 2021, 143, 17535-17547; bG. G. Terrones, S.-P. Huang, M. P. Rivera, S. Yue, A. Hernandez, H. J. Kulik, Journal of the American Chemical Society 2024, 146, 20333-20348

  22. [22]

    aK. T. Leperi, Y. G. Chung, F. You, R. Q. Snurr, ACS sustainable chemistry & engineering 2019, 7, 11529- 11539; bT. D. Burns, K. N. Pai, S. G. Subraveti, S. P. Collins, M. Krykunov, A. Rajendran, T. K. Woo, Environmental science & technology 2020, 54, 4536- 4544; cC. Charalambous, E. Moubarak, J. Schilling, E. Sanchez Fernandez, J.-Y. Wang, L. Herraiz, F....

  23. [23]

    Burner, J

    J. Burner, J. Luo, A. White, A. Mirmiran, O. Kwon, P. G. Boyd, S. Maley, M. Gibaldi, S. Simrod, V. Ogden, Chem Mater 2023, 35, 900-916

  24. [24]

    Campañá, B

    C. Campañá, B. Mussard, T. K. Woo, Journal of Chemical Theory and Computation 2009, 5, 2866-2878

  25. [25]

    A. S. Rosen, S. M. Iyer, D. Ray, Z. Yao, A. Aspuru- Guzik, L. Gagliardi, J. M. Notestein, R. Q. Snurr, Matter-Us 2021, 4, 1578-1597

  26. [26]

    B. J. Sikora, R. Winnegar, D. M. Proserpio, R. Q. Snurr, Microporous and Mesoporous Materials 2014, 186, 207- 213

  27. [27]

    Bureekaew, V

    S. Bureekaew, V. Balwani, S. Amirjalayer, R. Schmid, Crystengcomm 2015, 17, 344-352

  28. [28]

    P. G. Boyd, T. K. Woo, Crystengcomm 2016, 18, 3777- 3792

  29. [29]

    Witman, S

    M. Witman, S. L. Ling, S. Anderson, L. H. Tong, K. C. Stylianou, B. Slater, B. Smit, M. Haranczyk, Chem Sci 2016, 7, 6263-6272

  30. [30]

    P. G. Boyd, A. Chidambaram, E. García-Díez, C. P. Ireland, T. D. Daff, R. Bounds, A. Gladysiak, P. Schouwink, S. M. Moosavi, M. M. Maroto-Valer, J. A. Reimer, J. A. R. Navarro, T. K. Woo, S. Garcia, K. C. Stylianou, B. Smit, Nature 2019, 576, 253-+

  31. [31]

    A. Daru, J. S. Anderson, D. M. Proserpio, L. Gagliardi, Adv Mater 2025, 37, 2414617

  32. [32]

    Gibaldi, O

    M. Gibaldi, O. Kwon, A. White, J. Burner, T. K. Woo, Acs Appl Mater Inter 2022, 14, 43372-43386

  33. [33]

    J. G. Eon, Acta Crystallogr A 2011, 67, 68-86

  34. [34]

    aP. Z. Moghadam, S. M. Rogge, A. Li, C.-M. Chow, J. Wieme, N. Moharrami, M. Aragones-Anglada, G. Conduit, D. A. Gomez-Gualdron, V. Van Speybroeck, Matter-Us 2019, 1, 219-234; bR. Anderson, D. A. Gómez-Gualdrón, The Journal of Chemical Physics 2021, 154; cZ. Cao, R. Magar, Y. Wang, A. Barati Farimani, Journal of the American Chemical Society 2023, 145, 295...

  35. [35]

    aN. Kim, S. Kim, M. Kim, J. Park, S. Ahn, arXiv preprint arXiv:2410.17270 2024; bZ. P. Yao, B. Sánchez-Lengeling, N. S. Bobbitt, B. J. Bucior, S. G. H. Kumar, S. P. Collins, T. Burns, T. K. Woo, O. K. Farha, R. Q. Snurr, A. Aspuru-Guzik, Nature Machine Intelligence 2021, 3, 76-86

  36. [36]

    Müller, R

    P. Müller, R. Herbst-Irmer, A. L. Spek, T. R. Schneider, M. R. Sawaya, Crystal Structure Refinement: A Crystallographer's Guide to SHELXL, 2010 Edition ed., Oxford University Press, 2010

  37. [37]

    Gibaldi, A

    M. Gibaldi, A. Kapeliukha, A. White, T. K. Woo, Journal of Chemical Information and Modeling 2024, 65, 275-287

  38. [38]

    Ouellette, K

    W. Ouellette, K. Darling, A. Prosvirin, K. Whitenack, K. R. Dunbar, J. Zubieta, Dalton Transactions 2011, 40, 12288-12300

  39. [39]

    T. M. Pruyn, A. Aswad, S. T. Khan, J. Huang, R. Black, S. M. Moosavi, Journal of the American Chemical Society 2025, 147, 43474-43486

  40. [40]

    H. Kim, D. Kim, J. Kim, arXiv preprint arXiv:2512.01693 2025

  41. [41]

    Cheung, S

    O. Cheung, S. Tokuda, D. Jędrzejowski, E. Ploetz, B. Baumgartner, M. Pander, F. Yang, J. D. Evans, R. Ettlinger, S. Wuttke, Adv Mater 2025, e21420

  42. [42]

    K. Y. Wang, D. D. Ahn, N. Rampal, J. Thomassian, N. S. Sabeva, O. Kannan, C. Borgs, J. T. Chayes, O. M. Yaghi, Journal of the American Chemical Society 2026

  43. [43]

    K. M. Jablonka, A. S. Rosen, A. S. Krishnapriyan, B. Smit, Acs Central Sci 2023, 9, 563-581. Received: ((will be filled in by the editorial staff)) Accepted: ((will be filled in by the editorial staff)) Published online: ((will be filled in by the editorial staff)) Biographical Sketch Yongchul G. Chung is a Professor of Chemical Engineering at Pusan Natio...