pith. sign in

arxiv: 2606.31332 · v1 · pith:I3LSWEYAnew · submitted 2026-06-30 · 💻 cs.AI

CryoACE: An Atom-centric Framework for Accurate and Automated Model Building in Cryo-EM

Pith reviewed 2026-07-01 05:30 UTC · model grok-4.3

classification 💻 cs.AI
keywords cryo-EMatomic model buildingprotein structureconformational heterogeneitydensity mapsautomated reconstructionlocal resolution
0
0 comments X

The pith

CryoACE builds atomic protein models from cryo-EM maps by sampling density directly at atom positions and applying local resolution guidance.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents CryoACE as an end-to-end method that reconstructs atomic graphs from both uniform and varying cryo-EM density maps. It replaces voxel-based processing with direct sampling of features at atomic coordinates that are then fed back to refine the structure, and adds a training-free step that uses predicted local resolution to clarify ambiguous regions. This combination aims to produce chemically valid models even when the underlying protein shows conformational changes. A reader would care because current automated approaches often fail on heterogeneous maps or require pre-existing static templates, limiting what can be learned from real experimental data.

Core claim

CryoACE reconstructs precise atomic graphs for homogeneous and heterogeneous structures through an atom-centric reconstruction paradigm in which density features are sampled directly at atomic coordinates and iteratively recycled to refine the model, replacing expensive voxel convolutions, together with a training-free guidance mechanism that leverages predicted local resolution priors to resolve dynamic ambiguity.

What carries the argument

The atom-centric reconstruction paradigm, which samples density features at atomic coordinates and recycles them iteratively to refine structures instead of using voxel convolutions.

If this is right

  • Automated atomic model building becomes possible for heterogeneous maps without pre-built static structures.
  • The method produces higher accuracy than prior baselines on standard static test cases.
  • Atomic-level dynamic conformations can be recovered directly from complex experimental datasets such as EMPIAR-10345.
  • Physicochemical validity is maintained through the coordinate-based sampling loop and resolution guidance.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same coordinate-sampling loop might be adapted to other density-based imaging methods where voxel grids are currently the default.
  • If the local-resolution prior can be obtained cheaply, the framework could support iterative refinement during data collection rather than only after the fact.
  • Success on EMPIAR-10345 suggests the approach may scale to larger assemblies where conformational heterogeneity has previously blocked atomic modeling.

Load-bearing premise

Sampling density at atomic coordinates together with local resolution guidance is enough to enforce chemical validity and remove conformational ambiguity in the maps.

What would settle it

Running CryoACE on a high-resolution map of a known static protein and finding that the output model contains bond lengths or angles outside accepted chemical ranges, or that it produces a single static model on EMPIAR-10345 when multiple distinct conformations are independently known to exist.

Figures

Figures reproduced from arXiv: 2606.31332 by Jiakai Zhang, Jingyi Yu, Mingrui Li, Minzhang Li, Qihe Chen, Sixian Shen, Weichen Qin, Yuan Pei.

Figure 1
Figure 1. Figure 1: Pipeline of CryoACE. The pipeline integrates three modalities, including sequences, electron density maps, and atomic profiles, to build atomic structures via a diffusion-based decoder. At the inference stage, we employ an atomic self-refinement strategy. This process is further enhanced by a guided diffusion scheme, utilizing global guidance (Tg) and Q-guidance (Tq) to improve model performance. 2.3. Neur… view at source ↗
Figure 2
Figure 2. Figure 2: Light-weight downstream heads. The architecture con￾sists of two specialized modules: a local resolution estimator (top) that uses a Unet3D to predict density maps from rescaled inputs, and a Q-score prediction head (bottom) that uses embeddings from a frozen CryoACE backbone to estimate residue-level quality. concatenated to form a patch token sequence fd ∈ R N×D: fd = Concat UNet3D(V 1 ), . . . , UNet3D(… view at source ↗
Figure 3
Figure 3. Figure 3: Qualitative comparison of automated model building from homogeneous density maps. We demonstrate three examples with resolution values decreasing from top to bottom. While CryoACE and ModelAngelo achieve near-perfect TM-scores on the 2.00 A map, E3-CryoFold suffers from backbone breaks. Crucially, in the lower resolution cases, only CryoACE generates complete, high ˚ TM-score structures. This highlights th… view at source ↗
Figure 4
Figure 4. Figure 4: Comparison of Heterogeneous Model Building Results on the αVβ8 Integrin. This target exhibits significant flexibility in the Fab arm. Existing methods like E3-CryoFold and ModelAngelo fail to model this dynamic region, while CryoBoltz attempts a full reconstruction but suffers from backbone breaks and visual misalignment. In contrast, CryoACE generates a complete, continuous structure that accurately fits … view at source ↗
Figure 5
Figure 5. Figure 5: An example of results of local resolution. Local resolution maps are calculated via CryoRes and visualized through Chimera on case EMD-3492. 13 [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: CryoACE’s Heterogeneous Model Building Results on SARS-CoV-2 spike in prefusion state (EMPIAR-10516) F. Implementation Details Inference. We perform inference by solving the Probability Flow ODE derived from the linear interpolation xt = (1 − t)x0 + tϵ with ϵ ∼ N (0, I). Specifically, we employ a first-order Euler solver with 200 steps on a linear schedule t ∈ [0, 1] to integrate the velocity field from t=… view at source ↗
read the original abstract

Protein automodeling from cryo-EM density maps faces unique challenges in enforcing physicochemical validity and managing conformational heterogeneity. Current solvers are often limited to static predictions or require computationally intensive heuristic searches. We present CryoACE, an end-to-end framework that reconstructs precise atomic graphs for both homogeneous and heterogeneous structures. Our method features two key innovations: an atom-centric reconstruction paradigm, where density features are sampled directly at atomic coordinates and iteratively recycled to refine structures, replacing expensive voxel convolutions for efficient multimodal fusion; and a training-free guidance mechanism that leverages predicted local resolution priors to resolve dynamic ambiguity. Validated on a newly constructed high-quality dataset, CryoACE significantly outperforms existing baselines on static benchmarks and, for the first time, unveils atomic-level dynamic conformations on complex real-world datasets like EMPIAR-10345 without relying on pre-built static structures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The manuscript presents CryoACE, an end-to-end atom-centric framework for automated atomic model building from cryo-EM density maps. It replaces voxel convolutions with direct sampling of density features at atomic coordinates, incorporates iterative recycling for refinement, and adds a training-free guidance step that uses predicted local resolution priors to address conformational heterogeneity. The abstract claims that the method significantly outperforms baselines on static benchmarks and, for the first time, recovers atomic-level dynamic conformations on real-world heterogeneous datasets such as EMPIAR-10345 without requiring pre-built static structures, all validated on a newly constructed high-quality dataset.

Significance. If the quantitative claims hold, the atom-centric sampling and training-free local-resolution guidance could provide an efficient alternative to existing voxel-based or heuristic cryo-EM modeling pipelines, particularly for heterogeneous maps. The absence of any reported metrics, however, prevents evaluation of whether these innovations actually deliver the stated gains in accuracy or physicochemical validity.

major comments (3)
  1. [Abstract] Abstract: the central claims that CryoACE 'significantly outperforms existing baselines on static benchmarks' and 'unveils atomic-level dynamic conformations on complex real-world datasets like EMPIAR-10345' are unsupported by any numerical results, tables, ablation studies, error bars, or dataset statistics. This evidentiary gap is load-bearing for the paper's primary contribution.
  2. [Abstract] Abstract: no description is supplied of the 'newly constructed high-quality dataset,' its size, composition, resolution range, or how it differs from public benchmarks, nor are any specific performance numbers (e.g., RMSD, completeness, or local-resolution correlation) reported for either the static or dynamic cases. Without these details the outperformance and novelty assertions cannot be assessed.
  3. [Abstract] Abstract: the atom-centric sampling and local-resolution guidance are asserted to enforce 'physicochemical validity' and resolve 'conformational ambiguity,' yet the abstract supplies no concrete mechanism, loss term, or validation metric showing that these steps produce stereochemically valid models or disambiguate states better than existing methods.
minor comments (1)
  1. [Abstract] The abstract would be clearer if it included at least one key quantitative result (e.g., average RMSD or success rate) to anchor the performance claims.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments on the manuscript. We address each major comment below and agree that the abstract can be strengthened with additional concrete details drawn from the full paper to better support the claims.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claims that CryoACE 'significantly outperforms existing baselines on static benchmarks' and 'unveils atomic-level dynamic conformations on complex real-world datasets like EMPIAR-10345' are unsupported by any numerical results, tables, ablation studies, error bars, or dataset statistics. This evidentiary gap is load-bearing for the paper's primary contribution.

    Authors: The full manuscript reports these quantitative results, including tables with RMSD, completeness, and local-resolution metrics, ablation studies, and error bars in the Results section. The abstract was intentionally concise, but we agree it would benefit from key numerical highlights and will revise it to include representative performance numbers and dataset statistics. revision: yes

  2. Referee: [Abstract] Abstract: no description is supplied of the 'newly constructed high-quality dataset,' its size, composition, resolution range, or how it differs from public benchmarks, nor are any specific performance numbers (e.g., RMSD, completeness, or local-resolution correlation) reported for either the static or dynamic cases. Without these details the outperformance and novelty assertions cannot be assessed.

    Authors: The dataset construction, size, composition, resolution range, and differences from public benchmarks are detailed in the Methods section. We will add a brief summary of these elements and example performance numbers to the abstract. revision: yes

  3. Referee: [Abstract] Abstract: the atom-centric sampling and local-resolution guidance are asserted to enforce 'physicochemical validity' and resolve 'conformational ambiguity,' yet the abstract supplies no concrete mechanism, loss term, or validation metric showing that these steps produce stereochemically valid models or disambiguate states better than existing methods.

    Authors: The atom-centric sampling (replacing voxel convolutions with direct coordinate sampling and iterative recycling) and training-free local-resolution guidance, including the associated loss terms and stereochemical validation metrics, are described in Sections 3.2 and 3.3. We will revise the abstract to include a concise reference to these mechanisms. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation self-contained against external benchmarks

full rationale

The abstract and available description introduce an atom-centric sampling paradigm and training-free local-resolution guidance as methodological choices, but present no equations, fitted parameters renamed as predictions, self-citation chains, or uniqueness theorems that reduce the claimed outputs to the inputs by construction. Performance claims are tied to external benchmarks (static sets and EMPIAR-10345) rather than internal recycling of the same quantities. This is the normal case of an empirical method whose validity rests on independent validation rather than definitional equivalence.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides insufficient detail to enumerate free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5694 in / 1123 out tokens · 43320 ms · 2026-07-01T05:30:18.296941+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

64 extracted references · 3 canonical work pages

  1. [1]

    Proceedings of the AAAI Conference on Artificial Intelligence , volume=

    3d-rpe: Enhancing long-context modeling through 3d rotary position encoding , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

  2. [2]

    International conference on medical image computing and computer-assisted intervention , pages=

    3D U-Net: learning dense volumetric segmentation from sparse annotation , author=. International conference on medical image computing and computer-assisted intervention , pages=. 2016 , organization=

  3. [3]

    and Fleet, David J

    Punjani, Ali and Rubinstein, John L. and Fleet, David J. and Brubaker, Marcus A. , title=. Nature Methods , year=. doi:10.1038/nmeth.4169 , url=

  4. [4]

    Nature , volume=

    Automated model building and protein identification in cryo-EM maps , author=. Nature , volume=. 2024 , publisher=

  5. [5]

    Nature Structural & Molecular Biology , pages=

    CryoAtom improves model building for cryo-EM , author=. Nature Structural & Molecular Biology , pages=. 2025 , publisher=

  6. [6]

    Nature Machine Intelligence , pages=

    End-to-end Cryo-EM complex structure determination with high accuracy and ultra-fast speed , author=. Nature Machine Intelligence , pages=. 2025 , publisher=

  7. [7]

    arXiv preprint arXiv:2506.04490 , year=

    Multiscale guidance of AlphaFold3 with heterogeneous cryo-EM data , author=. arXiv preprint arXiv:2506.04490 , year=

  8. [8]

    Nature methods , volume=

    CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks , author=. Nature methods , volume=. 2021 , publisher=

  9. [9]

    The Buccaneer software for automated model building. 1. Tracing protein chains , author=. Biological Crystallography , volume=. 2006 , publisher=

  10. [10]

    Nature communications , volume=

    De novo main-chain modeling for EM maps using MAINMAST , author=. Nature communications , volume=. 2018 , publisher=

  11. [11]

    Nature methods , volume=

    A fully automatic method yielding initial models from high-resolution cryo-electron microscopy maps , author=. Nature methods , volume=. 2018 , publisher=

  12. [12]

    Structure , volume=

    Constructing and validating initial C models from subnanometer resolution density maps with pathwalking , author=. Structure , volume=. 2012 , publisher=

  13. [13]

    Journal of structural biology , volume=

    De Novo modeling in cryo-EM density maps with Pathwalking , author=. Journal of structural biology , volume=. 2016 , publisher=

  14. [14]

    Nature methods , volume=

    De novo protein structure determination from near-atomic-resolution cryo-EM maps , author=. Nature methods , volume=. 2015 , publisher=

  15. [15]

    Structure , volume=

    EM-fold: De novo folding of -helical proteins guided by intermediate-resolution electron microscopy density maps , author=. Structure , volume=. 2009 , publisher=

  16. [16]

    Biological crystallography , volume=

    Coot: model-building tools for molecular graphics , author=. Biological crystallography , volume=. 2004 , publisher=

  17. [17]

    IEEE/ACM transactions on computational biology and bioinformatics , volume=

    Intensity-based skeletonization of CryoEM gray-scale images using a true segmentation-free algorithm , author=. IEEE/ACM transactions on computational biology and bioinformatics , volume=. 2013 , publisher=

  18. [18]

    BioRxiv , pages=

    Boltz-1 democratizing biomolecular interaction modeling , author=. BioRxiv , pages=

  19. [19]

    BioRxiv , year=

    Boltz-2: Towards accurate and efficient binding affinity prediction , author=. BioRxiv , year=

  20. [20]

    Nature , volume=

    Improved protein structure prediction using potentials from deep learning , author=. Nature , volume=. 2020 , publisher=

  21. [21]

    nature , volume=

    Highly accurate protein structure prediction with AlphaFold , author=. nature , volume=. 2021 , publisher=

  22. [22]

    Nature , volume=

    Accurate structure prediction of biomolecular interactions with AlphaFold 3 , author=. Nature , volume=. 2024 , publisher=

  23. [23]

    Nature methods , volume=

    OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization , author=. Nature methods , volume=. 2024 , publisher=

  24. [24]

    arXiv preprint arXiv:2512.24354 , year=

    SeedFold: Scaling Biomolecular Structure Prediction , author=. arXiv preprint arXiv:2512.24354 , year=

  25. [25]

    Science , volume=

    Accurate prediction of protein structures and interactions using a three-track neural network , author=. Science , volume=. 2021 , publisher=

  26. [26]

    Nature methods , volume=

    Accurate prediction of protein--nucleic acid complexes using RoseTTAFoldNA , author=. Nature methods , volume=. 2024 , publisher=

  27. [27]

    Advances in neural information processing systems , volume=

    Language models enable zero-shot prediction of the effects of mutations on protein function , author=. Advances in neural information processing systems , volume=

  28. [28]

    Science , volume=

    Evolutionary-scale prediction of atomic-level protein structure with a language model , author=. Science , volume=. 2023 , publisher=

  29. [29]

    Science , volume=

    Simulating 500 million years of evolution with a language model , author=. Science , volume=. 2025 , publisher=

  30. [30]

    Nature reviews molecular cell biology , volume=

    Advances in protein structure prediction and design , author=. Nature reviews molecular cell biology , volume=. 2019 , publisher=

  31. [31]

    Frontiers in Bioinformatics , volume=

    Protein structure prediction in the era of AI: Challenges and limitations when applying to in silico force spectroscopy , author=. Frontiers in Bioinformatics , volume=. 2022 , publisher=

  32. [32]

    Prediction of Protein Secondary Structure , pages=

    Beyond AlphaFold2: the impact of AI for the further improvement of protein structure prediction , author=. Prediction of Protein Secondary Structure , pages=. 2024 , publisher=

  33. [33]

    Frontiers in Pharmacology , volume=

    Protein structure prediction via deep learning: an in-depth review , author=. Frontiers in Pharmacology , volume=. 2025 , publisher=

  34. [34]

    Scientific reports , volume=

    Deep learning to predict protein backbone structure from high-resolution cryo-EM density maps , author=. Scientific reports , volume=. 2020 , publisher=

  35. [35]

    Nature methods , volume=

    CR-I-TASSER: assemble protein structures from cryo-EM density maps using deep convolutional neural networks , author=. Nature methods , volume=. 2022 , publisher=

  36. [36]

    Proceedings of the National Academy of Sciences , volume=

    DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes , author=. Proceedings of the National Academy of Sciences , volume=. 2021 , publisher=

  37. [37]

    Nature Communications , volume=

    De novo atomic protein structure modeling for cryoEM density maps using 3D transformer and HMM , author=. Nature Communications , volume=. 2024 , publisher=

  38. [38]

    Nature Communications , volume=

    Model building of protein complexes from intermediate-resolution cryo-EM maps with deep learning-guided automatic assembly , author=. Nature Communications , volume=. 2022 , publisher=

  39. [39]

    Science , volume=

    Protein structure relationships revealed by mutational analysis , author=. Science , volume=. 1964 , publisher=

  40. [40]

    Protein Engineering, Design and Selection , volume=

    Coordinated amino acid changes in homologous protein families , author=. Protein Engineering, Design and Selection , volume=. 1988 , publisher=

  41. [41]

    Proteins: Structure, Function, and Bioinformatics , volume=

    Correlated mutations and residue contacts in proteins , author=. Proteins: Structure, Function, and Bioinformatics , volume=. 1994 , publisher=

  42. [42]

    Nature Biotechnology , volume=

    Single-sequence protein structure prediction using a language model and deep learning , author=. Nature Biotechnology , volume=. 2022 , publisher=

  43. [43]

    homology

    An introduction to sequence similarity (“homology”) searching , author=. Current protocols in bioinformatics , volume=. 2013 , publisher=

  44. [44]

    Proceedings of the National Academy of Sciences , volume=

    Unexpected features of the dark proteome , author=. Proceedings of the National Academy of Sciences , volume=. 2015 , publisher=

  45. [45]

    Nature methods , volume=

    Residue-wise local quality estimation for protein models from cryo-EM maps , author=. Nature methods , volume=. 2022 , publisher=

  46. [46]

    Biological crystallography , volume=

    MolProbity: all-atom structure validation for macromolecular crystallography , author=. Biological crystallography , volume=. 2010 , publisher=

  47. [47]

    Nature methods , volume=

    Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning , author=. Nature methods , volume=. 2019 , publisher=

  48. [48]

    Nature Methods , volume=

    CryoREAD: de novo structure modeling for nucleic acids in cryo-EM maps using deep learning , author=. Nature Methods , volume=. 2023 , publisher=

  49. [49]

    Nature communications , volume=

    NuFold: end-to-end approach for RNA tertiary structure prediction with flexible nucleobase center representation , author=. Nature communications , volume=. 2025 , publisher=

  50. [50]

    Nature Methods , volume=

    DeepMainmast: integrated protocol of protein structure modeling for cryo-EM with deep learning and structure prediction , author=. Nature Methods , volume=. 2024 , publisher=

  51. [51]

    Microscopy and Microanalysis , volume=

    DMcloud: Macromolecular Structure Modeling with Local Structure Fitting for Medium to Low Resolution Cryo-EM Maps , author=. Microscopy and Microanalysis , volume=. 2025 , publisher=

  52. [52]

    Nature Methods , volume=

    DiffModeler: large macromolecular structure modeling for cryo-EM maps using a diffusion model , author=. Nature Methods , volume=. 2024 , publisher=

  53. [53]

    Nucleic acids research , volume=

    EMDataBank unified data resource for 3DEM , author=. Nucleic acids research , volume=. 2016 , publisher=

  54. [54]

    Nucleic acids research , volume=

    RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences , author=. Nucleic acids research , volume=. 2021 , publisher=

  55. [55]

    Scientific Data , volume=

    Cryo2structdata: A large labeled cryo-em density map dataset for ai-based modeling of protein structures , author=. Scientific Data , volume=. 2024 , publisher=

  56. [56]

    Nature methods , volume=

    Quantifying the local resolution of cryo-EM density maps , author=. Nature methods , volume=. 2014 , publisher=

  57. [57]

    Nature Communications , volume=

    ARMH2 is a cytosolic component of CatSper crucial for sperm function , author=. Nature Communications , volume=. 2025 , publisher=

  58. [58]

    and Vriends, M

    Moran, E. and Vriends, M. and Calvelo, M. and Hansen, T. and Pickles, I. and Xin, X. and Biezeno, M. and Armstrong, Z. and Ferras, M. and Li, L. and Lilley, A. and Harvey, R. and Schr. Influenza Neuraminidase in complex with N-Acyl Oseltamivir inhibitor , year =

  59. [59]

    2025 , publisher =

    Zhai H, Deng J, Yu W , title =. 2025 , publisher =

  60. [60]

    and Lo, Wan-Yen and Dollar, Piotr and Girshick, Ross , title =

    Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Dollar, Piotr and Girshick, Ross , title =. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) , month =. 2023 , pages =

  61. [61]

    Grounding DINO: Marrying DINO with Grounded Pre-training for Open-Set Object Detection

    Liu, Shilong and Zeng, Zhaoyang and Ren, Tianhe and Li, Feng and Zhang, Hao and Yang, Jie and Jiang, Qing and Li, Chunyuan and Yang, Jianwei and Su, Hang and Zhu, Jun and Zhang, Lei. Grounding DINO: Marrying DINO with Grounded Pre-training for Open-Set Object Detection. Computer Vision -- ECCV 2024. 2025

  62. [62]

    Cell , volume=

    Cryo-EM reveals integrin-mediated TGF- activation without release from latent TGF- , author=. Cell , volume=. 2020 , publisher=

  63. [63]

    IUCrJ , volume=

    Continuous flexibility analysis of SARS-CoV-2 spike prefusion structures , author=. IUCrJ , volume=. 2020 , publisher=

  64. [64]

    Nature methods , volume=

    EMPIAR: a public archive for raw electron microscopy image data , author=. Nature methods , volume=. 2016 , publisher=