Scalable Peptide Design via Memory-Efficient Equivariant Transformer

Jianzhu Ma; Rui Jiao; Xiangzhe Kong; Yang Liu; Yijia Zhang; Yinjun Jia; Ziyi Yang

arxiv: 2606.25006 · v2 · pith:3HD52MDEnew · submitted 2026-06-23 · 💻 cs.LG

Scalable Peptide Design via Memory-Efficient Equivariant Transformer

Rui Jiao , Xiangzhe Kong , Yinjun Jia , Yijia Zhang , Ziyi Yang , Yang Liu , Jianzhu Ma This is my paper

Pith reviewed 2026-06-26 05:30 UTC · model grok-4.3

classification 💻 cs.LG

keywords peptide designequivariant transformermemory efficient attentionlatent diffusionE(3) equivariancefull-atom generationVAEbinding affinity

0 comments

The pith

MEET backbone achieves linear memory scaling for full-atom peptide generation while preserving E(3) equivariance.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents MEET as an equivariant transformer backbone that processes atomistic peptide structures with memory linear in atom count rather than quadratic. It integrates this backbone into a VAE plus latent diffusion pipeline to co-design peptide sequences and structures under geometric constraints. A sympathetic reader would care because existing methods hit memory walls on large or full-atom systems, restricting the size of datasets and models that can be used for target-specific design. The work demonstrates that targeted reformulations of vector initialization, distance handling, and bond injection allow the model to scale while keeping the necessary symmetries and constraints intact.

Core claim

MEET maintains coupled invariant scalar and equivariant vector streams and reformulates geometric attention via global coordinate aggregation for vectors, augmented query-key dot products for distances, and sparse bond adaptation; when placed inside VAE and latent diffusion stages it yields linear memory scaling with atom count and higher-quality full-atom peptide samples on large AFDB-derived sets.

What carries the argument

MEET (Memory Efficient Equivariant Transformer), an E(3)-equivariant attention layer that couples scalar and vector features and replaces standard geometric computations with memory-efficient alternatives while preserving equivariance.

If this is right

Linear memory scaling permits systematic increases in model size and training data volume on the same hardware.
The resulting models produce peptides with measurably higher binding affinity, physical validity, and sample diversity than prior peptide design methods.
Full-atom geometric constraints remain satisfied during sequence-structure co-design.
The same backbone can be used for both encoding and denoising stages without breaking the latent generative framework.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The linear scaling could make routine design of peptides with hundreds of atoms practical on modest GPUs.
The same memory-efficient attention pattern might transfer to other atomistic generative tasks such as small-molecule or protein design.
If the equivariance holds at scale, downstream physics-based filtering steps could be reduced without loss of accuracy.

Load-bearing premise

The three specific attention reformulations preserve E(3) equivariance and full-atom geometric constraints through the entire VAE and diffusion pipeline.

What would settle it

A direct measurement showing that memory usage grows quadratically with atom count or that generated structures violate rotational or translational equivariance under the MEET pipeline.

read the original abstract

Target-specific peptide design requires sequence and structure co-design under full atom geometric constraints. Latent generative frameworks offer an effective route for this problem by compressing fine grained atomic structures into block level latent representations and performing conditional generation in a compact latent space. However, the scalability of such systems depends heavily on the geometric backbone used throughout their encoding, decoding, and denoising components. We introduce MEET (Memory Efficient Equivariant Transformer), an E(3) equivariant backbone for scalable atomistic peptide modeling. MEET maintains coupled invariant scalar and equivariant vector feature streams, while reformulating geometric computation around memory efficient attention. It initializes vector features through global coordinate aggregation, incorporates pairwise distances through augmented query and key dot products, and injects covalent bond information through sparse bond adaptation. Integrated into a VAE and latent diffusion pipeline for full atom peptide generation, MEET achieves linear memory scaling with atom count and improves generation quality over existing peptide design methods. Experiments on large scale AFDB derived datasets further show that the proposed backbone supports systematic model and data scaling, leading to better binding affinity, physical validity, and sample diversity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MEET gives a concrete memory-efficient equivariant backbone for peptide co-design that scales linearly, but the equivariance of its three geometric reformulations is the part that still needs direct checking.

read the letter

MEET is an E(3)-equivariant transformer that keeps separate scalar and vector streams but rewrites the geometric parts of attention to cut memory use. It initializes vectors via global coordinate aggregation, folds distances into augmented query-key dots, and adds bonds through sparse adaptation. These pieces sit inside a VAE plus latent diffusion pipeline for full-atom peptide generation.

The practical payoff is linear memory scaling with atom count plus reported gains in binding affinity, physical validity, and diversity on AFDB-scale data. The scaling experiments also show that larger models and more data continue to help, which is the kind of result that matters for real peptide work.

The soft spot is the equivariance claim. The abstract states that the three changes preserve the properties needed for geometric constraints, yet it supplies no explicit transformation check or empirical test. If any of them drifts under rotations or translations, the downstream validity numbers would be harder to trust. The quality improvements also need tighter baseline comparisons with matched resources.

People building geometric models for biomolecules will want the architecture details. The paper is coherent on its own terms and addresses a real bottleneck, so it deserves a serious referee even if the equivariance step requires more evidence in revision.

Referee Report

2 major / 2 minor

Summary. The paper introduces MEET, an E(3)-equivariant transformer backbone designed for memory-efficient full-atom peptide modeling. It integrates this backbone into a VAE-latent diffusion pipeline for target-specific peptide sequence and structure co-design, claiming linear memory scaling with atom count via three reformulations (global coordinate aggregation for vector initialization, augmented query-key dot products for distances, and sparse bond adaptation), while improving generation quality, binding affinity, physical validity, and sample diversity over prior methods on large AFDB-derived datasets.

Significance. If the equivariance preservation and linear scaling hold, the work would enable systematic scaling of geometric generative models for peptides, addressing a key bottleneck in atomistic design and potentially improving therapeutic applications through better physical fidelity and diversity.

major comments (2)

[§3 / abstract] The central claim of preserved E(3) equivariance (and thus geometric fidelity for the reported gains in binding affinity and validity) rests on the three reformulations described in the abstract and §3. No explicit transformation proof under rotations/translations is supplied for global coordinate aggregation, augmented QK dot products, or sparse bond adaptation, nor is an empirical equivariance test (e.g., invariance of scalar outputs or equivariance of vector outputs under random SO(3) actions) reported in the methods or experiments sections. This is load-bearing for the headline scaling and quality claims.
[§4.2 / Table 2] Table 2 and Figure 4 report quality improvements and linear memory scaling, but the experimental protocol does not specify how atom counts were varied while holding other factors fixed, nor whether the VAE encoder/decoder and latent diffusion denoiser were each verified separately for equivariance after the reformulations. Without these controls, the attribution of gains to the backbone versus other pipeline choices remains unclear.

minor comments (2)

[§3.1] Notation for the coupled scalar and vector streams is introduced without a compact table of symbols; a small notation table would improve readability when comparing to prior equivariant transformers.
[§4.1] The AFDB dataset construction and filtering criteria (e.g., resolution, sequence identity) are described only at high level; explicit counts and exclusion rules should be added to §4.1 for reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments. We address each major comment below and will revise the manuscript to strengthen the presentation of equivariance and experimental controls.

read point-by-point responses

Referee: [§3 / abstract] The central claim of preserved E(3) equivariance (and thus geometric fidelity for the reported gains in binding affinity and validity) rests on the three reformulations described in the abstract and §3. No explicit transformation proof under rotations/translations is supplied for global coordinate aggregation, augmented QK dot products, or sparse bond adaptation, nor is an empirical equivariance test (e.g., invariance of scalar outputs or equivariance of vector outputs under random SO(3) actions) reported in the methods or experiments sections. This is load-bearing for the headline scaling and quality claims.

Authors: We acknowledge that the original manuscript does not contain an explicit mathematical proof of E(3) equivariance preservation for the three reformulations nor an empirical test. The MEET design maintains separate invariant scalar and equivariant vector streams, with global coordinate aggregation operating on relative positions (translation-invariant), augmented QK dot products using squared distances (rotation- and translation-invariant), and sparse bond adaptation respecting the fixed molecular graph (equivariant under E(3) actions on coordinates). To address the concern directly, we will add a formal proof in the supplementary material showing each operation is E(3)-equivariant and include an empirical verification experiment demonstrating correct transformation of vector features under random SO(3) rotations. These additions will be included in the revised version. revision: yes
Referee: [§4.2 / Table 2] Table 2 and Figure 4 report quality improvements and linear memory scaling, but the experimental protocol does not specify how atom counts were varied while holding other factors fixed, nor whether the VAE encoder/decoder and latent diffusion denoiser were each verified separately for equivariance after the reformulations. Without these controls, the attribution of gains to the backbone versus other pipeline choices remains unclear.

Authors: We agree that the experimental protocol description in §4.2 is insufficiently detailed. In the revised manuscript we will expand the section to explicitly describe the scaling protocol (models trained on peptide subsets with increasing atom counts while holding batch size, learning rate, and other hyperparameters fixed) and report separate equivariance verification results for the VAE encoder, VAE decoder, and latent diffusion denoiser. This will strengthen attribution of the observed linear scaling and quality gains to the MEET backbone. revision: yes

Circularity Check

0 steps flagged

No circularity: claims rest on empirical scaling experiments, not self-referential definitions or fitted predictions.

full rationale

The paper presents MEET as a new E(3)-equivariant transformer backbone whose memory-efficient reformulations (global coordinate aggregation, augmented QK dots, sparse bond adaptation) are architectural choices whose equivariance is asserted but not derived from prior results by the same authors. All headline performance claims (linear memory scaling, improved binding affinity, physical validity, diversity) are framed as outcomes of experiments on AFDB-derived datasets inside a VAE+latent-diffusion pipeline. No equations, parameters, or uniqueness theorems are shown reducing predictions to inputs by construction, and no self-citation chain is invoked to justify the central geometric properties. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5741 in / 1040 out tokens · 31641 ms · 2026-06-26T05:30:44.437139+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 2 canonical work pages · 2 internal anchors

[1]

Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences.Chemical Science, 15(9):3130–3139, 2024

Martin Buttenschoen, Garrett M Morris, and Charlotte M Deane. Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences.Chemical Science, 15(9):3130–3139, 2024

2024
[2]

Pyrosetta: a script-based interface for implementing molecular modeling algorithms using rosetta.Bioinformatics, 26(5):689–691, 2010

Sidhartha Chaudhury, Sergey Lyskov, and Jeffrey J Gray. Pyrosetta: a script-based interface for implementing molecular modeling algorithms using rosetta.Bioinformatics, 26(5):689–691, 2010

2010
[3]

Flashattention: Fast and memory-efficient exact attention with io-awareness.Advances in neural information processing systems, 35:16344–16359, 2022

Tri Dao, Dan Fu, Stefano Ermon, Atri Rudra, and Christopher Ré. Flashattention: Fast and memory-efficient exact attention with io-awareness.Advances in neural information processing systems, 35:16344–16359, 2022

2022
[4]

Vector neurons: A general framework for so (3)-equivariant networks

Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, and Leonidas J Guibas. Vector neurons: A general framework for so (3)-equivariant networks. InProceedings of the IEEE/CVF international conference on computer vision, pages 12200–12209, 2021

2021
[5]

Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

2020
[6]

An equivariant pretrained transformer for unified 3d molecular representation learning.Nature Communications, 2026

Rui Jiao, Xiangzhe Kong, Li Zhang, Ziyang Yu, Fangyuan Ren, Wenjuan Tan, Wenbing Huang, and Yang Liu. An equivariant pretrained transformer for unified 3d molecular representation learning.Nature Communications, 2026

2026
[7]

Full-atom peptide design with geometric latent diffusion.Advances in Neural Information Processing Systems, 37:74808–74839, 2025

Xiangzhe Kong, Yinjun Jia, Wenbing Huang, and Yang Liu. Full-atom peptide design with geometric latent diffusion.Advances in Neural Information Processing Systems, 37:74808–74839, 2025

2025
[8]

Unimomo: Unified generative modeling of 3d molecules for de novo binder design

Xiangzhe Kong, Zishen Zhang, Ziting Zhang, Rui Jiao, Jianzhu Ma, Kai Liu, Wenbing Huang, and Yang Liu. Unimomo: Unified generative modeling of 3d molecules for de novo binder design. InForty-second International Conference on Machine Learning, 2025

2025
[9]

Full-atom peptide design based on multi-modal flow matching

Jiahan Li, Chaoran Cheng, Zuofan Wu, Ruihan Guo, Shitong Luo, Zhizhou Ren, Jian Peng, and Jianzhu Ma. Full-atom peptide design based on multi-modal flow matching. InInternational Conference on Machine Learning, pages 27615–27640. PMLR, 2024. 11

2024
[10]

Equiformer: Equivariant graph attention transformer for 3d atomistic graphs

Yi-Lun Liao and Tess Smidt. Equiformer: Equivariant graph attention transformer for 3d atomistic graphs. InInternational Conference on Learning Representations, 2023. URLhttps://openreview.net/forum?id= KwmPfARgOTD

2023
[11]

Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, and Matthew Le. Flow matching for generative modeling. InThe Eleventh International Conference on Learning Representations, 2023. URLhttps: //openreview.net/forum?id=PqvMRDCJT9t

2023
[12]

Improved denoising diffusion probabilistic models

Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. InInterna- tional conference on machine learning, pages 8162–8171. PMLR, 2021

2021
[13]

Scalable diffusion models with transformers

William Peebles and Saining Xie. Scalable diffusion models with transformers. InProceedings of the IEEE/CVF international conference on computer vision, pages 4195–4205, 2023

2023
[14]

High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022

2022
[15]

E (n) equivariant graph neural networks

Vıctor Garcia Satorras, Emiel Hoogeboom, and Max Welling. E (n) equivariant graph neural networks. In International conference on machine learning, pages 9323–9332. PMLR, 2021

2021
[16]

Equivariant message passing for the prediction of tensorial properties and molecular spectra

Kristof Schütt, Oliver Unke, and Michael Gastegger. Equivariant message passing for the prediction of tensorial properties and molecular spectra. InInternational conference on machine learning, pages 9377–9388. PMLR, 2021

2021
[17]

Schnet–a deep learning architecture for molecules and materials.The Journal of chemical physics, 148(24), 2018

Kristof T Schütt, Huziel E Sauceda, P-J Kindermans, Alexandre Tkatchenko, and K-R Müller. Schnet–a deep learning architecture for molecules and materials.The Journal of chemical physics, 148(24), 2018

2018
[18]

GLU Variants Improve Transformer

Noam Shazeer. Glu variants improve transformer.arXiv preprint arXiv:2002.05202, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2002
[19]

Score- based generative modeling through stochastic differential equations

Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score- based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations, 2021. URLhttps://openreview.net/forum?id=PxTIG12RRHS

2021
[20]

Equivariant transformers for neural network based molecular poten- tials

Philipp Thölke and Gianni De Fabritiis. Equivariant transformers for neural network based molecular poten- tials. InInternational Conference on Learning Representations, 2022. URLhttps://openreview.net/forum? id=zNHzqZ9wrRB

2022
[21]

Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds

Nathaniel Thomas, Tess Smidt, Steven Kearnes, Lusann Yang, Li Li, Kai Kohlhoff, and Patrick Riley. Ten- sor field networks: Rotation-and translation-equivariant neural networks for 3d point clouds.arXiv preprint arXiv:1802.08219, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[22]

Alphafold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.Nucleic acids research, 50(D1):D439–D444, 2022

Mihaly Varadi, Stephen Anyango, Mandar Deshpande, Sreenath Nair, Cindy Natassia, Galabina Yordanova, David Yuan, Oana Stroe, Gemma Wood, Agata Laydon, et al. Alphafold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.Nucleic acids research, 50(D1):D439–D444, 2022

2022
[23]

Target-specific de novo peptide binder design with diffpepbuilder.Journal of Chemical Information and Modeling, 64(24):9135–9149, 2024

Fanhao Wang, Yuzhe Wang, Laiyi Feng, Changsheng Zhang, and Luhua Lai. Target-specific de novo peptide binder design with diffpepbuilder.Journal of Chemical Information and Modeling, 64(24):9135–9149, 2024

2024
[24]

Therapeutic peptides: current applications and future directions.Signal transduction and targeted therapy, 7(1):48, 2022

Lei Wang, Nanxi Wang, Wenping Zhang, Xurui Cheng, Zhibin Yan, Gang Shao, Xi Wang, Rui Wang, and Caiyun Fu. Therapeutic peptides: current applications and future directions.Signal transduction and targeted therapy, 7(1):48, 2022

2022
[25]

Flashbias: Fast computation of attention with bias

Haixu Wu, Minghao Guo, Yuezhou Ma, Yuanxu Sun, Jianmin Wang, Wojciech Matusik, and Mingsheng Long. Flashbias: Fast computation of attention with bias. InThe Thirty-ninth Annual Conference on Neural Informa- tion Processing Systems, 2026. URLhttps://openreview.net/forum?id=7L4NvUtZY3

2026
[26]

Root mean square layer normalization.Advances in neural information processing systems, 32, 2019

Biao Zhang and Rico Sennrich. Root mean square layer normalization.Advances in neural information processing systems, 32, 2019. 12 Appendix A Introduction of Latent Generative Framework We instantiateMeetin a two-stage latent generative framework for target-specific full-atom peptide design. The overall design follows the motivation of latent generative m...

2019

[1] [1]

Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences.Chemical Science, 15(9):3130–3139, 2024

Martin Buttenschoen, Garrett M Morris, and Charlotte M Deane. Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences.Chemical Science, 15(9):3130–3139, 2024

2024

[2] [2]

Pyrosetta: a script-based interface for implementing molecular modeling algorithms using rosetta.Bioinformatics, 26(5):689–691, 2010

Sidhartha Chaudhury, Sergey Lyskov, and Jeffrey J Gray. Pyrosetta: a script-based interface for implementing molecular modeling algorithms using rosetta.Bioinformatics, 26(5):689–691, 2010

2010

[3] [3]

Flashattention: Fast and memory-efficient exact attention with io-awareness.Advances in neural information processing systems, 35:16344–16359, 2022

Tri Dao, Dan Fu, Stefano Ermon, Atri Rudra, and Christopher Ré. Flashattention: Fast and memory-efficient exact attention with io-awareness.Advances in neural information processing systems, 35:16344–16359, 2022

2022

[4] [4]

Vector neurons: A general framework for so (3)-equivariant networks

Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, and Leonidas J Guibas. Vector neurons: A general framework for so (3)-equivariant networks. InProceedings of the IEEE/CVF international conference on computer vision, pages 12200–12209, 2021

2021

[5] [5]

Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

2020

[6] [6]

An equivariant pretrained transformer for unified 3d molecular representation learning.Nature Communications, 2026

Rui Jiao, Xiangzhe Kong, Li Zhang, Ziyang Yu, Fangyuan Ren, Wenjuan Tan, Wenbing Huang, and Yang Liu. An equivariant pretrained transformer for unified 3d molecular representation learning.Nature Communications, 2026

2026

[7] [7]

Full-atom peptide design with geometric latent diffusion.Advances in Neural Information Processing Systems, 37:74808–74839, 2025

Xiangzhe Kong, Yinjun Jia, Wenbing Huang, and Yang Liu. Full-atom peptide design with geometric latent diffusion.Advances in Neural Information Processing Systems, 37:74808–74839, 2025

2025

[8] [8]

Unimomo: Unified generative modeling of 3d molecules for de novo binder design

Xiangzhe Kong, Zishen Zhang, Ziting Zhang, Rui Jiao, Jianzhu Ma, Kai Liu, Wenbing Huang, and Yang Liu. Unimomo: Unified generative modeling of 3d molecules for de novo binder design. InForty-second International Conference on Machine Learning, 2025

2025

[9] [9]

Full-atom peptide design based on multi-modal flow matching

Jiahan Li, Chaoran Cheng, Zuofan Wu, Ruihan Guo, Shitong Luo, Zhizhou Ren, Jian Peng, and Jianzhu Ma. Full-atom peptide design based on multi-modal flow matching. InInternational Conference on Machine Learning, pages 27615–27640. PMLR, 2024. 11

2024

[10] [10]

Equiformer: Equivariant graph attention transformer for 3d atomistic graphs

Yi-Lun Liao and Tess Smidt. Equiformer: Equivariant graph attention transformer for 3d atomistic graphs. InInternational Conference on Learning Representations, 2023. URLhttps://openreview.net/forum?id= KwmPfARgOTD

2023

[11] [11]

Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, and Matthew Le. Flow matching for generative modeling. InThe Eleventh International Conference on Learning Representations, 2023. URLhttps: //openreview.net/forum?id=PqvMRDCJT9t

2023

[12] [12]

Improved denoising diffusion probabilistic models

Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. InInterna- tional conference on machine learning, pages 8162–8171. PMLR, 2021

2021

[13] [13]

Scalable diffusion models with transformers

William Peebles and Saining Xie. Scalable diffusion models with transformers. InProceedings of the IEEE/CVF international conference on computer vision, pages 4195–4205, 2023

2023

[14] [14]

High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022

2022

[15] [15]

E (n) equivariant graph neural networks

Vıctor Garcia Satorras, Emiel Hoogeboom, and Max Welling. E (n) equivariant graph neural networks. In International conference on machine learning, pages 9323–9332. PMLR, 2021

2021

[16] [16]

Equivariant message passing for the prediction of tensorial properties and molecular spectra

Kristof Schütt, Oliver Unke, and Michael Gastegger. Equivariant message passing for the prediction of tensorial properties and molecular spectra. InInternational conference on machine learning, pages 9377–9388. PMLR, 2021

2021

[17] [17]

Schnet–a deep learning architecture for molecules and materials.The Journal of chemical physics, 148(24), 2018

Kristof T Schütt, Huziel E Sauceda, P-J Kindermans, Alexandre Tkatchenko, and K-R Müller. Schnet–a deep learning architecture for molecules and materials.The Journal of chemical physics, 148(24), 2018

2018

[18] [18]

GLU Variants Improve Transformer

Noam Shazeer. Glu variants improve transformer.arXiv preprint arXiv:2002.05202, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2002

[19] [19]

Score- based generative modeling through stochastic differential equations

Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score- based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations, 2021. URLhttps://openreview.net/forum?id=PxTIG12RRHS

2021

[20] [20]

Equivariant transformers for neural network based molecular poten- tials

Philipp Thölke and Gianni De Fabritiis. Equivariant transformers for neural network based molecular poten- tials. InInternational Conference on Learning Representations, 2022. URLhttps://openreview.net/forum? id=zNHzqZ9wrRB

2022

[21] [21]

Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds

Nathaniel Thomas, Tess Smidt, Steven Kearnes, Lusann Yang, Li Li, Kai Kohlhoff, and Patrick Riley. Ten- sor field networks: Rotation-and translation-equivariant neural networks for 3d point clouds.arXiv preprint arXiv:1802.08219, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[22] [22]

Alphafold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.Nucleic acids research, 50(D1):D439–D444, 2022

Mihaly Varadi, Stephen Anyango, Mandar Deshpande, Sreenath Nair, Cindy Natassia, Galabina Yordanova, David Yuan, Oana Stroe, Gemma Wood, Agata Laydon, et al. Alphafold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.Nucleic acids research, 50(D1):D439–D444, 2022

2022

[23] [23]

Target-specific de novo peptide binder design with diffpepbuilder.Journal of Chemical Information and Modeling, 64(24):9135–9149, 2024

Fanhao Wang, Yuzhe Wang, Laiyi Feng, Changsheng Zhang, and Luhua Lai. Target-specific de novo peptide binder design with diffpepbuilder.Journal of Chemical Information and Modeling, 64(24):9135–9149, 2024

2024

[24] [24]

Therapeutic peptides: current applications and future directions.Signal transduction and targeted therapy, 7(1):48, 2022

Lei Wang, Nanxi Wang, Wenping Zhang, Xurui Cheng, Zhibin Yan, Gang Shao, Xi Wang, Rui Wang, and Caiyun Fu. Therapeutic peptides: current applications and future directions.Signal transduction and targeted therapy, 7(1):48, 2022

2022

[25] [25]

Flashbias: Fast computation of attention with bias

Haixu Wu, Minghao Guo, Yuezhou Ma, Yuanxu Sun, Jianmin Wang, Wojciech Matusik, and Mingsheng Long. Flashbias: Fast computation of attention with bias. InThe Thirty-ninth Annual Conference on Neural Informa- tion Processing Systems, 2026. URLhttps://openreview.net/forum?id=7L4NvUtZY3

2026

[26] [26]

Root mean square layer normalization.Advances in neural information processing systems, 32, 2019

Biao Zhang and Rico Sennrich. Root mean square layer normalization.Advances in neural information processing systems, 32, 2019. 12 Appendix A Introduction of Latent Generative Framework We instantiateMeetin a two-stage latent generative framework for target-specific full-atom peptide design. The overall design follows the motivation of latent generative m...

2019