3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction
read the original abstract
Rich data and powerful machine learning models allow us to design drugs for a specific protein target \textit{in silico}. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or the autoregressive sampling process, which are not equivariant to rotation or easily violate geometric constraints resulting in unrealistic structures. In this work, we develop a 3D equivariant diffusion model to solve the above challenges. To achieve target-aware molecule design, our method learns a joint generative process of both continuous atom coordinates and categorical atom types with a SE(3)-equivariant network. Moreover, we show that our model can serve as an unsupervised feature extractor to estimate the binding affinity under proper parameterization, which provides an effective way for drug screening. To evaluate our model, we propose a comprehensive framework to evaluate the quality of sampled molecules from different dimensions. Empirical studies show our model could generate molecules with more realistic 3D structures and better affinities towards the protein targets, and improve binding affinity ranking and prediction without retraining.
This paper has not been read by Pith yet.
Forward citations
Cited by 11 Pith papers
-
DGLD: Domain-Gated Latent Diffusion for the Discovery of Novel Energetic Materials
DGLD applies domain-gated latent diffusion with label-quality gating and multi-task guidance to discover 12 novel energetic material leads validated by DFT, outperforming SMILES-LSTM, SELFIES-GA, and REINVENT baseline...
-
From Holo Pockets to Electron Density: GPT-style Drug Design with Density
EDMolGPT generates drug-like molecules from low-resolution electron density point clouds of holo binding pockets and shows effectiveness across 101 biological targets.
-
Multigrid Training for Molecular Generation using Graph Neural Networks
Multigrid training accelerates convergence and improves generalization for receptor-conditioned 3D ligand generation by transferring parameters from coarse to fine graph and voxel resolutions.
-
Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling
GeoCoupling optimizes temporal couplings between modalities in biomolecular generative models and outperforms synchronous baselines on drug design and protein design tasks.
-
From Holo Pockets to Electron Density: GPT-style Drug Design with Density
EDMolGPT generates molecules from low-resolution electron density for de novo structure-based drug design, claiming better performance than pocket-based methods on 101 targets.
-
Toward Better Geometric Representations for Molecule Generative Models
LENSEs improves representation-conditioned molecule generation by jointly training a multi-level representation head, perceptual loss, and REPA alignment on pretrained encoders, yielding 97.28% validity and 98.51% sta...
-
Flow-Direct: Feedback-Efficient and Reusable Guidance for Flow Models via Non-Parametric Guidance Field
Flow-Direct constructs a reusable non-parametric guidance field from the log-density ratio of base and target distributions using all accumulated reward samples for feedback-efficient guidance in flow models.
-
Structure-guided molecular design with contrastive 3D protein-ligand learning
An SE(3)-equivariant transformer encodes 3D protein-ligand interactions via contrastive learning for zero-shot virtual screening, and these embeddings condition a multimodal chemical language model to autoregressively...
-
D-Flow: Multi-modality Flow Matching for D-peptide Design
D-Flow applies multi-modality flow matching and a mirror-image data augmentation to generate D-peptides with 10.2% higher sequence identity and 24.31% top affinity on the PepMerge benchmark.
-
Synergistic Benefits of Joint Molecule Generation and Property Prediction
Hyformer jointly models molecule generation and property prediction via alternating attention and joint pre-training, showing synergistic gains in conditional sampling, OOD prediction, and a drug design case for antim...
-
Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling
FTDiff applies GRPO-style RL fine-tuning and fast sampling to a time-free pretrained diffusion model to generate valid diverse high-quality molecules balancing multiple drug design objectives in SBDD.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.