pith. sign in

arxiv: 2303.03543 · v1 · pith:OKH725K2new · submitted 2023-03-06 · 🧬 q-bio.BM · cs.LG

3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction

classification 🧬 q-bio.BM cs.LG
keywords equivariantmodelaffinityatomdesignmodelsstructurestarget-aware
0
0 comments X
read the original abstract

Rich data and powerful machine learning models allow us to design drugs for a specific protein target \textit{in silico}. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or the autoregressive sampling process, which are not equivariant to rotation or easily violate geometric constraints resulting in unrealistic structures. In this work, we develop a 3D equivariant diffusion model to solve the above challenges. To achieve target-aware molecule design, our method learns a joint generative process of both continuous atom coordinates and categorical atom types with a SE(3)-equivariant network. Moreover, we show that our model can serve as an unsupervised feature extractor to estimate the binding affinity under proper parameterization, which provides an effective way for drug screening. To evaluate our model, we propose a comprehensive framework to evaluate the quality of sampled molecules from different dimensions. Empirical studies show our model could generate molecules with more realistic 3D structures and better affinities towards the protein targets, and improve binding affinity ranking and prediction without retraining.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 11 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. DGLD: Domain-Gated Latent Diffusion for the Discovery of Novel Energetic Materials

    physics.chem-ph 2026-05 unverdicted novelty 7.0

    DGLD applies domain-gated latent diffusion with label-quality gating and multi-task guidance to discover 12 novel energetic material leads validated by DFT, outperforming SMILES-LSTM, SELFIES-GA, and REINVENT baseline...

  2. From Holo Pockets to Electron Density: GPT-style Drug Design with Density

    cs.AI 2026-05 unverdicted novelty 7.0

    EDMolGPT generates drug-like molecules from low-resolution electron density point clouds of holo binding pockets and shows effectiveness across 101 biological targets.

  3. Multigrid Training for Molecular Generation using Graph Neural Networks

    cs.LG 2026-06 unverdicted novelty 6.0

    Multigrid training accelerates convergence and improves generalization for receptor-conditioned 3D ligand generation by transferring parameters from coarse to fine graph and voxel resolutions.

  4. Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling

    q-bio.BM 2026-06 unverdicted novelty 6.0

    GeoCoupling optimizes temporal couplings between modalities in biomolecular generative models and outperforms synchronous baselines on drug design and protein design tasks.

  5. From Holo Pockets to Electron Density: GPT-style Drug Design with Density

    cs.AI 2026-05 unverdicted novelty 6.0

    EDMolGPT generates molecules from low-resolution electron density for de novo structure-based drug design, claiming better performance than pocket-based methods on 101 targets.

  6. Toward Better Geometric Representations for Molecule Generative Models

    cs.LG 2026-05 unverdicted novelty 6.0

    LENSEs improves representation-conditioned molecule generation by jointly training a multi-level representation head, perceptual loss, and REPA alignment on pretrained encoders, yielding 97.28% validity and 98.51% sta...

  7. Flow-Direct: Feedback-Efficient and Reusable Guidance for Flow Models via Non-Parametric Guidance Field

    cs.LG 2026-05 unverdicted novelty 6.0

    Flow-Direct constructs a reusable non-parametric guidance field from the log-density ratio of base and target distributions using all accumulated reward samples for feedback-efficient guidance in flow models.

  8. Structure-guided molecular design with contrastive 3D protein-ligand learning

    cs.LG 2026-04 unverdicted novelty 6.0

    An SE(3)-equivariant transformer encodes 3D protein-ligand interactions via contrastive learning for zero-shot virtual screening, and these embeddings condition a multimodal chemical language model to autoregressively...

  9. D-Flow: Multi-modality Flow Matching for D-peptide Design

    cs.CE 2024-11 unverdicted novelty 6.0

    D-Flow applies multi-modality flow matching and a mirror-image data augmentation to generate D-peptides with 10.2% higher sequence identity and 24.31% top affinity on the PepMerge benchmark.

  10. Synergistic Benefits of Joint Molecule Generation and Property Prediction

    cs.LG 2025-04 unverdicted novelty 5.0

    Hyformer jointly models molecule generation and property prediction via alternating attention and joint pre-training, showing synergistic gains in conditional sampling, OOD prediction, and a drug design case for antim...

  11. Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling

    cs.LG 2026-05 unverdicted novelty 4.0

    FTDiff applies GRPO-style RL fine-tuning and fast sampling to a time-free pretrained diffusion model to generate valid diverse high-quality molecules balancing multiple drug design objectives in SBDD.