pith. machine review for the scientific record.

arXiv: 1905.13372 · v1 · submitted 2019-05-31 · cs.LG · cs.AI · q-bio.MN · q-bio.QM · stat.ML

Recognition: unknown

MolecularRNN: Generating realistic molecular graphs with optimized properties

keywords: model, molecular, molecules, graphs, molecularrnn, problem, properties, realistic

Designing new molecules with a set of predefined properties is a core problem in modern drug discovery and development. There is a growing need for de novo design methods that address this problem. We present MolecularRNN, a graph recurrent generative model for molecular structures. Our model generates diverse, realistic molecular graphs after likelihood pretraining on a large database of molecules. We analyze our pretrained models on large-scale generated datasets of 1 million samples. Further, the model is tuned with a policy gradient algorithm, given a critic that estimates the reward for the property of interest. We show a significant distribution shift toward the desired range for lipophilicity, drug-likeness, and melting point, outperforming state-of-the-art works. With rejection sampling based on valency constraints, our model yields 100% validity. Moreover, we show that invalid molecules provide a rich signal to the model through a structure penalty in our reinforcement learning pipeline.
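
As a rough illustration of the valency-based rejection sampling mentioned in the abstract, the sketch below resamples a proposed bond order until it no longer exceeds either atom's maximum valence. This is not the authors' implementation: the valence table, the function name `sample_bond_with_rejection`, and the example probabilities are all hypothetical, and a real generator would draw the probabilities from the trained model.

```python
import random

# Assumed maximum valences for a few common atoms (illustrative only,
# not taken from the paper).
MAX_VALENCE = {"C": 4, "N": 3, "O": 2, "F": 1}


def sample_bond_with_rejection(atom_a, used_a, atom_b, used_b, probs, rng,
                               max_tries=100):
    """Sample a bond order (index into `probs`: 0 = no bond, 1, 2, 3).

    `used_a` and `used_b` are the valences the two atoms have already
    spent. Any sampled order that would push either atom past its maximum
    valence is rejected and resampled.
    """
    orders = list(range(len(probs)))
    for _ in range(max_tries):
        order = rng.choices(orders, weights=probs, k=1)[0]
        if (used_a + order <= MAX_VALENCE[atom_a]
                and used_b + order <= MAX_VALENCE[atom_b]):
            return order
    # Fall back to "no bond", which can never violate a valency constraint.
    return 0


if __name__ == "__main__":
    rng = random.Random(0)
    # An oxygen with one valence already used can accept at most a single bond.
    print(sample_bond_with_rejection("O", 1, "C", 0, [0.1, 0.1, 0.4, 0.4], rng))
```

Because every accepted bond respects the valence table, any graph built this way is valid with respect to that table by construction, which is the intuition behind the 100% validity claim.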

This paper has not been read by Pith yet.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. How Creative Are Large Language Models in Generating Molecules?

    cs.CL 2026-04 unverdicted novelty 7.0

    Large language models exhibit distinct creative patterns in molecule generation, including higher constraint satisfaction when more constraints are added, and this is the first work to reframe molecule generation abil...

  2. Equivariant Efficient Joint Discrete and Continuous MeanFlow for Molecular Graph Generation

    cs.LG 2026-04 unverdicted novelty 6.0

    EQUIMF is a unified equivariant framework that jointly generates discrete topologies and continuous geometries in molecular graphs via synchronized MeanFlow dynamics for efficient few-step sampling.

  3. Fine-Grained Graph Generation through Latent Mixture Scheduling

    cs.AI 2026-05 unverdicted novelty 4.0

    A novel CVAE with mixture scheduling achieves fine-grained structural control in graph generation, showing high quality and controllability on five datasets.