DrugEx v3: scaffold-constrained drug design with graph transformer-based reinforcement learning. Journal of Cheminformatics, 15(1):24.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CE (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization
SGRPO expands the utility-diversity Pareto frontier in biomolecular design by using supergroup sampling and leave-one-out diversity rewards combined with utility signals.