Design by adaptive sampling
2 Pith papers cite this work. Polarity classification is still indexing.
citation roles: background (1)
citation polarities: background (1)
years: 2026 (2)
citing papers explorer
- Self-Improvement Imitation with Biologically Guided Search for Protein Design Under Oracle Budgets
  SILO outperforms five baselines on eight protein fitness landscapes by using trajectory-level imitation on trajectories selected via hierarchical beam search and biological proxy guidance under limited oracle budgets.
- Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization
  SGRPO is a GRPO-style framework that constructs set-level diversity rewards via supergroup sampling and leave-one-out redistribution to expand the utility-diversity Pareto frontier in biomolecular design tasks.