pith. machine review for the scientific record. sign in

arxiv: 2410.19471 · v1 · submitted 2024-10-25 · 💻 cs.LG · cs.AI

Recognition: unknown

Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization

Authors on Pith no claims yet
classification 💻 cs.LG cs.AI
keywords modelssequencesdiversityimprovingproteinmpnnreferencedesigndirect
0
0 comments X
read the original abstract

Inverse folding models play an important role in structure-based design by predicting amino acid sequences that fold into desired reference structures. Models like ProteinMPNN, a message-passing encoder-decoder model, are trained to reliably produce new sequences from a reference structure. However, when applied to peptides, these models are prone to generating repetitive sequences that do not fold into the reference structure. To address this, we fine-tune ProteinMPNN to produce diverse and structurally consistent peptide sequences via Direct Preference Optimization (DPO). We derive two enhancements to DPO: online diversity regularization and domain-specific priors. Additionally, we develop a new understanding on improving diversity in decoder models. When conditioned on OpenFold generated structures, our fine-tuned models achieve state-of-the-art structural similarity scores, improving base ProteinMPNN by at least 8%. Compared to standard DPO, our regularized method achieves up to 20% higher sequence diversity with no loss in structural similarity score.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization

    cs.CE 2026-05 unverdicted novelty 6.0

    SGRPO expands the utility-diversity Pareto frontier in biomolecular design by using supergroup sampling and leave-one-out diversity rewards combined with utility signals.