Pepti-drift: Toxicity-Repulsive Drifting for Antigen-Conditioned Discrete Peptide Generation

Hikaru Shindo; Jun Jin Choong; Kaushalya Madhawa; Keisuke Ozawa; Takashi Fujiwara

arxiv: 2606.27824 · v2 · pith:QROS64YGnew · submitted 2026-06-26 · 💻 cs.LG · cs.AI

Pepti-drift: Toxicity-Repulsive Drifting for Antigen-Conditioned Discrete Peptide Generation

Takashi Fujiwara , Hikaru Shindo , Kaushalya Madhawa , Jun Jin Choong , Keisuke Ozawa This is my paper

Pith reviewed 2026-06-30 10:00 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords peptide generationtoxicity reductionantigen-specific designlatent space driftingtherapeutic peptidesdiscrete sequence generationmachine learning for drug design

0 comments

The pith

A single antigen-conditioned drift in peptide latent space attracts binding features while repelling toxicity regions after a warm-up phase.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Pepti-drift as a framework that refines discrete peptide candidates in an embedding space by pulling them toward antigen-matched binding examples and pushing them away from toxicity-linked areas. Binding and toxicity features often overlap, so the method first trains on attraction alone before adding repulsion to stabilize the process. This produces peptides that are valid, unique, and diverse while showing lower predicted toxicity and hemolysis across length ranges yet preserving binding signals. The approach runs much faster than earlier generators and avoids reusing sequences across different antigens.

Core claim

Pepti-drift learns to attract generated peptide latents toward antigen-matched binding peptides while repelling them from toxicity-associated regions in a peptide embedding space; a warm-up strategy first learns binding-oriented attraction and then increases toxicity repulsion, enabling a single drift step to produce valid, diverse peptides with reduced toxicity and retained binding signal.

What carries the argument

toxicity-repulsive drifting: a latent-space operation that attracts to binding peptides and repels from toxicity regions after warm-up training

If this is right

Generation runs 16.2 times faster than PepMLM and 1,092 times faster than PepTune.
All outputs are valid sequences with 98.1 percent uniqueness and the highest observed sequence diversity.
Toxicity and hemolysis risk drop across most peptide-length ranges while target binding predictions stay intact.
Near-zero reuse of sequences across different antigens occurs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same single-step refinement could apply to other design tasks where specificity and safety trade off in molecular space.
If predictive toxicity models align with wet-lab results, the framework would cut early-stage filtering costs in peptide drug pipelines.
Extending the drift to multi-objective repulsion (for example, adding off-target or stability penalties) would require only additional repulsion terms.

Load-bearing premise

A warm-up phase can separate overlapping binding and toxicity features in the embedding space so that one drift step improves both properties at once.

What would settle it

Generated peptides show no measurable drop in predicted toxicity or hemolysis scores relative to baselines that lack the repulsion term, or they lose the target binding signal.

Figures

Figures reproduced from arXiv: 2606.27824 by Hikaru Shindo, Jun Jin Choong, Kaushalya Madhawa, Keisuke Ozawa, Takashi Fujiwara.

**Figure 1.** Figure 1: Pepti-drift resolves the binding-toxicity overlap in peptide space. (Left) The sequence features that promote [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: Overview of the Pepti-drift framework. A target antigen sequence is encoded by a frozen ESM-2 model. The [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Warm-up enables stable positive attraction and negative avoidance [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Training dynamics of toxicity-aware latent drift. [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

**Figure 5.** Figure 5: Cosine-based validation of learned latent drift directions. Cosine similarities were computed between the [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

**Figure 6.** Figure 6: Length-stratified PeptiVerse prediction profiles for generated peptides. Mean predicted hemolysis risk, toxicity [PITH_FULL_IMAGE:figures/full_fig_p013_6.png] view at source ↗

read the original abstract

Peptides are a promising therapeutic modality that combine the chemical tunability of small molecules with the target specificity of macromolecular therapeutics. However, designing antigen-specific binding peptides while avoiding toxicity remains a major challenge for therapeutic peptide discovery. Here, we present Pepti-drift, a toxicity-aware latent refinement framework that generates peptide candidates through a single antigen-conditioned drift step. In a peptide embedding space, Pepti-drift learns to attract generated peptide latents toward antigen-matched binding peptides while repelling them from toxicity-associated regions. This is challenging because binding-promoting physicochemical features often overlap with toxicity-associated features in peptide representation space. To address this, we introduce a warm-up strategy to stabilize this competing objective by first learning binding-oriented attraction and then increasing toxicity repulsion. Pepti-drift achieves highly efficient generation, running 16.2-fold faster than PepMLM and 1,092.0-fold faster than PepTune. Generated peptides show 100% validity, 98.1% uniqueness, the highest sequence diversity, and near-zero cross-antigen reuse. Further evaluation indicates consistently reduced toxicity and hemolysis risk across most peptide-length ranges while retaining target-related predictive binding signal. Pepti-drift thus provides a fast, scalable, and controllable framework for antigen-specific peptide design that directly encodes safe-and-active properties.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Pepti-drift claims a single-drift toxicity-repulsive generator for antigen peptides after warm-up, but the abstract gives no evidence that the warm-up actually separates the overlapping features.

read the letter

The main point here is that Pepti-drift introduces a latent-space method that attracts peptides toward antigen-matched binders while repelling toxicity regions in one drift step, using a warm-up phase to manage the overlap between those signals, and it reports large speed gains plus strong validity and diversity numbers.

The new piece is the explicit combination of antigen-conditioned attraction, toxicity repulsion, and the staged warm-up to stabilize the competing objectives. The paper does a reasonable job framing the practical problem in therapeutic peptide design and showing downstream results that look promising on paper: 16-fold and 1000-fold speedups, 100% validity, high uniqueness and diversity, near-zero cross-antigen reuse, and lower toxicity and hemolysis across length ranges while keeping some binding signal.

The soft spots are in the middle. The abstract itself notes that binding and toxicity features overlap in the embedding space, then claims the warm-up fixes it so a single drift works. Yet there are no embedding visualizations, no distance measurements between clusters, and no ablation that removes the warm-up to test whether it is actually doing the disentangling. All the reported wins are final metrics that could arise from other parts of the pipeline. Dataset details, the base embedding model, and the exact drift implementation are also missing, so the claims cannot be checked.

This is aimed at people building generative tools for peptide therapeutics who need fast, controllable sampling. A reader in that niche could pick up the high-level idea, but the work stays at the level of a teaser without the technical controls.

It would be worth sending to peer review only if the full paper supplies the missing ablations, embedding checks, and reproducibility details; otherwise the central assumption stays untested and the efficiency claims rest on faith.

Referee Report

2 major / 2 minor

Summary. The paper introduces Pepti-drift, a toxicity-aware latent refinement framework for antigen-conditioned discrete peptide generation. It performs a single drift step in peptide embedding space that attracts latents toward antigen-matched binding peptides while repelling toxicity-associated regions; a warm-up strategy (initial binding-oriented attraction followed by increased toxicity repulsion) is used to stabilize the objective given acknowledged feature overlap. The method is reported to achieve 16.2-fold and 1,092-fold speedups over PepMLM and PepTune respectively, with 100% validity, 98.1% uniqueness, highest sequence diversity, near-zero cross-antigen reuse, and reduced toxicity/hemolysis across peptide lengths while retaining binding signal.

Significance. If the central mechanism is verified, the work would represent a meaningful advance in computational therapeutic peptide design by offering a fast, scalable, and directly controllable approach that encodes both activity and safety constraints in a single latent-space operation, potentially reducing the need for post-hoc filtering in peptide discovery pipelines.

major comments (2)

[Methods (warm-up and drift description)] The central claim that a single antigen-conditioned drift step (after warm-up) simultaneously achieves binding attraction and toxicity repulsion rests on the unverified assumption that the warm-up produces separable directions in latent space despite acknowledged feature overlap. No embedding visualizations, inter-cluster distance metrics, or ablation removing the warm-up phase are supplied to test this assumption, which directly underpins the efficiency and safety claims.
[Results (quantitative evaluation)] Performance numbers (16.2-fold and 1,092-fold speedups, 100% validity, 98.1% uniqueness) are stated without accompanying dataset details, training protocol, or verification steps that would allow assessment of whether the drift mechanism, rather than data-driven fitting, supports the outcomes.

minor comments (2)

[Abstract] The abstract states 'near-zero cross-antigen reuse' without defining the reuse metric or the numerical threshold applied.
[Introduction] Notation for the embedding space and drift operator could be introduced earlier with explicit mathematical definitions to improve readability for readers outside the immediate subfield.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their detailed and constructive comments on our manuscript. We address each of the major comments below and indicate the revisions we plan to make.

read point-by-point responses

Referee: [Methods (warm-up and drift description)] The central claim that a single antigen-conditioned drift step (after warm-up) simultaneously achieves binding attraction and toxicity repulsion rests on the unverified assumption that the warm-up produces separable directions in latent space despite acknowledged feature overlap. No embedding visualizations, inter-cluster distance metrics, or ablation removing the warm-up phase are supplied to test this assumption, which directly underpins the efficiency and safety claims.

Authors: We agree that direct evidence for the separability of binding attraction and toxicity repulsion directions in the latent space would strengthen the central claim. Although the empirical results on peptide validity, diversity, and reduced toxicity provide supporting evidence for the effectiveness of the warm-up strategy, we acknowledge the value of additional analyses. In the revised manuscript, we will include embedding visualizations (e.g., t-SNE plots), inter-cluster distance metrics, and an ablation study that removes the warm-up phase to directly test this assumption. revision: yes
Referee: [Results (quantitative evaluation)] Performance numbers (16.2-fold and 1,092-fold speedups, 100% validity, 98.1% uniqueness) are stated without accompanying dataset details, training protocol, or verification steps that would allow assessment of whether the drift mechanism, rather than data-driven fitting, supports the outcomes.

Authors: The Methods section provides details on the datasets, model architecture, and training procedures used to obtain these performance metrics. To better demonstrate that the outcomes are attributable to the drift mechanism, we will expand the Results and Methods sections with additional verification steps, such as comparisons to baseline models without the drift component and more detailed reporting of experimental protocols. revision: yes

Circularity Check

0 steps flagged

No circularity detected; method is empirical proposal without self-referential derivation

full rationale

The paper presents Pepti-drift as a proposed latent refinement framework using attraction/repulsion in embedding space plus a warm-up strategy. No equations, uniqueness theorems, or first-principles derivations are shown that reduce to inputs by construction. Claims rest on downstream empirical metrics (validity, diversity, toxicity reduction) rather than any fitted parameter renamed as prediction or self-citation chain. The warm-up is introduced as a design choice to address acknowledged feature overlap, not derived from prior self-work. This is a standard ML method description with independent evaluation, yielding no load-bearing circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the approach appears to rest on standard latent-space generative modeling assumptions whose details are not stated.

pith-pipeline@v0.9.1-grok · 5783 in / 1123 out tokens · 33816 ms · 2026-06-30T10:00:33.985461+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

27 extracted references · 1 canonical work pages · 1 internal anchor

[1]

Nature Biotechnology , year=

Target sequence-conditioned design of peptide binders using masked language modeling , author=. Nature Biotechnology , year=
[2]

Tang, Sophia and Zhang, Yinuo and Chatterjee, Pranam , booktitle=
[3]

Generative Modeling via Drifting

Generative Modeling via Drifting , author=. arXiv preprint arXiv:2602.04770 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[4]

Science , volume=

Evolutionary-scale prediction of atomic-level protein structure with a language model , author =. Science , volume=. 2023 , publisher=

2023
[5]

International Conference on Learning Representations , year=

Flow Matching for Generative Modeling , author=. International Conference on Learning Representations , year=
[6]

International Conference on Learning Representations , year=

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow , author=. International Conference on Learning Representations , year=
[7]

International Conference on Machine Learning , year=

Consistency Models , author=. International Conference on Machine Learning , year=
[8]

NeurIPS Workshop on Deep Generative Models and Downstream Applications , year=

Classifier-Free Diffusion Guidance , author=. NeurIPS Workshop on Deep Generative Models and Downstream Applications , year=
[9]

International Conference on Learning Representations , volume=

Dynamic negative guidance of diffusion models , author=. International Conference on Learning Representations , volume=
[10]

European Conference on Computer Vision , year=

Compositional Visual Generation with Composable Diffusion Models , author=. European Conference on Computer Vision , year=
[11]

Nature Biomedical Engineering , volume=

Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations , author=. Nature Biomedical Engineering , volume=
[12]

International Conference on Learning Representations , year=

Non-Autoregressive Neural Machine Translation , author=. International Conference on Learning Representations , year=
[13]

Zhu, Ning and Ming, Yanyu and Zhang, Chengyun and Cao, Sen and Li, Chongyang and Guo, Jingjing and Duan, Hongliang , journal=
[14]

Zhang, Jun and Zhou, Yangyang and Zhu, Tiantian and Zhu, Zexuan , booktitle=
[15]

Nucleic Acids Research , volume =

DRAMP 4.0: an open-access data repository dedicated to the clinical translation of antimicrobial peptides , author =. Nucleic Acids Research , volume =
[16]

Rathore, Anand Singh and Choudhury, Shubham and Arora, Akanksha and Tijare, P. A. and Raghava, Gajendra P. S. , journal=
[17]

Nucleic acids research , volume=

Hemolytik: a database of experimentally determined hemolytic and non-hemolytic peptides , author=. Nucleic acids research , volume=. 2014 , publisher=

2014
[18]

Bioinformatics , volume =

Li, Weizhong and Godzik, Adam , title =. Bioinformatics , volume =
[19]

Hancock, Robert E. W. and Sahl, Hans-Georg , title =. Nature Biotechnology , volume =
[20]

and Hiss, Jan A

Fjell, Christopher D. and Hiss, Jan A. and Hancock, Robert E. W. and Schneider, Gisbert , title =. Nature Reviews Drug Discovery , volume =
[21]

Scientific Reports , volume=

A Web Server and Mobile App for Computing Hemolytic Potency of Peptides , author=. Scientific Reports , volume=
[22]

and Hewage, Chandralal M

Timmons, Patrick B. and Hewage, Chandralal M. , journal=
[23]

Protein Science , volume=

Structure-aware deep learning model for peptide toxicity prediction , author =. Protein Science , volume=
[24]

Communications Biology , volume=

Prediction of hemolytic peptides and their hemolytic concentration , author=. Communications Biology , volume=
[25]

bioRxiv , year=

PeptiVerse: A Unified Platform for Therapeutic Peptide Property Prediction , author=. bioRxiv , year=
[26]

The Journal of Physical Chemistry Letters , volume=

PeptideBERT: A Language Model Based on Transformers for Peptide Property Prediction , author=. The Journal of Physical Chemistry Letters , volume=
[27]

Peptide-

Shah, Aayush and Guntuboina, Chakradhar and Barati Farimani, Amir , journal=. Peptide-

[1] [1]

Nature Biotechnology , year=

Target sequence-conditioned design of peptide binders using masked language modeling , author=. Nature Biotechnology , year=

[2] [2]

Tang, Sophia and Zhang, Yinuo and Chatterjee, Pranam , booktitle=

[3] [3]

Generative Modeling via Drifting

Generative Modeling via Drifting , author=. arXiv preprint arXiv:2602.04770 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[4] [4]

Science , volume=

Evolutionary-scale prediction of atomic-level protein structure with a language model , author =. Science , volume=. 2023 , publisher=

2023

[5] [5]

International Conference on Learning Representations , year=

Flow Matching for Generative Modeling , author=. International Conference on Learning Representations , year=

[6] [6]

International Conference on Learning Representations , year=

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow , author=. International Conference on Learning Representations , year=

[7] [7]

International Conference on Machine Learning , year=

Consistency Models , author=. International Conference on Machine Learning , year=

[8] [8]

NeurIPS Workshop on Deep Generative Models and Downstream Applications , year=

Classifier-Free Diffusion Guidance , author=. NeurIPS Workshop on Deep Generative Models and Downstream Applications , year=

[9] [9]

International Conference on Learning Representations , volume=

Dynamic negative guidance of diffusion models , author=. International Conference on Learning Representations , volume=

[10] [10]

European Conference on Computer Vision , year=

Compositional Visual Generation with Composable Diffusion Models , author=. European Conference on Computer Vision , year=

[11] [11]

Nature Biomedical Engineering , volume=

Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations , author=. Nature Biomedical Engineering , volume=

[12] [12]

International Conference on Learning Representations , year=

Non-Autoregressive Neural Machine Translation , author=. International Conference on Learning Representations , year=

[13] [13]

Zhu, Ning and Ming, Yanyu and Zhang, Chengyun and Cao, Sen and Li, Chongyang and Guo, Jingjing and Duan, Hongliang , journal=

[14] [14]

Zhang, Jun and Zhou, Yangyang and Zhu, Tiantian and Zhu, Zexuan , booktitle=

[15] [15]

Nucleic Acids Research , volume =

DRAMP 4.0: an open-access data repository dedicated to the clinical translation of antimicrobial peptides , author =. Nucleic Acids Research , volume =

[16] [16]

Rathore, Anand Singh and Choudhury, Shubham and Arora, Akanksha and Tijare, P. A. and Raghava, Gajendra P. S. , journal=

[17] [17]

Nucleic acids research , volume=

Hemolytik: a database of experimentally determined hemolytic and non-hemolytic peptides , author=. Nucleic acids research , volume=. 2014 , publisher=

2014

[18] [18]

Bioinformatics , volume =

Li, Weizhong and Godzik, Adam , title =. Bioinformatics , volume =

[19] [19]

Hancock, Robert E. W. and Sahl, Hans-Georg , title =. Nature Biotechnology , volume =

[20] [20]

and Hiss, Jan A

Fjell, Christopher D. and Hiss, Jan A. and Hancock, Robert E. W. and Schneider, Gisbert , title =. Nature Reviews Drug Discovery , volume =

[21] [21]

Scientific Reports , volume=

A Web Server and Mobile App for Computing Hemolytic Potency of Peptides , author=. Scientific Reports , volume=

[22] [22]

and Hewage, Chandralal M

Timmons, Patrick B. and Hewage, Chandralal M. , journal=

[23] [23]

Protein Science , volume=

Structure-aware deep learning model for peptide toxicity prediction , author =. Protein Science , volume=

[24] [24]

Communications Biology , volume=

Prediction of hemolytic peptides and their hemolytic concentration , author=. Communications Biology , volume=

[25] [25]

bioRxiv , year=

PeptiVerse: A Unified Platform for Therapeutic Peptide Property Prediction , author=. bioRxiv , year=

[26] [26]

The Journal of Physical Chemistry Letters , volume=

PeptideBERT: A Language Model Based on Transformers for Peptide Property Prediction , author=. The Journal of Physical Chemistry Letters , volume=

[27] [27]

Peptide-

Shah, Aayush and Guntuboina, Chakradhar and Barati Farimani, Amir , journal=. Peptide-