Contrastive fine-tuning of protein language models on Pfam, structural, interaction, and mutational datasets produces embeddings that improve kNN performance on 15-16 of 23 downstream tasks including remote homology detection and structural retrieval.
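The kNN-over-embeddings evaluation protocol described above can be sketched as follows. This is a minimal NumPy illustration only: the embeddings and family labels are synthetic stand-ins, not ProtSent outputs, and `knn_predict` is a hypothetical helper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-ins for contrastively fine-tuned protein embeddings:
# 100 "training" proteins over 4 hypothetical family labels, 10 queries.
train = rng.normal(size=(100, 128))
labels = rng.integers(0, 4, size=100)
queries = rng.normal(size=(10, 128))

def knn_predict(queries, train, labels, k=5):
    """Majority-vote kNN by cosine similarity, as in embedding retrieval."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    t = train / np.linalg.norm(train, axis=1, keepdims=True)
    sims = q @ t.T                           # (n_queries, n_train)
    nn = np.argsort(-sims, axis=1)[:, :k]    # indices of the k nearest neighbors
    votes = labels[nn]                       # (n_queries, k) neighbor labels
    return np.array([np.bincount(v).argmax() for v in votes])

pred = knn_predict(queries, train, labels)   # one predicted family per query
```

Cosine similarity is a common distance choice for contrastively trained embeddings, since the training objective typically normalizes or angularly separates them.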
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets
4 Pith papers cite this work.
4 representative citing papers
STOMP extends direct preference optimization to the multi-objective setting via smooth Tchebysheff scalarization and standardization of observed rewards, achieving highest hypervolume in eight of nine protein engineering evaluations.
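The scalarization named in the STOMP summary can be sketched as below: standardize each reward dimension, then score candidates by a smooth (logsumexp-based) Tchebysheff distance to the per-objective ideal point. This is an illustrative scoring function under assumed conventions, not STOMP's actual training objective; the function name and the temperature `mu` are hypothetical.

```python
import numpy as np
from scipy.special import logsumexp

def smooth_tchebysheff(rewards, weights, mu=0.1):
    """Smooth Tchebysheff scalarization of standardized multi-objective rewards.

    rewards: (n, k) raw rewards for n candidates over k objectives.
    weights: (k,) positive preference weights.
    mu: smoothing temperature; as mu -> 0 this approaches the hard max.
    """
    # Standardize each objective, as the summary's "standardization of
    # observed rewards" suggests.
    z = (rewards - rewards.mean(axis=0)) / (rewards.std(axis=0) + 1e-8)
    # Ideal point: best standardized reward per objective.
    ideal = z.max(axis=0)
    # Hard Tchebysheff score is max_i w_i * (ideal_i - z_i);
    # mu * logsumexp(. / mu) is its standard smooth surrogate.
    shortfall = weights * (ideal - z)
    return mu * logsumexp(shortfall / mu, axis=1)

rewards = np.random.default_rng(1).normal(size=(50, 3))
w = np.array([0.5, 0.3, 0.2])
scores = smooth_tchebysheff(rewards, w)
best = int(scores.argmin())  # lower scalarized shortfall = closer to the ideal
```

The logsumexp surrogate keeps the objective differentiable everywhere, which is what makes Tchebysheff-style scalarization usable inside gradient-based preference optimization.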
A new tree-conditioned edit-flow model for ancestral sequence reconstruction achieves reasonable accuracy on substitution-only evolved sequences and superior localization of changes on natural indel-rich sequences.
Galactica, a science-specialized LLM, reports higher scores than GPT-3, Chinchilla, and PaLM on LaTeX knowledge, mathematical reasoning, and medical QA benchmarks while outperforming general models on BIG-bench.
citing papers explorer
- ProtSent: Protein Sentence Transformers
- Pareto-Optimal Offline Reinforcement Learning via Smooth Tchebysheff Scalarization