PatenTEB: A Comprehensive Benchmark and Model Family for Patent Text Embedding

PatenTEB: A comprehensive benchmark for patent text embeddings · 2025 · arXiv 2510.22264

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Citation-Driven Multi-View Training for Patent Embeddings: QaECTER and Sophia-Bench

cs.IR · 2026-04-24 · unverdicted · novelty 7.0

QaECTER sets new state-of-the-art patent retrieval performance on the new Sophia-Bench benchmark and an external test, outperforming a 23x larger general model and all prior patent-specific models using citation-driven training.

Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering

cs.IR · 2026-05-22 · unverdicted · novelty 5.0

Multi-task evaluation of 22 patent embedding models finds task-specific fine-tuning benefits and significant cross-landscape retrieval degradation that cannot be fixed by hybrid fusion.

citing papers explorer

Citation-Driven Multi-View Training for Patent Embeddings: QaECTER and Sophia-Bench
cs.IR · 2026-04-24 · unverdicted · none · ref 11
Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering
cs.IR · 2026-05-22 · unverdicted · none · ref 1
