Clamp: Contrastive language-music pre-training for cross-modal symbolic music information retrieval

Clamp: Contrastive language-music pretraining for cross-modal symbolic music information retrieval · 2023 · arXiv 2304.11029

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

FIGMA: Towards FIne-Grained Music retrievAl

cs.SD · 2026-06-04 · unverdicted · novelty 6.0

FIGMA proposes a multi-view contrastive architecture plus the FGMCaps dataset to retrieve music from fine-grained textual descriptions of musical attributes, reporting up to 73.3% relative gains over CLAP baselines.

CustomDancer: Customized Dance Recommendation by Text-Dance Retrieval

cs.MM · 2026-05-01 · unverdicted · novelty 6.0

CustomDancer achieves state-of-the-art text-to-dance retrieval with 10.23% Recall@1 on the new TD-Data dataset by aligning text, music, and motion features through a CLIP-based framework.

citing papers explorer

Showing 1 of 1 citing paper after filters.

FIGMA: Towards FIne-Grained Music retrievAl cs.SD · 2026-06-04 · unverdicted · none · ref 13
FIGMA proposes a multi-view contrastive architecture plus the FGMCaps dataset to retrieve music from fine-grained textual descriptions of musical attributes, reporting up to 73.3% relative gains over CLAP baselines.

Clamp: Contrastive language-music pre-training for cross-modal symbolic music information retrieval

fields

years

verdicts

representative citing papers

citing papers explorer