ImageBind: One embedding space to bind them all

2023

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Audio-Image Cross-Modal Retrieval with Onomatopoeic Images

eess.AS · 2026-05-17 · unverdicted · novelty 6.0

Introduces a cross-modal retrieval framework using modality-specific projection heads on CLIP and CLAP embeddings together with the new MIAO dataset of 50 sound event classes for onomatopoeic image-sound pairs.

citing papers explorer

Audio-Image Cross-Modal Retrieval with Onomatopoeic Images

eess.AS · 2026-05-17 · unverdicted · none · ref 5