ImageBind: One embedding space to bind them all

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

eess.AS 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Audio-Image Cross-Modal Retrieval with Onomatopoeic Images

eess.AS · 2026-05-17 · unverdicted · novelty 6.0

Introduces a cross-modal retrieval framework using modality-specific projection heads on CLIP and CLAP embeddings together with the new MIAO dataset of 50 sound event classes for onomatopoeic image-sound pairs.

citing papers explorer

Showing 1 of 1 citing paper.

  • Audio-Image Cross-Modal Retrieval with Onomatopoeic Images · eess.AS · 2026-05-17 · unverdicted · none · ref 5