Lokmanoglu and Dror Walter

Ayse D · 2025 · arXiv 2458.2025

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

MMTM: Tri-Modal Topic Modeling for Long-Form Video via Similarity-Gated Fusion

cs.LG · 2026-05-28 · unverdicted · novelty 5.0

MMTM improves topic coherence and temporal stability in long-form video by tri-modal similarity-gated fusion of speech, audio, and visual embeddings with BERTopic, shown on German and English news datasets with released code and corpus.

citing papers explorer

Showing 1 of 1 citing paper.

MMTM: Tri-Modal Topic Modeling for Long-Form Video via Similarity-Gated Fusion cs.LG · 2026-05-28 · unverdicted · none · ref 17
