LaME performs latent multimodal embedding reasoning with K learnable reason tokens in a weakly supervised information bottleneck, matching some explicit CoT models while running 60x faster.
arXiv preprint arXiv:2602.10229 , year=
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
LaME: Learning to Think in Latent Space for Multimodal Embedding via Information Bottleneck
LaME performs latent multimodal embedding reasoning with K learnable reason tokens in a weakly supervised information bottleneck, matching some explicit CoT models while running 60x faster.