In: The Thirteenth International Conference on Learning Representations (2025)

Xie, J · 2025

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

How Far Are Video Models from True Multimodal Reasoning?

cs.CV · 2026-04-21 · unverdicted · novelty 6.0

Current video models succeed on basic understanding but achieve under 25% success on logically grounded generation and near 0% on interactive generation, exposing gaps in multimodal reasoning.

HyLaR: Hybrid Latent Reasoning with Decoupled Policy Optimization

cs.CV · 2026-04-22

citing papers explorer

Showing 2 of 2 citing papers.

How Far Are Video Models from True Multimodal Reasoning? cs.CV · 2026-04-21 · unverdicted · none · ref 80
Current video models succeed on basic understanding but achieve under 25% success on logically grounded generation and near 0% on interactive generation, exposing gaps in multimodal reasoning.
HyLaR: Hybrid Latent Reasoning with Decoupled Policy Optimization cs.CV · 2026-04-22 · unreviewed · ref 40

In: The Thirteenth International Conference on Learning Representations (2025)

fields

years

verdicts

representative citing papers

citing papers explorer