TVQA+: Spatio-Temporal Grounding for Video Question Answering

Lei, Jie, Yu, Licheng, Berg, Tamara, Bansal, Mohit · 2020 · DOI 10.18653/v1/2020.acl-main.730

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

CCTVBench: Contrastive Consistency Traffic VideoQA Benchmark for Multimodal LLMs

cs.CV · 2026-04-22 · unverdicted · novelty 8.0

CCTVBench exposes a large gap between standard QA accuracy and contrastive consistency in traffic video reasoning for multimodal LLMs and introduces C-TCD to narrow that gap.

citing papers explorer

Showing 1 of 1 citing paper.

CCTVBench: Contrastive Consistency Traffic VideoQA Benchmark for Multimodal LLMs

cs.CV · 2026-04-22 · unverdicted · none · ref 26
