Proceedings of the 25th ACM International Conference on Multimedia

Zhang, Tongtao, Whitehead, Spencer, Zhang, Hanwang, Li, Hongzhi, Ellis, Joseph, Huang, Lifu · 2017 · arXiv 3266.312329

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Evaluation Pitfalls and Challenges in Multimedia Event Extraction

cs.CL · 2026-06-25 · unverdicted · novelty 7.0

A systematic analysis of evaluation practices in multimedia event extraction reveals that minor methodological choices cause large performance swings and overestimation of cross-modal grounding ability.

NEST: Narrative Event Structures in Time for Long Video Understanding

cs.CV · 2026-06-18 · unverdicted · novelty 7.0

NEST is a new benchmark dataset for narrative event structures in long videos, with baselines reporting ETD below 8%, EL under 6%, EAE below 11%, and ERE at 35-44% F1.

EVENT5Ws: A Large Dataset for Open-Domain Event Extraction from Documents

cs.CL · 2026-04-23 · unverdicted · novelty 7.0

EVENT5Ws is a new large-scale, manually verified open-domain event extraction dataset that benchmarks LLMs and demonstrates cross-context generalization.

A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents

cs.CL · 2026-04-23 · unverdicted · novelty 5.0

MODEE is a multimodal system that integrates graphs with LLM embeddings to outperform prior open-domain event extraction methods on large datasets.

citing papers explorer

Showing 4 of 4 citing papers.

Evaluation Pitfalls and Challenges in Multimedia Event Extraction · cs.CL · 2026-06-25 · unverdicted · none · ref 24
NEST: Narrative Event Structures in Time for Long Video Understanding · cs.CV · 2026-06-18 · unverdicted · none · ref 167
EVENT5Ws: A Large Dataset for Open-Domain Event Extraction from Documents · cs.CL · 2026-04-23 · unverdicted · none · ref 118
A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents · cs.CL · 2026-04-23 · unverdicted · none · ref 272
