Structchart: Perception, structuring, reasoning for visual chart understanding

Renqiu Xia, Haoyang Peng, Hancheng Ye, Mingsheng Li, Xiangchao Yan, Peng Ye, Botian Shi, Yu Qiao, Junchi Yan, Bo Zhang · 2024 · arXiv 2309.11268

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

EpiCurveBench: Evaluating VLMs on Epidemic Curve Digitization

cs.CL · 2026-05-26 · unverdicted · novelty 7.0

EpiCurveBench supplies 1,000 epidemic curve images and ECS metric shows top VLMs reach only 52.3% while correlating 1.5-3.6 times more strongly than DTW with downstream epidemiological statistics.

Making Multimodal LLMs Reliable Chart Data Extractors: A Benchmark and Training Framework

cs.HC · 2026-06-29 · unverdicted · novelty 6.0

Introduces a benchmark for MLLM-based chart data extraction from unlabeled images and a human-centered training framework that reaches SOTA numerical accuracy with a 7B model.

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

cs.CV · 2024-12-06 · unverdicted · novelty 6.0

InternVL 2.5 is the first open-source MLLM to surpass 70% on the MMMU benchmark via model, data, and test-time scaling, with a 3.7-point gain from chain-of-thought reasoning.

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

cs.MM · 2024-10-28 · unverdicted · novelty 3.0

Survey proposing a taxonomy for document parsing into pipeline-based systems and VLM-driven unified models, reviewing components, metrics, benchmarks, and challenges.

citing papers explorer

Showing 4 of 4 citing papers.

EpiCurveBench: Evaluating VLMs on Epidemic Curve Digitization cs.CL · 2026-05-26 · unverdicted · none · ref 12
EpiCurveBench supplies 1,000 epidemic curve images and ECS metric shows top VLMs reach only 52.3% while correlating 1.5-3.6 times more strongly than DTW with downstream epidemiological statistics.
Making Multimodal LLMs Reliable Chart Data Extractors: A Benchmark and Training Framework cs.HC · 2026-06-29 · unverdicted · none · ref 74
Introduces a benchmark for MLLM-based chart data extraction from unlabeled images and a human-centered training framework that reaches SOTA numerical accuracy with a 7B model.
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024-12-06 · unverdicted · none · ref 264
InternVL 2.5 is the first open-source MLLM to surpass 70% on the MMMU benchmark via model, data, and test-time scaling, with a 3.7-point gain from chain-of-thought reasoning.
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction cs.MM · 2024-10-28 · unverdicted · none · ref 261
Survey proposing a taxonomy for document parsing into pipeline-based systems and VLM-driven unified models, reviewing components, metrics, benchmarks, and challenges.

Structchart: Perception, structuring, reasoning for visual chart understanding

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer