In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25)

Danqing Shi, Yao Wang, Yunpeng Bai, Andreas Bulling, Antti Oulasvirta · 2025 · arXiv 6598.371312

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Making Multimodal LLMs Reliable Chart Data Extractors: A Benchmark and Training Framework

cs.HC · 2026-06-29 · unverdicted · novelty 6.0

Introduces a benchmark for MLLM-based chart data extraction from unlabeled images and a human-centered training framework that reaches SOTA numerical accuracy with a 7B model.

What Is Actually Being Annotated? Inter-Prompt Reliability as a Measurement Problem in LLM-Based Social Science Labeling

cs.CY · 2026-04-02 · unverdicted · novelty 6.0

LLM annotations for social science tasks vary substantially with prompt wording in interpretive cases but become more stable when majority voting is applied across multiple equivalent prompts.

citing papers explorer

Showing 2 of 2 citing papers.

Making Multimodal LLMs Reliable Chart Data Extractors: A Benchmark and Training Framework cs.HC · 2026-06-29 · unverdicted · none · ref 64
Introduces a benchmark for MLLM-based chart data extraction from unlabeled images and a human-centered training framework that reaches SOTA numerical accuracy with a 7B model.
What Is Actually Being Annotated? Inter-Prompt Reliability as a Measurement Problem in LLM-Based Social Science Labeling cs.CY · 2026-04-02 · unverdicted · none · ref 14
LLM annotations for social science tasks vary substantially with prompt wording in interpretive cases but become more stable when majority voting is applied across multiple equivalent prompts.

In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25)

fields

years

verdicts

representative citing papers

citing papers explorer