A Comparative Study of Quality Evaluation Methods for Text Summarization, June 2024

Huyen Nguyen, Haihua Chen, Lavanya Pobbathi, Junhua Ding · 2024 · arXiv 2407.00747

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

cs.CL · 2025-04-29 · unverdicted · novelty 7.0

The authors generate and publicly release the first large-scale open dataset of three million structured moral fables produced by small open language models together with a reproducible LLM-judge evaluation pipeline.

LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation

cs.CL · 2026-04-28 · unverdicted · novelty 6.0

LLM-ReSum uses LLM self-evaluation in a closed feedback loop to refine summaries, improving factual accuracy by up to 33% and coverage by 39% with 89% human preference.

LongSumEval: Question-Answering Based Evaluation and Feedback-Driven Refinement for Long Document Summarization

cs.CL · 2026-04-28 · unverdicted · novelty 6.0

LongSumEval evaluates long-document summaries via answerability and factual alignment of generated QA pairs, yielding stronger human correlation than prior metrics and enabling iterative self-improvement.

citing papers explorer

Showing 3 of 3 citing papers.

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models cs.CL · 2025-04-29 · unverdicted · none · ref 34
The authors generate and publicly release the first large-scale open dataset of three million structured moral fables produced by small open language models together with a reproducible LLM-judge evaluation pipeline.
LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation cs.CL · 2026-04-28 · unverdicted · none · ref 13
LLM-ReSum uses LLM self-evaluation in a closed feedback loop to refine summaries, improving factual accuracy by up to 33% and coverage by 39% with 89% human preference.
LongSumEval: Question-Answering Based Evaluation and Feedback-Driven Refinement for Long Document Summarization cs.CL · 2026-04-28 · unverdicted · none · ref 7
LongSumEval evaluates long-document summaries via answerability and factual alignment of generated QA pairs, yielding stronger human correlation than prior metrics and enabling iterative self-improvement.

A Comparative Study of Quality Evaluation Methods for Text Summarization, June 2024

fields

years

verdicts

representative citing papers

citing papers explorer