Can large language models provide useful feedback on research papers? A large-scale empirical analysis

URL https://arxiv · 2023 · arXiv 2310.01783

6 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

AgentReview: Exploring Peer Review Dynamics with LLM Agents

cs.CL · 2024-06-18 · unverdicted · novelty 8.0

AgentReview is the first LLM-based simulation framework for peer review that quantifies a 37.1% decision variation attributable to reviewer biases.

FARS: A Fully Automated Research System Deployed at Scale

cs.AI · 2026-06-30 · unverdicted · novelty 7.0

FARS, deployed at scale, produced 166 AI/ML papers across 67 topics; these received 282 structured human reviews, which indicate some review-worthy outputs alongside recurring failure modes.

FactReview: Evidence-Grounded Reviews with Literature Positioning and Execution-Based Claim Verification

cs.AI · 2026-04-05 · conditional · novelty 7.0

FactReview extracts claims from ML papers, positions them via literature retrieval, and verifies them through code execution, labeling each as Supported, Partially supported, or In conflict, as shown in a CompGCN case study.

From Passive Generation to Investigation: A Proactive Scientific Peer Review Agent

cs.CL · 2026-06-11 · unverdicted · novelty 6.0

ProReviewer is an MDP-formulated proactive peer review agent trained with SFT and RL on an 8B model that outperforms larger frontier LLMs on review quality metrics.

Jagged AI in Scientific Peer Review: Evidence from POMP Data Analysis

stat.AP · 2026-05-08 · unverdicted · novelty 5.0 · 2 refs

AI peer reviewers for POMP analyses show jagged performance: strong on technical error detection and invalid inference but weak on interpretive errors, narrative coherence, and domain-informed critique.

Rejoinder: The ICML 2023 Ranking Experiment: Examining Author Self-Assessment in ML/AI Peer Review

stat.AP · 2026-05-24 · unverdicted · novelty 2.0

A rejoinder organizing responses to discussants into four core themes on statistical modeling, equity, signals, and AI in peer review.

citing papers explorer

AgentReview: Exploring Peer Review Dynamics with LLM Agents · cs.CL · 2024-06-18 · unverdicted · none · ref 18
FARS: A Fully Automated Research System Deployed at Scale · cs.AI · 2026-06-30 · unverdicted · none · ref 13
FactReview: Evidence-Grounded Reviews with Literature Positioning and Execution-Based Claim Verification · cs.AI · 2026-04-05 · conditional · none · ref 7
From Passive Generation to Investigation: A Proactive Scientific Peer Review Agent · cs.CL · 2026-06-11 · unverdicted · none · ref 5
Jagged AI in Scientific Peer Review: Evidence from POMP Data Analysis · stat.AP · 2026-05-08 · unverdicted · none · ref 5 · 2 links
Rejoinder: The ICML 2023 Ranking Experiment: Examining Author Self-Assessment in ML/AI Peer Review · stat.AP · 2026-05-24 · unverdicted · none · ref 6
