Atlas: A high-difficulty, multidisciplinary benchmark for frontier scientific reasoning.arXiv preprint arXiv:2511.14366, 2025

Hongwei Liu, Junnan Liu, Shudong Liu, Haodong Duan, Yuqiang Li, Mao Su, Xiaohong Liu, Guangtao Zhai, Xinyu Fang, Qianhong Ma, et al · 2025 · arXiv 2511.14366

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

SCI-PRM: A Tool Aware Process Reward Model for Scientific Reasoning Verification

cs.AI · 2026-06-03 · unverdicted · novelty 6.0

Sci-PRM is a tool-aware process reward model trained on the SCIPRM70K dataset to provide fine-grained supervision for scientific reasoning and shown to boost foundation models via Best-of-N selection and RL.

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

ResearchClawBench is a new benchmark that evaluates autonomous AI research agents on 40 tasks grounded in published papers using expert rubrics, finding that top systems score only 20-26 out of 100.

citing papers explorer

Showing 2 of 2 citing papers after filters.

SCI-PRM: A Tool Aware Process Reward Model for Scientific Reasoning Verification cs.AI · 2026-06-03 · unverdicted · none · ref 36
Sci-PRM is a tool-aware process reward model trained on the SCIPRM70K dataset to provide fine-grained supervision for scientific reasoning and shown to boost foundation models via Best-of-N selection and RL.
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research cs.LG · 2026-05-28 · unverdicted · none · ref 31
ResearchClawBench is a new benchmark that evaluates autonomous AI research agents on 40 tasks grounded in published papers using expert rubrics, finding that top systems score only 20-26 out of 100.

Atlas: A high-difficulty, multidisciplinary benchmark for frontier scientific reasoning.arXiv preprint arXiv:2511.14366, 2025

fields

years

verdicts

representative citing papers

citing papers explorer