An End-to-end Architecture for Collider Physics and Beyond

arXiv 2603.14553 · 2026

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Collider-Bench: Benchmarking AI Agents with Particle Physics Analysis Reproduction

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

Collider-Bench is a new benchmark showing that current LLM agents cannot reliably reproduce LHC analyses at the level of a physicist-in-the-loop.

PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

PRL-Bench evaluates frontier LLMs on 100 real physics research tasks and finds the best models score below 50, exposing a gap to autonomous discovery.

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

cs.AI · 2026-05-19 · unverdicted · novelty 5.0 · 2 refs

AutoResearchClaw introduces a multi-agent research pipeline with debate, self-healing, verifiable outputs, human collaboration modes, and cross-run evolution that outperforms AI Scientist v2 by 54.7% on ARC-Bench.

EasyScan_HEP 2: Agent-Ready Parameter Scans for High-Energy Physics

hep-ph · 2026-06-30 · unverdicted · novelty 4.0

EasyScan_HEP 2 adds AI-agent interfaces to a HEP parameter scan framework for natural-language to .ini config translation and new sampler integration.

citing papers explorer

Showing 4 of 4 citing papers.

Collider-Bench: Benchmarking AI Agents with Particle Physics Analysis Reproduction
cs.LG · 2026-05-13 · unverdicted · none · ref 2

PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research
cs.LG · 2026-04-16 · unverdicted · none · ref 9

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration
cs.AI · 2026-05-19 · unverdicted · none · ref 13 · 2 links

EasyScan_HEP 2: Agent-Ready Parameter Scans for High-Energy Physics
hep-ph · 2026-06-30 · unverdicted · none · ref 17
