Author contributions H.N

Gottweis, J · 2026 · DOI 10.1038/s41586-026-10644-y

9 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

cs.CL · 2026-06-23 · unverdicted · novelty 7.0

NatureBench evaluates ten frontier AI coding agents on 90 tasks from Nature papers under web-search-disabled conditions and finds the strongest agent surpasses published SOTA on only 17.8% of tasks, succeeding mainly by translating problems into familiar supervised learning setups.

Closed-loop Auto Research for Molecular Property Prediction: Discovering and Certifying Generalizable Improvements

cs.AI · 2026-06-22 · unverdicted · novelty 6.0

Closed-loop LM-agent auto research finds some transferable gains on molecular property prediction benchmarks via external data but shows non-transfer for model and feature edits selected on validation.

Deterministic Integrity Gates for LLM-Assisted Clinical Manuscript Preparation: An Auditable Biomedical Informatics Architecture

cs.AI · 2026-06-08 · unverdicted · novelty 6.0

Presents MedSci Skills, an open-source toolkit with deterministic integrity gates for verifying LLM-assisted clinical manuscripts against reporting guidelines like STARD, PRISMA, and STROBE.

DN-Hypo-Pipeline: An AI-Driven Workflow for Generating Hypotheses using Large Language Models and Scientific Explanations

cs.AI · 2026-06-07 · unverdicted · novelty 6.0

DN-Hypo-Pipeline operationalizes three philosophy-of-science accounts to direct LLMs toward principle-based hypothesis generation, claims superior performance over direct prompting, and derives two new transformer algorithms from the resulting hypotheses.

Agentic Language-to-Objective Synthesis for Optofluidic Assembly

cs.RO · 2026-05-26 · unverdicted · novelty 6.0

Speak-to-Objective is a modular agentic pipeline that translates spoken or written commands into fully differentiable objective functions for optofluidic microparticle assembly using LLMs, inverse solvers, and experimental platforms.

Ontology-constrained multi-LLM scoring of hypothesis support in the predictive processing literature

q-bio.NC · 2026-05-23 · unverdicted · novelty 6.0

A multi-LLM council scores predictive processing papers on an expert ontology, maps results in 3D hypothesis space, and introduces a dispersion metric showing greater spread in global versus local oddball paradigms.

From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms

cs.LG · 2026-06-12 · unverdicted · novelty 5.0

Human-AI collaboration expanded a meta-idea on rational approximation into sign-embedding quantum algorithms for matrix problems, with humans retaining final judgment on routes and refinements.

Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence

cs.AI · 2026-05-21 · unverdicted · novelty 5.0

Coordinated AI agents improve scientific inference from partial evidence in cross-domain tasks when single sources are incomplete, as demonstrated by AUROC gains in vector-borne disease and exoplanet benchmarks but tied performance in others.

Hephaestus: Toward a Cybersecurity AI Scientist

cs.CR · 2026-06-29 · unverdicted · novelty 4.0

The paper proposes the Cybersecurity AI Scientist as a modular multi-agent architecture for automating cybersecurity research, distinguished by its focus on non-stationary threats and anchored in a four-zeros risk-trust-incident-energy frame.

citing papers explorer

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

cs.CL · 2026-06-23 · unverdicted · none · ref 29

Closed-loop Auto Research for Molecular Property Prediction: Discovering and Certifying Generalizable Improvements

cs.AI · 2026-06-22 · unverdicted · none · ref 19

Deterministic Integrity Gates for LLM-Assisted Clinical Manuscript Preparation: An Auditable Biomedical Informatics Architecture

cs.AI · 2026-06-08 · unverdicted · none · ref 18

DN-Hypo-Pipeline: An AI-Driven Workflow for Generating Hypotheses using Large Language Models and Scientific Explanations

cs.AI · 2026-06-07 · unverdicted · none · ref 7

Agentic Language-to-Objective Synthesis for Optofluidic Assembly

cs.RO · 2026-05-26 · unverdicted · none · ref 46

Ontology-constrained multi-LLM scoring of hypothesis support in the predictive processing literature

q-bio.NC · 2026-05-23 · unverdicted · none · ref 87

From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms

cs.LG · 2026-06-12 · unverdicted · none · ref 14

Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence

cs.AI · 2026-05-21 · unverdicted · none · ref 3

Hephaestus: Toward a Cybersecurity AI Scientist

cs.CR · 2026-06-29 · unverdicted · none · ref 19