Delgado-Chaves, Matthew J

ISSN 1091-6490 · 2025 · DOI 10.1073/pnas.2411962122

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Can AI Agents Synthesize Scientific Conclusions?

cs.AI · 2026-06-09 · unverdicted · novelty 7.0

A new benchmark and clean-room harness show frontier AI agents reach only 0.337 factual F1 when synthesizing conclusions from scientific evidence.

Talk is (Not) Cheap: A Taxonomy and Benchmark Coverage Audit for LLM Attacks

cs.CR · 2026-05-14 · unverdicted · novelty 7.0

A new 507-leaf taxonomy and 4x6 Target x Technique matrix audits six LLM attack benchmarks and finds they cover at most 25% of the threat surface with entire STRIDE categories untested.

Thinking Like a Scientist? A Structural Study of LLM-Generated Research Methods

cs.CL · 2026-06-15 · unverdicted · novelty 6.0

LLMs given only research questions from 1000 arXiv CS papers recommend a narrower set of methods than the original papers, with effective model-entity diversity dropping from 1232 to 59-96 and stronger agreement among LLMs than with papers.

Code Sharing In Prediction Model Research: A Scoping Review

cs.SE · 2026-03-16 · accept · novelty 5.0

Only 12.2% of 3,967 eligible prediction model studies share code, with shared repositories frequently lacking dependency specifications and modular structure needed for reproducibility.

citing papers explorer

Code Sharing In Prediction Model Research: A Scoping Review

cs.SE · 2026-03-16 · accept · none · ref 18
