DiscoverPhysics is a new benchmark with 22 on-demand N-body simulated worlds where LLM agents design experiments to infer non-standard physics, evaluated via held-out trajectory MSE and LLM-judged explanation quality.
Biodiscoveryagent: An ai agent for designing genetic perturbation experiments
6 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 1polarities
background 1representative citing papers
IR-Agent is a multi-agent LLM framework that emulates expert IR spectral analysis procedures to improve molecular structure elucidation accuracy and adaptability.
A principled reward design for tool selection and application in RL-trained LLMs delivers 17% gains over base models and 15% over SFT across benchmarks.
CLIO agent applies calibrated deference in closed-loop AORFB negolyte design, achieving 90 mV redox potential gain with restored reversibility after hypothesis-driven redesign from phosphonate to sulfonate.
A review paper that surveys AI uses across the food innovation pipeline for sustainable proteins and identifies four strategic priorities for the emerging field.
A survey that deconstructs LLM agent systems via a methodology-centered taxonomy linking design principles to emergent behaviors, applications, and challenges.
citing papers explorer
-
DiscoverPhysics: Benchmarking LLMs for Out-of-the-Box Scientific Thinking
DiscoverPhysics is a new benchmark with 22 on-demand N-body simulated worlds where LLM agents design experiments to infer non-standard physics, evaluated via held-out trajectory MSE and LLM-judged explanation quality.
-
IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra
IR-Agent is a multi-agent LLM framework that emulates expert IR spectral analysis procedures to improve molecular structure elucidation accuracy and adaptability.
-
ToolRL: Reward is All Tool Learning Needs
A principled reward design for tool selection and application in RL-trained LLMs delivers 17% gains over base models and 15% over SFT across benchmarks.
-
Closed-Loop Molecular Design with Calibrated Deference
CLIO agent applies calibrated deference in closed-loop AORFB negolyte design, achieving 90 mV redox potential gain with restored reversibility after hypothesis-driven redesign from phosphonate to sulfonate.
-
Artificial Intelligence for Food Innovation
A review paper that surveys AI uses across the food innovation pipeline for sustainable proteins and identifies four strategic priorities for the emerging field.
-
Large Language Model Agent: A Survey on Methodology, Applications and Challenges
A survey that deconstructs LLM agent systems via a methodology-centered taxonomy linking design principles to emergent behaviors, applications, and challenges.