Goodwin, Sonya E

Abacha, A · 2019 · DOI 10.3233/shti190176

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Can I Take Another Dose? Evaluating LLM Decision-Making Under Temporal Uncertainty in OTC Dosing QA

cs.CL · 2026-06-02 · unverdicted · novelty 6.0

Introduces DOSEBENCH benchmark and shows four LLMs often fail at rolling 24-hour dose calculations and constraint adherence in OTC dosing decisions despite appearing confident.

When Retrieval Doesn't Help: A Large-Scale Study of Biomedical RAG

cs.CL · 2026-06-02 · accept · novelty 6.0

Large-scale evaluation shows retrieval-augmented generation yields only marginal and inconsistent gains (1-2 points) over no-retrieval baselines in biomedical QA, with model choice dominating retriever or corpus effects.

citing papers explorer

Showing 2 of 2 citing papers.

Can I Take Another Dose? Evaluating LLM Decision-Making Under Temporal Uncertainty in OTC Dosing QA cs.CL · 2026-06-02 · unverdicted · none · ref 38
Introduces DOSEBENCH benchmark and shows four LLMs often fail at rolling 24-hour dose calculations and constraint adherence in OTC dosing decisions despite appearing confident.
When Retrieval Doesn't Help: A Large-Scale Study of Biomedical RAG cs.CL · 2026-06-02 · accept · none · ref 5
Large-scale evaluation shows retrieval-augmented generation yields only marginal and inconsistent gains (1-2 points) over no-retrieval baselines in biomedical QA, with model choice dominating retriever or corpus effects.

Goodwin, Sonya E

fields

years

verdicts

representative citing papers

citing papers explorer