Evaluating the Impact of Using GRASP Framework on Clinicians and Healthcare Professionals Decisions in Selecting Clinical Predictive Tools

Blanca Gallego; Farah Magrabi; Mohamed Khalifa

arxiv: 1907.11523 · v1 · pith:WYM7VZ2Jnew · submitted 2019-07-25 · 💻 cs.CY

Evaluating the Impact of Using GRASP Framework on Clinicians and Healthcare Professionals Decisions in Selecting Clinical Predictive Tools

Mohamed Khalifa , Farah Magrabi , Blanca Gallego This is my paper

Pith reviewed 2026-05-24 16:28 UTC · model grok-4.3

classification 💻 cs.CY

keywords GRASP frameworkclinical predictive toolsevidence-based decision makingtool selectionhealthcare professionalsrandomized experimentdecisional conflict

0 comments

The pith

Using the GRASP framework raised clinicians' correct selections of predictive tools by 64 percent in a controlled trial.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Clinicians must choose among many predictive tools that lack head-to-head evidence. The paper introduces GRASP as a structured system that grades tools from published studies and tests whether it changes selection behavior. In an online randomized experiment with 194 participants, those given GRASP chose the objectively best tools more often, relied less on personal guesses or prior experience, and reported lower decisional conflict along with higher confidence. The results indicate that the framework shifts decisions from subjective to evidence-based without adding time.

Core claim

In a randomized controlled experiment, clinicians and healthcare professionals who applied the GRASP framework selected the best predictive tools 64 percent more often than those who did not, increased objective decision making by 32 percent, reduced subjective choices based on guessing or experience, lowered decisional conflict, raised confidence and satisfaction, and rated the framework highly usable.

What carries the argument

GRASP, an evidence-based framework that grades and assesses predictive tools through critical appraisal of published evidence.

If this is right

Clinicians select predictive tools more accurately and objectively when guided by GRASP scores.
Use of GRASP reduces reliance on guessing and prior experience in tool selection.
GRASP lowers decisional conflict and raises reported confidence and satisfaction with choices.
The framework shows high usability and is viewed as useful by most participants.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

GRASP might be embedded in electronic health records to guide tool choices during routine care.
Similar evidence-grading methods could extend to selecting other clinical technologies beyond predictive models.
Widespread adoption could reduce variation in which tools different hospitals choose for the same clinical task.

Load-bearing premise

The two scenarios contain objectively correct best tools that can be identified without using GRASP itself, and survey answers match the decisions clinicians would make with real patients.

What would settle it

A study that tracks actual patient outcomes after clinicians use GRASP to pick tools versus not using it, to check whether higher survey accuracy translates to better clinical results.

read the original abstract

Background. When selecting predictive tools, clinicians and healthcare professionals are challenged with an overwhelming number of tools, most of which have never been evaluated for comparative effectiveness. To overcome this challenge, the authors developed and validated an evidence-based framework for grading and assessment of predictive tools (GRASP), based on the critical appraisal of published evidence. Methods. To examine GRASP impact on professionals decisions, a controlled experiment was conducted through an online survey. Randomising two groups of tools and two scenarios; participants were asked to select the best tools; most validated or implemented, with and without GRASP. A wide group of international participants were invited. Task completion time, rate of correct decisions, rate of objective vs subjective decisions, and level of decisional conflict were measured. Results. Valid responses received were 194. Compared to not using the framework, GRASP significantly increased correct decisions by 64% (T=8.53, p<0.001), increased objective decision making by 32% (T=9.24, p<0.001), and decreased subjective decision making; based on guessing and based on prior knowledge or experience by 20% (T=-5.47, p<0.001) and 8% (T=-2.99, p=0.003) respectively. GRASP significantly decreased decisional conflict; increasing confidence and satisfaction of participants with their decisions by 11% (T=4.27, p<0.001) and 13% (T=4.89, p<0.001) respectively. GRASP decreased task completion time by 52% (T=-0.87, p=0.384). The average system usability scale of GRASP was very good; 72.5%, and 88% of participants found GRASP useful. Discussion and Conclusions. Using GRASP has positively supported and significantly improved evidence-based decision making and increased accuracy and efficiency of selecting predictive tools.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The survey reports big gains from GRASP but the definition of correct decisions looks tied to the same evidence rules the framework uses.

read the letter

The main result is that a randomized online survey of 194 clinicians found GRASP raised correct tool selections by 64 percent, objective choices by 32 percent, and reduced decisional conflict while lifting confidence and satisfaction. Task time dropped but the change was not significant. Usability scored 72.5 on the SUS and 88 percent of users called it useful. These are the first quantitative impact numbers for the framework after its earlier introduction papers. The experiment itself is a standard two-arm comparison across randomized scenarios and tool sets, with clear reporting of t-statistics and p-values on the main outcomes. That supplies usable before-and-after data on how the framework changes stated decisions in a controlled setting. The soft spot is the ground-truth definition for a correct choice. The abstract describes participants picking the most validated or implemented tool, yet gives no sign of an external anchor such as regulatory approval, independent meta-analysis, or citation ranking used to set the right answer ahead of the survey. If that ranking was produced by the same critical-appraisal steps GRASP formalizes, then the no-GRASP arm is being scored against the framework's own logic. The reported lift then partly measures how well people follow GRASP rules rather than an independent gain in decision quality. The stress-test note on this point stands up from the abstract alone. Other issues are smaller: response handling and blinding details are not visible here, and the international sample is a plus but its size and composition need checking. The work shows straightforward design and honest outcome tracking. It is aimed at clinical informatics researchers who evaluate decision-support tools and want empirical numbers on an evidence-grading system. A reader focused on tool selection methods would get concrete data from it. The paper deserves peer review so the authors can show whether the correct labels were fixed independently of GRASP.

Referee Report

2 major / 1 minor

Summary. The paper reports results from a randomized online survey experiment (n=194 valid responses) comparing clinicians' selection of clinical predictive tools with and without the GRASP evidence-appraisal framework. It claims that GRASP use produced a 64% increase in correct decisions (T=8.53, p<0.001), a 32% increase in objective decision-making (T=9.24, p<0.001), reductions in subjective decisions based on guessing or prior experience, lower decisional conflict, higher confidence and satisfaction, and a non-significant reduction in task time, with high usability ratings.

Significance. If the reported improvements are shown to rest on an externally validated definition of correctness, the work would provide useful empirical support for structured frameworks that aim to increase evidence-based selection of predictive tools. The randomized design and quantitative reporting of effect sizes are positive features; the findings could inform implementation of decision-support tools in healthcare informatics if methodological details are clarified.

major comments (2)

[Abstract (Results)] Abstract (Results): The headline claim of a 64% increase in 'correct decisions' (T=8.53, p<0.001) requires an a priori, independent definition of which tool is 'best' (most validated or implemented) in each scenario. The provided text does not state whether this ground truth was fixed using criteria (e.g., regulatory approval or independent meta-analysis) distinct from the evidence-appraisal process that GRASP itself grades; if the two overlap, the measured gain partly reflects alignment with GRASP logic rather than external improvement in decision quality.
[Methods] Methods: The description of randomization of tools and scenarios, participant blinding, and criteria for including the 194 valid responses is too brief to allow verification that the t-tests and p-values are free of selection or response biases.

minor comments (1)

[Abstract (Results)] The abstract reports percentage changes (e.g., 64%, 32%) without stating the baseline rates in the control arm; adding these baselines would improve interpretability of the effect sizes.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and will revise the manuscript to improve clarity on the issues raised.

read point-by-point responses

Referee: [Abstract (Results)] Abstract (Results): The headline claim of a 64% increase in 'correct decisions' (T=8.53, p<0.001) requires an a priori, independent definition of which tool is 'best' (most validated or implemented) in each scenario. The provided text does not state whether this ground truth was fixed using criteria (e.g., regulatory approval or independent meta-analysis) distinct from the evidence-appraisal process that GRASP itself grades; if the two overlap, the measured gain partly reflects alignment with GRASP logic rather than external improvement in decision quality.

Authors: The ground truth for 'correct' decisions was defined a priori in the study design as selection of the tool that is most validated or implemented, based on external indicators such as regulatory approvals and independent validation studies available at the time the scenarios were constructed. This determination preceded the application of GRASP. However, the manuscript text is not explicit on the separation of these criteria from GRASP's grading logic, so we will revise the Methods section to detail exactly how the correct tool was identified for each scenario using criteria independent of GRASP. revision: yes
Referee: [Methods] Methods: The description of randomization of tools and scenarios, participant blinding, and criteria for including the 194 valid responses is too brief to allow verification that the t-tests and p-values are free of selection or response biases.

Authors: We agree the Methods section is too concise on these procedural details. In the revision we will expand the description to specify the randomization mechanism for tools and scenarios, how blinding was implemented for participants, and the exact inclusion/exclusion criteria applied to arrive at the 194 valid responses. This added detail will permit readers to evaluate potential biases in the reported statistical tests. revision: yes

Circularity Check

0 steps flagged

No circularity: randomized empirical comparison with pre-fixed ground truth

full rationale

The paper reports a controlled online survey experiment that randomizes participants to use or not use the GRASP framework when selecting tools in two scenarios. Correct decisions are defined as selection of the pre-determined 'most validated or implemented' tool; objective vs. subjective decisions and decisional conflict are measured via direct survey responses. No equations, fitted parameters, or self-citation chains are used to derive the reported percentages or t-statistics; the outcome definitions do not reduce to GRASP scores by construction. The study is self-contained against external benchmarks of survey-based decision experiments.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the validity of the randomized survey design, the objective identifiability of 'correct' tools in the scenarios, and the appropriateness of t-tests for the measured proportions and times; no free parameters or invented entities are introduced.

axioms (1)

standard math Standard two-sample t-tests are appropriate for comparing decision rates and times between independent groups of this size
The paper reports T statistics and p-values for all primary outcomes.

pith-pipeline@v0.9.0 · 5905 in / 1344 out tokens · 26591 ms · 2026-05-24T16:28:31.438246+00:00 · methodology

Evaluating the Impact of Using GRASP Framework on Clinicians and Healthcare Professionals Decisions in Selecting Clinical Predictive Tools

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)