Fine-tuned on-device LLMs achieve up to 87.9% diagnostic accuracy on clinical tasks, approaching GPT-5.1 at 89.4% while remaining smaller and local.
Instruction: Given the following task description, the true disease, and the model output, assign a score from 1 to 5 according to the rubric
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2025 (1)
Verdicts: CONDITIONAL (1)
Representative citing papers:
Benchmarking and Adapting On-Device LLMs for Clinical Decision Support