MedHal-Loc benchmark shows KG-triple hallucination detectors localize errors no better than chance on controlled medical statements due to entity extraction limits, while NLI and consistency methods succeed above chance, and real hallucinations are mostly diffuse conclusion changes.
Medical hallucination in foundation models and their impact on healthcare
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
A literature synthesis that unifies hallucination taxonomies across medical imaging modalities, finds general-purpose foundation models hallucinate less than specialized ones, and maps mitigation to FDA lifecycle frameworks.
The system integrates a Neo4j knowledge graph, four-stage symptom matching with LLM verification, genetic-algorithm-optimized proactive questioning, and multimodal evidence-based visualizations to improve diagnostic transparency and treatment interpretability in TCM, reporting 32% fewer non-standard
citing papers explorer
-
Hallucination in Medical Imaging AI: A Cross-Modality Analytical Framework for Taxonomy, Detection, and Mitigation under Regulatory Constraints
A literature synthesis that unifies hallucination taxonomies across medical imaging modalities, finds general-purpose foundation models hallucinate less than specialized ones, and maps mitigation to FDA lifecycle frameworks.
-
Evidence-Based Intelligent Diagnostic and Therapeutic Visualization System with Large Language Models: Multi-Turn Interaction and Multimodal Treatment Plan Generation
The system integrates a Neo4j knowledge graph, four-stage symptom matching with LLM verification, genetic-algorithm-optimized proactive questioning, and multimodal evidence-based visualizations to improve diagnostic transparency and treatment interpretability in TCM, reporting 32% fewer non-standard