pith. machine review for the scientific record.

Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Tool reference. 71% of classified Pith citations use this work as a method, library, or software dependency, not as a substantive claim.

20 Pith papers citing it

citation-role summary

dataset 5 · background 1 · method 1

representative citing papers

ReflectCAP: Detailed Image Captioning with Reflective Memory

cs.AI · 2026-04-14 · unverdicted · novelty 6.0

ReflectCAP distills model-specific hallucination and oversight patterns into Structured Reflection Notes that steer LVLMs toward more factual and complete image captions, reaching the Pareto frontier on factuality-coverage trade-offs.

LLaVA-OneVision: Easy Visual Task Transfer

cs.CV · 2024-08-06 · unverdicted · novelty 5.0

LLaVA-OneVision is the first single open LMM to simultaneously achieve strong performance in single-image, multi-image, and video scenarios with cross-scenario transfer capabilities.

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

cs.CV · 2024-08-03 · conditional · novelty 5.0

MiniCPM-Llama3-V 2.5 delivers GPT-4V-level multimodal performance on phones through architecture, pretraining, and alignment optimizations.
