pith. sign in

super hub Canonical reference

Survey of Hallucination in Natural Language Generation

Canonical reference. 88% of citing Pith papers cite this work as background.

104 Pith papers citing it
2,906 external citations · Crossref
Background 88% of classified citations

hub tools

citation-role summary

background 17

citation-polarity summary

claims ledger

  • background [315, 361]. Furthermore, Liu et al. [185], Zong et al. [395] and Liu et al. [184] show that LVLMs can be easily fooled and experience a severe performance drop due to their over-reliance on the strong language prior, as well as its inferior ability to defend against inappropriate user inputs [112, 134]. Jiang et al. [138], Wang et al. [315] and Jing et al. [141] took a step forward to holistically evaluate multi-modal hallucination. What's more, when presented with multiple images, LVLMs sometim

authors

co-cited works

clear filters

representative citing papers

Knowledge Editing in Masked Diffusion Language Models

cs.CL · 2026-06-02 · unverdicted · novelty 7.0

Locate-then-edit succeeds at the same early-to-mid MLP locations in masked diffusion models as in autoregressive models, but requires optimization over intermediate partial-mask states to handle multi-token targets.

Honest Lying: Understanding Memory Confabulation in Reflexive Agents

cs.LG · 2026-05-28 · unverdicted · novelty 7.0

Reflexive agents confabulate incorrect task interpretations in memory, detected via Reflection Repetition Rate metric, with a programmatic mitigation raising correct object mentions from 0% to 86% in frozen ALFWorld cases.

CyberCertBench: Evaluating LLMs in Cybersecurity Certification Knowledge

cs.CR · 2026-04-22 · unverdicted · novelty 7.0

CyberCertBench shows frontier LLMs reach human-expert performance on general IT and networking security but drop on vendor-specific and formal standards questions such as IEC 62443, with a new framework for producing interpretable explanations.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval cs.CL · 2024-01-31 · unverdicted · none · ref 98

    RAPTOR introduces a tree-organized retrieval method using recursive abstractive summaries, achieving a 20% absolute accuracy improvement on the QuALITY benchmark when paired with GPT-4.

  • Corrective Retrieval Augmented Generation cs.CL · 2024-01-29 · unverdicted · none · ref 11

    CRAG improves RAG robustness via a retrieval quality evaluator that triggers web augmentation and a decompose-recompose filter to focus on relevant information, yielding better results on short- and long-form generation tasks.

  • Large Language Models: A Survey cs.CL · 2024-02-09 · accept · none · ref 145

    The paper surveys key large language models, their training methods, datasets, evaluation benchmarks, and future research directions in the field.