ALCUNA : Large Language Models Meet New Knowledge

Yin, Xunjian, Huang, Baizhou, Wan, Xiaojun · 2023 · DOI 10.18653/v1/2023.emnlp-main.87

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

PhantomBench: Benchmarking the Non-existential Threat of Language Models

cs.CL · 2026-06-09 · unverdicted · novelty 7.0

PhantomBench is a new benchmark of 60K+ non-existent terms showing language models hallucinate at rates up to 86.7 percent even when inputs assume the concepts exist.

citing papers explorer

Showing 1 of 1 citing paper.

PhantomBench: Benchmarking the Non-existential Threat of Language Models cs.CL · 2026-06-09 · unverdicted · none · ref 45
PhantomBench is a new benchmark of 60K+ non-existent terms showing language models hallucinate at rates up to 86.7 percent even when inputs assume the concepts exist.

ALCUNA : Large Language Models Meet New Knowledge

fields

years

verdicts

representative citing papers

citing papers explorer