Slater, Ali Ziaee and Morgan Nguyen

Gaurav Suri, Lily R · 2023 · arXiv 2305.04400

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

How reliable are LLMs when it comes to playing dice?

cs.CL · 2026-06-05 · unverdicted · novelty 5.0

LLMs score 0.96 on standard probability exercises but 0.59 on counterintuitive ones and drop further with biased wording or misleading cues, indicating they are not genuine probabilistic reasoners.

Don't Look at the Numbers: Visual Anchoring Bias and Layer-wise Representation in VLMs

cs.AI · 2026-05-11 · unverdicted · novelty 5.0

Numeric anchors embedded in images systematically bias VLM quality judgments more than severe visual degradation, with layer-wise probing showing that anchor-saturated layers are suboptimal for quality prediction.

citing papers explorer

Showing 2 of 2 citing papers.

How reliable are LLMs when it comes to playing dice? cs.CL · 2026-06-05 · unverdicted · none · ref 22
LLMs score 0.96 on standard probability exercises but 0.59 on counterintuitive ones and drop further with biased wording or misleading cues, indicating they are not genuine probabilistic reasoners.
Don't Look at the Numbers: Visual Anchoring Bias and Layer-wise Representation in VLMs cs.AI · 2026-05-11 · unverdicted · none · ref 9
Numeric anchors embedded in images systematically bias VLM quality judgments more than severe visual degradation, with layer-wise probing showing that anchor-saturated layers are suboptimal for quality prediction.

Slater, Ali Ziaee and Morgan Nguyen

fields

years

verdicts

representative citing papers

citing papers explorer