arXiv preprint arXiv:2201.11706

· 2022 · arXiv 2201.11706

4 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

cs.CL · 2023-04-03 · accept · novelty 8.0

Pythia releases 16 identically trained LLMs with full checkpoints and data tools to study training dynamics, scaling, memorization, and bias in language models.

No Safe Dose: How Training Data Drives Unsafe Image Generation

cs.CV · 2026-05-27 · unverdicted · novelty 6.0

Proportion of unsafe images in training data directly increases unsafe outputs in text-to-image models, independent of absolute count, with complementary risk reduction from safer text encoders.

Bias in the Tails: How Name-conditioned Evaluative Framing in Resume Summaries Destabilizes LLM-based Hiring

cs.CY · 2026-04-21 · unverdicted · novelty 6.0

LLM resume summaries exhibit name-conditioned evaluative bias concentrated in distribution tails, transforming directional harm into symmetric instability that may evade conventional fairness audits.

The Platonic Representation Hypothesis

cs.LG · 2024-05-13 · unverdicted · novelty 5.0

Representations learned by large AI models are converging toward a shared statistical model of reality.

citing papers explorer

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling · cs.CL · 2023-04-03 · accept · none · ref 188
No Safe Dose: How Training Data Drives Unsafe Image Generation · cs.CV · 2026-05-27 · unverdicted · none · ref 23
Bias in the Tails: How Name-conditioned Evaluative Framing in Resume Summaries Destabilizes LLM-based Hiring · cs.CY · 2026-04-21 · unverdicted · none · ref 5
The Platonic Representation Hypothesis · cs.LG · 2024-05-13 · unverdicted · none · ref 246
