The Llama 3 Herd of Models

Aaron Grattafiori et al · 2024

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Rethinking Vacuity for OOD Detection in Evidential Deep Learning

cs.AI · 2026-05-07 · accept · novelty 7.0

Vacuity-based OOD detection in evidential deep learning is highly sensitive to class cardinality differences between ID and OOD, which can artificially inflate AUROC and AUPR without any change in model predictions.

PathCal: State-Aware Reflection-Marker Calibration for Efficient Reasoning

cs.AI · 2026-05-21 · unverdicted · novelty 6.0

PathCal calibrates reasoning paths by type-aware soft rebalancing of reflection-marker logits at uncertain states, yielding better efficiency-performance trade-offs on six benchmarks.

Geometry-Calibrated Conformal Abstention for Language Models

cs.CL · 2026-04-30 · unverdicted · novelty 6.0

Geometry-calibrated conformal abstention lets language models abstain from uncertain queries with finite-sample guarantees on both participation rate and conditional correctness of answers.

Manifold of Failure: Behavioral Attraction Basins in Language Models

cs.LG · 2026-02-25 · unverdicted · novelty 6.0

MAP-Elites maps continuous vulnerability topologies in three LLMs, achieving up to 63% behavioral coverage and 370 niches with model-specific signatures that existing attack methods cannot provide.

citing papers explorer

Showing 4 of 4 citing papers.

Rethinking Vacuity for OOD Detection in Evidential Deep Learning · cs.AI · 2026-05-07 · accept · none · ref 15
PathCal: State-Aware Reflection-Marker Calibration for Efficient Reasoning · cs.AI · 2026-05-21 · unverdicted · none · ref 20
Geometry-Calibrated Conformal Abstention for Language Models · cs.CL · 2026-04-30 · unverdicted · none · ref 51
Manifold of Failure: Behavioral Attraction Basins in Language Models · cs.LG · 2026-02-25 · unverdicted · none · ref 5
