Measurement and Fairness

Jacobs, Abigail Z · 1912 · arXiv 1912.05511

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

When Behavioral Safety Evaluation Fails: A Representation-Level Perspective

cs.LG · 2026-06-06 · unverdicted · novelty 6.0

Behavioral safety metrics for LLMs are insufficient because models can maintain safe outputs while remaining vulnerable to latent-space interventions, as shown via dissociated models and the new Latent Vulnerability Score.

Making AI Evaluation Deployment Relevant Through Context Specification

cs.AI · 2026-03-06 · unverdicted · novelty 4.0

Context specification is a process that turns diffuse stakeholder perspectives into explicit definitions of properties, behaviors, and outcomes to guide context-aware AI evaluations.

citing papers explorer

Showing 2 of 2 citing papers after filters.

When Behavioral Safety Evaluation Fails: A Representation-Level Perspective cs.LG · 2026-06-06 · unverdicted · none · ref 8
Behavioral safety metrics for LLMs are insufficient because models can maintain safe outputs while remaining vulnerable to latent-space interventions, as shown via dissociated models and the new Latent Vulnerability Score.
Making AI Evaluation Deployment Relevant Through Context Specification cs.AI · 2026-03-06 · unverdicted · none · ref 12
Context specification is a process that turns diffuse stakeholder perspectives into explicit definitions of properties, behaviors, and outcomes to guide context-aware AI evaluations.

Measurement and Fairness

fields

years

verdicts

representative citing papers

citing papers explorer