Measuring massive multitask language understanding

Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt · 2021

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

The Translation Tax Is Not a Scalar: A Counterfactual Audit of English-Source Cue Inheritance in Chinese Multilingual Benchmarks

cs.CL · 2026-05-08 · conditional · novelty 7.0

An empirical audit shows the translation tax in English-to-Chinese benchmarks is estimator- and item-dependent rather than a single scalar effect, with a residue dose-response in naturalization tests.

Massive Activations in Large Language Models

cs.CL · 2024-02-27 · unverdicted · novelty 7.0

Massive activations are constant large values in LLMs that function as indispensable bias terms and concentrate attention probabilities on specific tokens.

citing papers explorer

Showing 2 of 2 citing papers.

The Translation Tax Is Not a Scalar: A Counterfactual Audit of English-Source Cue Inheritance in Chinese Multilingual Benchmarks cs.CL · 2026-05-08 · conditional · none · ref 5
An empirical audit shows the translation tax in English-to-Chinese benchmarks is estimator- and item-dependent rather than a single scalar effect, with a residue dose-response in naturalization tests.
Massive Activations in Large Language Models cs.CL · 2024-02-27 · unverdicted · none · ref 124
Massive activations are constant large values in LLMs that function as indispensable bias terms and concentrate attention probabilities on specific tokens.

Measuring massive multitask language understanding

fields

years

verdicts

representative citing papers

citing papers explorer