pith. sign in

Title resolution pending

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

fields

cs.CL 2 cs.AI 1

clear filters

representative citing papers

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

cs.AI · 2023-11-20 · accept · novelty 7.0

GPQA is a new graduate-level benchmark where PhD experts score 65% (74% after corrections), skilled non-experts score 34% with web access, and GPT-4 scores 39%, intended to enable realistic tests of human supervision over superhuman AI.

citing papers explorer

Showing 2 of 2 citing papers after filters.