Benchmarking and Improving Text-to-SQL Generation under Ambiguity

Bhaskar, Adithya, Tomar, Tushar, Sathe, Ashutosh, Sarawagi, Sunita · 2023 · DOI 10.18653/v1/2023.emnlp-main.436

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

SPENCE: A Syntactic Probe for Detecting Contamination in NL2SQL Benchmarks

cs.CL · 2026-04-20 · unverdicted · novelty 6.0

SPENCE shows that performance on older NL2SQL benchmarks like Spider is highly sensitive to syntactic changes, indicating likely training contamination, while newer ones like BIRD show little sensitivity and appear largely clean.

citing papers explorer

Showing 1 of 1 citing paper.

SPENCE: A Syntactic Probe for Detecting Contamination in NL2SQL Benchmarks

cs.CL · 2026-04-20 · unverdicted · none · ref 37
