Releases a large real-world dataset of dirty postal addresses with ground truth for benchmarking data cleaning algorithms.
Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker, and Nan Tang
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.DB 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Clean Me If You Can: A Large Collection of Real-World Addresses for Data Cleaning Benchmarking
Releases a large real-world dataset of dirty postal addresses with ground truth for benchmarking data cleaning algorithms.