All figures and extended tables referenced below are available in Appendix A

Dataset Statistics, Analysis This section provides a quantitative characterization of the accepted repositories in CIDR · 2011

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

CIDR: A Large-Scale Industrial Source Code Dataset for Software Engineering Research

cs.SE · 2026-05-12 · unverdicted · novelty 8.0

CIDR is a large-scale curated dataset of proprietary industrial source code repositories spanning 138 languages and 373 million lines of code, collected via formal agreements with industry partners.

citing papers explorer

Showing 1 of 1 citing paper.

CIDR: A Large-Scale Industrial Source Code Dataset for Software Engineering Research cs.SE · 2026-05-12 · unverdicted · none · ref 5
CIDR is a large-scale curated dataset of proprietary industrial source code repositories spanning 138 languages and 373 million lines of code, collected via formal agreements with industry partners.

All figures and extended tables referenced below are available in Appendix A

fields

years

verdicts

representative citing papers

citing papers explorer