This section describes the tooling developed to track repositories through their lifecycle and to facilitate structured interaction with contributing partners.

Data Management, Accounting System

As the scale of the collection effort grew, the need for a robust data management system became apparent.

representative citing papers

CIDR: A Large-Scale Industrial Source Code Dataset for Software Engineering Research

cs.SE · 2026-05-12 · unverdicted · novelty 8.0

CIDR is a large-scale curated dataset of proprietary industrial source code repositories spanning 138 languages and 373 million lines of code, collected via formal agreements with industry partners.
