An empirical comparison of model validation techniques for defect prediction models,

· 2017 · arXiv 2016.258405

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Evaluation-Strategy Gap in Fault Diagnosis of Deep Learning Programs

cs.SE · 2026-06-25 · unverdicted · novelty 6.0

Using a corpus of 5542 fault-injected traces from 38 DL programs, the study finds a 0.19 balanced accuracy gap in fault diagnosis between within-program and cross-program evaluation caused by program-specific feature structures.

PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes

cs.SE · 2025-05-12 · conditional · novelty 6.0

Empirical analysis of 338 PRs with self-admitted ChatGPT usage shows low full integration (median 25%), selective adaptation patterns, and broader influence on developer reasoning during reviews.

citing papers explorer

Showing 2 of 2 citing papers.

Evaluation-Strategy Gap in Fault Diagnosis of Deep Learning Programs cs.SE · 2026-06-25 · unverdicted · none · ref 12
Using a corpus of 5542 fault-injected traces from 38 DL programs, the study finds a 0.19 balanced accuracy gap in fault diagnosis between within-program and cross-program evaluation caused by program-specific feature structures.
PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes cs.SE · 2025-05-12 · conditional · none · ref 16
Empirical analysis of 338 PRs with self-admitted ChatGPT usage shows low full integration (median 25%), selective adaptation patterns, and broader influence on developer reasoning during reviews.

An empirical comparison of model validation techniques for defect prediction models,

fields

years

verdicts

representative citing papers

citing papers explorer