A literature synthesis and field catalogue identify Governance Debt as the main driver of Data Lake failures and introduce assessment tools and frameworks for practitioners.
Data Lakes: A Survey of Functions and Systems,
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
MoRER builds an ER model repository via feature distribution clustering of tasks, achieving competitive results with limited labels versus active learning, transfer learning, and self-supervised methods on three multi-source datasets.
Introduces a cross-paradigm database selection framework based on nine dimensions, analyzes thirteen paradigms to identify three evolution patterns, and demonstrates hybrid architectures via a financial fraud detection case study.
citing papers explorer
-
Efficient Model Repository for Entity Resolution: Construction, Search, and Integration
MoRER builds an ER model repository via feature distribution clustering of tasks, achieving competitive results with limited labels versus active learning, transfer learning, and self-supervised methods on three multi-source datasets.
-
Architectural Evolution and Selection Framework for Database Systems in AI-Ready Data Platforms
Introduces a cross-paradigm database selection framework based on nine dimensions, analyzes thirteen paradigms to identify three evolution patterns, and demonstrates hybrid architectures via a financial fraud detection case study.