Recognition: unknown
PianoCoRe: Combined and Refined Piano MIDI Dataset
Pith reviewed 2026-05-08 04:05 UTC · model grok-4.3
The pith
A combined and refined piano MIDI dataset provides the largest collection of score-aligned performances to date and improves model robustness on new pieces.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PianoCoRe unifies major open-source piano MIDI datasets into a large-scale resource with 250,046 performances totaling 21,763 hours. The note-aligned PianoCoRe-A subset is the largest open collection of its kind. A MIDI quality classifier detects corrupted files, and the RAScoP pipeline refines alignments by cleaning temporal errors and interpolating missing notes. Refinement reduces temporal noise and eliminates tempo outliers. An expressive rendering model trained on PianoCoRe shows improved robustness to unseen pieces.
What carries the argument
The PianoCoRe dataset with its tiered subsets and the RAScoP alignment refinement pipeline, which cleans temporal alignment errors and interpolates missing notes while the quality classifier removes corrupted files.
If this is right
- Models for expressive piano performance can be trained on larger, cleaner aligned data, leading to better generalization.
- The refinement process reduces temporal noise and removes tempo outliers from the data.
- Researchers gain access to subsets suited for pre-training, large-scale analysis, or precise alignment tasks.
- Future work in music information retrieval can build on this unified resource instead of fragmented smaller datasets.
Where Pith is reading between the lines
- Such a dataset could support development of more accurate automatic accompaniment systems or piano transcription tools.
- Extending similar refinement pipelines to other instruments might create comparable resources for broader music research.
- Improved robustness suggests that data quality and alignment matter more than sheer volume alone for performance modeling tasks.
Load-bearing premise
The MIDI quality classifier and RAScoP pipeline correctly identify corrupted files and fix alignment errors without introducing systematic biases or removing valid expressive variations.
What would settle it
If a performance rendering model trained on PianoCoRe fails to show improved robustness on a held-out set of unseen pieces compared to models trained on raw data, or if manual review reveals that many valid performances were incorrectly removed or alignments distorted.
Figures
read the original abstract
Symbolic music datasets with matched scores and performances are essential for many music information retrieval (MIR) tasks. Yet, existing resources often cover a narrow range of composers, lack performance variety, omit note-level alignments, or use inconsistent naming formats. This work presents PianoCoRe, a large-scale piano MIDI dataset that unifies and refines major open-source piano corpora. The dataset contains 250,046 performances of 5,625 pieces written by 483 composers, totaling 21,763 h of performed music. PianoCoRe is released in tiered subsets to support different applications: from large-scale analysis and pre-training (PianoCoRe-C and deduplicated PianoCoRe-B) to expressive performance modeling with note-level score alignment (PianoCoRe-A/A*). The note-aligned subset, PianoCoRe-A, provides the largest open-source collection of 157,207 performances aligned to 1,591 scores to date. In addition to the dataset, the contributions are: (1) a MIDI quality classifier for detecting corrupted and score-like transcriptions and (2) RAScoP, an alignment refinement pipeline that cleans temporal alignment errors and interpolates missing notes. The analysis shows that the refinement reduces temporal noise and eliminates tempo outliers. Moreover, an expressive performance rendering model trained on PianoCoRe demonstrates improved robustness to unseen pieces compared to models trained on raw or smaller datasets. PianoCoRe provides a ready-to-use foundation for the next generation of expressive piano performance research.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents PianoCoRe, a unified large-scale piano MIDI dataset aggregating and refining existing open-source corpora into 250,046 performances of 5,625 pieces by 483 composers (21,763 hours total). It releases tiered subsets, with PianoCoRe-A providing the largest open note-aligned collection (157,207 performances aligned to 1,591 scores). Contributions include a MIDI quality classifier to remove corrupted/score-like files and the RAScoP pipeline for correcting temporal alignments and interpolating notes. The work claims that refinement reduces temporal noise and tempo outliers, and that an expressive performance rendering model trained on PianoCoRe shows improved robustness to unseen pieces compared to models trained on raw or smaller datasets.
Significance. If the validation claims hold, PianoCoRe would be a substantial resource for MIR tasks and expressive performance modeling, offering greater scale, alignment, and cleanliness than prior collections while supporting different use cases via its tiers. The open release and practical focus on note-level alignment could enable more reproducible and robust research in symbolic music.
major comments (3)
- [Abstract and §6] Abstract and §6 (analysis of refinement): the claim that 'the refinement reduces temporal noise and eliminates tempo outliers' is presented without any quantitative metrics (e.g., before/after distributions of timing variance, tempo statistics, or error rates), baselines, or comparison to ground-truth alignments, which is load-bearing for asserting improved data quality.
- [§4 and §5] §4 (MIDI quality classifier) and §5 (RAScoP pipeline): no external validation, inter-annotator agreement, or ablation is reported to confirm that the classifier and pipeline correctly discard only corrupted files and fix alignments without systematically removing valid expressive variations or introducing biases in timing/dynamics; this directly affects the central claim that PianoCoRe-A is the largest high-quality aligned collection.
- [§7] §7 (expressive performance rendering experiments): the improved robustness to unseen pieces is asserted but without details on model architecture, training hyperparameters, exact evaluation metrics (e.g., note-onset F1, velocity/timing error), the specific unseen test set, or an ablation comparing the same model trained on raw data with only trivial cleaning, making it impossible to attribute gains to the refinement versus scale alone.
minor comments (3)
- [§2 and Table 1] Clarify the exact differences and intended use cases between PianoCoRe-A and PianoCoRe-A* in the main text and any summary tables.
- [References and §3] Add explicit DOIs or persistent links for all source corpora in the references and dataset description to improve reproducibility.
- [Figures in §5] Ensure all figures showing before/after alignment examples include scale bars or quantitative annotations for visual clarity.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We agree that additional quantitative evidence and experimental details are needed to strengthen the claims and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: [Abstract and §6] Abstract and §6 (analysis of refinement): the claim that 'the refinement reduces temporal noise and eliminates tempo outliers' is presented without any quantitative metrics (e.g., before/after distributions of timing variance, tempo statistics, or error rates), baselines, or comparison to ground-truth alignments, which is load-bearing for asserting improved data quality.
Authors: We acknowledge that the current manuscript presents the refinement effects qualitatively without supporting statistics. In the revised version we will add explicit before/after quantitative metrics in §6, including distributions of timing variance, tempo outlier counts, and any available alignment error rates relative to ground-truth scores where they exist. These will also be summarized in the abstract. revision: yes
-
Referee: [§4 and §5] §4 (MIDI quality classifier) and §5 (RAScoP pipeline): no external validation, inter-annotator agreement, or ablation is reported to confirm that the classifier and pipeline correctly discard only corrupted files and fix alignments without systematically removing valid expressive variations or introducing biases in timing/dynamics; this directly affects the central claim that PianoCoRe-A is the largest high-quality aligned collection.
Authors: We agree that external validation and ablations are important. We will add an ablation study in the revised manuscript quantifying the effect of the classifier and RAScoP on dataset size and quality metrics, plus a description of our internal validation procedure (manual sampling and held-out checks). We will clarify that inter-annotator agreement is not applicable here as the process is automated with post-hoc inspection rather than multi-annotator labeling. revision: yes
-
Referee: [§7] §7 (expressive performance rendering experiments): the improved robustness to unseen pieces is asserted but without details on model architecture, training hyperparameters, exact evaluation metrics (e.g., note-onset F1, velocity/timing error), the specific unseen test set, or an ablation comparing the same model trained on raw data with only trivial cleaning, making it impossible to attribute gains to the refinement versus scale alone.
Authors: We will expand §7 with complete details on model architecture, training hyperparameters, exact evaluation metrics (including note-onset F1 and timing/velocity errors), and the composition of the unseen test set. We will also include an ablation comparing the identical model trained on the raw versus refined data to isolate the contribution of refinement from scale. revision: yes
Circularity Check
No circularity: dataset curation with no derivation chain or fitted predictions
full rationale
The paper describes aggregation of existing MIDI corpora, application of a quality classifier, and a heuristic alignment pipeline (RAScoP) followed by empirical checks on temporal noise and model robustness. No equations, first-principles derivations, or predictions are presented that could reduce to inputs by construction. Claims about scale and improved robustness rest on data processing and external comparisons rather than self-referential definitions or self-citation load-bearing arguments. This is standard data-resource work with no load-bearing mathematical steps.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Transactions of the International Society for Music Information Retrieval , doi =
Lerch, Alexander and Arthur, Claire and Pati, Ashis and Gururani, Siddharth , year =. Transactions of the International Society for Music Information Retrieval , doi =
-
[2]
2018 , journal =
Cancino-Chac. 2018 , journal =
2018
-
[3]
Proceedings of the 36th International Conference on Machine Learning
Jeong, Dasaem and Kwon, Taegyun and Kim, Yoojin and Nam, Juhan , year =. Proceedings of the 36th International Conference on Machine Learning
-
[4]
Proceedings of the 20th International Society for Music Information Retrieval Conference
Dasaem Jeong and Taegyun Kwon and Yoojin Kim and Kyogu Lee and Juhan Nam , year =. Proceedings of the 20th International Society for Music Information Retrieval Conference
-
[5]
Proceedings of the 23rd International Society for Music Information Retrieval Conference
Rhyu, Seungyeon and Kim, Sarah and Lee, Kyogu , year =. Proceedings of the 23rd International Society for Music Information Retrieval Conference
-
[6]
Proceedings of the 28th ACM International Conference on Multimedia , pages =
Huang, Yu-Siang and Yang, Yi-Hsuan , year =. Proceedings of the 28th ACM International Conference on Multimedia , pages =
-
[7]
Proceedings of the 35th AAAI Conference on Artificial Intelligence , volume =
Hsiao, Wen-Yi and Liu, Jen-Yu and Yeh, Yin-Cheng and Yang, Yi-Hsuan , year =. Proceedings of the 35th AAAI Conference on Artificial Intelligence , volume =
-
[8]
Proceedings of the 21st International Society for Music Information Retrieval Conference
Foscarin, Francesco and Mcleod, Andrew and Rigaux, Philippe and Jacquemard, Florent and Sakai, Masahiko , year =. Proceedings of the 21st International Society for Music Information Retrieval Conference
-
[9]
2022 , booktitle =
Zhang, Huan and Tang, Jingjing and Rafee, Syed Rifat Mahmud and Fazekas, Simon Dixon Gy. 2022 , booktitle =
2022
-
[10]
Proceedings of the 18th International Society for Music Information Retrieval Conference
Eita Nakamura and Kazuyoshi Yoshii and Haruhiro Katayose , year =. Proceedings of the 18th International Society for Music Information Retrieval Conference
-
[11]
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 , pages =
Zeng, Mingliang and Tan, Xu and Wang, Rui and Ju, Zeqian and Qin, Tao and Liu, Tie-Yan , year =. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 , pages =
2021
-
[12]
Xia, Gus Guangyu , year =
-
[13]
Open Engineering , publisher =
Zacharov, Igor and Arslanov, Rinat and Gunin, Maksim and Stefonishin, Daniil and Bykov, Andrey and Pavlov, Sergey and Panarin, Oleg and Maliutin, Anton and Rykovanov, Sergey and Fedorov, Maxim , year =. Open Engineering , publisher =
-
[14]
Proceedings of the 24th International Society for Music Information Retrieval Conference
Borovik, Ilya and Viro, Vladimir , year =. Proceedings of the 24th International Society for Music Information Retrieval Conference
-
[15]
2023 , journal =
Peter, Silvan David and Cancino-Chac. 2023 , journal =
2023
-
[16]
Proceedings of the 24th International Society for Music Information Retrieval Conference
Peter, Silvan David , year =. Proceedings of the 24th International Society for Music Information Retrieval Conference
-
[17]
2024 , journal =
Zhang, Huan and Chowdhury, Shreyan and Cancino-Chac. 2024 , journal =
2024
-
[18]
Transactions of the International Society for Music Information Retrieval , volume =
Kong, Qiuqiang and Li, Bochen and Chen, Jitong and Wang, Yuxuan , year =. Transactions of the International Society for Music Information Retrieval , volume =
-
[19]
Long, Phillip and Novack, Zachary and Berg-Kirkpatrick, Taylor and McAuley, Julian , year =. Proceeding of the 50th IEEE International Conference on Acoustics, Speech and Signal Processing. doi:10.1109/ICASSP49660.2025.10890217 , organization =
-
[20]
Proceedings of the 25th International Society for Music Information Retrieval Conference
Yujia Yan and Zhiyao Duan , year =. Proceedings of the 25th International Society for Music Information Retrieval Conference
-
[21]
Proceedings of the 25th International Society for Music Information Retrieval Conference
Tim Beyer and Angela Dai , year =. Proceedings of the 25th International Society for Music Information Retrieval Conference
-
[22]
Proceedings of the 24th International Society for Music Information Retrieval Conference
Hu, Patricia and Widmer, Gerhard , year =. Proceedings of the 24th International Society for Music Information Retrieval Conference
-
[23]
Extended Abstracts for the Late-Breaking Demo Session of the 25th International Society for Music Information Retrieval Conference
Yikai Liao and Zhongqi Luo and Yue Wang and Yujie Yin , year =. Extended Abstracts for the Late-Breaking Demo Session of the 25th International Society for Music Information Retrieval Conference
-
[24]
Proceeding of the 50th IEEE International Conference on Acoustics, Speech and Signal Processing
Tang, Jingjing and Cooper, Erica and Wang, Xin and Yamagishi, Junichi and Fazekas, György , year =. Proceeding of the 50th IEEE International Conference on Acoustics, Speech and Signal Processing
-
[25]
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC , pages =
Lam, Siu Kwan and Pitrou, Antoine and Seibert, Stanley , year =. Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC , pages =
-
[26]
2022 , booktitle =
Cancino-Chac. 2022 , booktitle =
2022
-
[27]
Journal of the Musical Arts in Africa , publisher =
Watson, Maike , year =. Journal of the Musical Arts in Africa , publisher =
-
[28]
2011 , booktitle =
M. 2011 , booktitle =
2011
-
[29]
Proceedings of the 4th International Conference on Technologies for Music Notation and Representation
Kosta, Katerina and Bandtlow, Oscar F and Chew, Elaine , year =. Proceedings of the 4th International Conference on Technologies for Music Notation and Representation
-
[30]
Proceedings of the 15th Sound and Music Computing Conference
Hashida, Mitsuyo and Nakamura, Eita and Katayose, Haruhiro , year =. Proceedings of the 15th Sound and Music Computing Conference
-
[31]
Proceeding of the 20th International Society on Music Information Retrieval
Shi, Zhengshan and Sapp, Craig and Arul, Kumaran and McBride, Jerry and Smith III, Julius O , year =. Proceeding of the 20th International Society on Music Information Retrieval
-
[32]
Proceedings of the 7th International Conference on Representation Learning
Curtis Hawthorne and Andriy Stasyuk and Adam Roberts and Ian Simon and Cheng-Zhi Anna Huang and Sander Dieleman and Erich Elsen and Jesse Engel and Douglas Eck , year =. Proceedings of the 7th International Conference on Representation Learning
-
[33]
Goebl, Werner , year =
-
[34]
IEEE/ACM Transactions on Audio, Speech, and Language Processing , publisher =
Kong, Qiuqiang and Li, Bochen and Song, Xuchen and Wan, Yuan and Wang, Yuxuan , year =. IEEE/ACM Transactions on Audio, Speech, and Language Processing , publisher =
-
[35]
The Virtual Score: Representation, Retrieval, Restoration , publisher =
Good, Michael , year =. The Virtual Score: Representation, Retrieval, Restoration , publisher =
-
[36]
Neurocomputing , publisher =
Su, Jianlin and Ahmed, Murtadha and Lu, Yu and Pan, Sh engfeng and Bo, Wen and Liu, Yunfeng , year =. Neurocomputing , publisher =
-
[37]
Proceedings of the 22nd International Society for Music Information Retrieval Conference
Hung, Hsiao-Tzu and Ching, Joann and Doh, Seungheon and Kim, Nabin and Nam, Juhan and Yang, Yi-Hsuan , year =. Proceedings of the 22nd International Society for Music Information Retrieval Conference
-
[38]
Proceedings of the 13th International Conference on Representation Learning
Bradshaw, Louis and Colton, Simon , year =. Proceedings of the 13th International Conference on Representation Learning
-
[39]
Proceedings of the 26th International Society for Music Information Retrieval Conference
Bradshaw, Louis and Fan, Honglu and Spangher, Alexander and Biderman, Stella and Colton, Simon , year =. Proceedings of the 26th International Society for Music Information Retrieval Conference
-
[40]
Proceedings of the 33rd ACM International Conference on Multimedia , pages =
Borovik, Ilya and Gavrilev, Dmitrii and Viro, Vladimir , year =. Proceedings of the 33rd ACM International Conference on Multimedia , pages =
-
[41]
2024 , booktitle =
Hu, Patricia and Mart. 2024 , booktitle =
2024
-
[42]
Transactions of the International Society for Music Information Retrieval , volume =
Ycart, Adrien and Liu, Lele and Benetos, Emmanouil and Pearce, Marcus , year =. Transactions of the International Society for Music Information Retrieval , volume =
-
[43]
Multimedia Tools and Applications , publisher =
Simonetta, Federico and Avanzini, Federico and Ntalampiras, Stavros , year =. Multimedia Tools and Applications , publisher =
-
[44]
Transactions of the International Society for Music Information Retrieval , volume =
Lee, Keon Ju Maverick and Ens, Jeff and Adkins, Sara and Sarmento, Pedro and Barthet, Mathieu and Pasquier, Philippe , year =. Transactions of the International Society for Music Information Retrieval , volume =
-
[45]
Transactions of the International Society for Music Information Retrieval , volume =
Edwards, Drew and Dixon, Simon and Benetos, Emmanouil , year =. Transactions of the International Society for Music Information Retrieval , volume =
-
[46]
Journal of Creative Music Systems , publisher =
Chou, Yi-Hui and Chen, I-Chun and Ching, Joann and Chang, Chin-Jui and Yang, Yi-Hsuan , year =. Journal of Creative Music Systems , publisher =
-
[47]
IEEE Signal Processing Magazine , publisher =
Benetos, Emmanouil and Dixon, Simon and Duan, Zhiyao and Ewert, Sebastian , year =. IEEE Signal Processing Magazine , publisher =
-
[48]
IEEE Signal Processing Letters , publisher =
Edwards, Drew and Dixon, Simon and Benetos, Emmanouil and Maezawa, Akira and Kusaka, Yuta , year =. IEEE Signal Processing Letters , publisher =
-
[49]
Proceedings of the 37th AAAI Conference on Artificial Intelligence , volume =
Guo, Zixun and Kang, Jaeyong and Herremans, Dorien , year =. Proceedings of the 37th AAAI Conference on Artificial Intelligence , volume =
-
[50]
Proceeding of the 25th IEEE International Conference on Multimedia and Expo
Liang, Xiao and Zhao, Zijian and Zeng, Weichao and He, Yutong and He, Fupeng and Wang, Yiyi and Gao, Chengying , year =. Proceeding of the 25th IEEE International Conference on Multimedia and Expo. doi:10.1109/ICME57554.2024.10688332 , organization =
-
[51]
Advances in Neural Information Processing Systems
Vaswani, Ashish and Shazeer, Noam and Parmar, Niki and Uszkoreit, Jakob and Jones, Llion and Gomez, Aidan N and Kaiser, ukasz and Polosukhin, Illia , year =. Advances in Neural Information Processing Systems
-
[52]
The Proceedings of the 11th International Conference on Learning Representations
Lipman, Yaron and Chen, Ricky TQ and Ben-Hamu, Heli and Nickel, Maximilian and Le, Matthew , year =. The Proceedings of the 11th International Conference on Learning Representations
-
[53]
Emerson, Katelyn and Harrison, Peter M. C. , year =. Transactions of the International Society for Music Information Retrieval , volume =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.