Supervised Latent Restructuring for Small-Data Quantum Learning in Plant Phenomics

Alakananda Mitra; Chittaranjan Ray; David H. Fleisher; Vangimalla Reddy

arxiv: 2605.20413 · v1 · pith:M2O7PXW5new · submitted 2026-05-19 · 💻 cs.LG

Pith reviewed 2026-05-21 07:47 UTC · model grok-4.3

classification 💻 cs.LG

keywords quantum kernel alignment · plant phenomics · latent space restructuring · small data learning · dimensionality reduction · silhouette coefficient · LDA · deep image embeddings

The pith

Supervised LDA restructuring raises silhouette coefficient to 0.197 in compressed plant phenomics embeddings.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper addresses the difficulty of reliable classification when biological image data has far more features than available samples, a common issue in fine-grained plant phenomics. It proposes a workflow that first compresses 1280-dimensional deep embeddings to 64 dimensions with PCA and then applies supervised LDA to produce an 11-dimensional latent space. The central result is that this supervised restructuring raises the Silhouette coefficient from near zero or negative values to 0.197, showing clearer class geometry. A reader would care because the work tests whether engineered latent geometry can make small-data problems more amenable to quantum kernel methods even when sample sizes remain severely limited.

Core claim

The paper claims that supervised latent restructuring with Linear Discriminant Analysis after PCA compression substantially improves the geometric separability of the compressed representation. This is shown by the Silhouette coefficient increasing from 0.003 in the raw 1280-dimensional embedding space and -0.006 in the 64-dimensional PCA space to 0.197 in the 11-dimensional supervised LDA space. Downstream evaluation finds that Linear SVM and XGBoost benefit from the restructured space while RBF-SVM and Random Forest degrade, and that Quantum Kernel Alignment remains difficult to train effectively under a constrained optimization budget.
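To make the headline metric concrete, here is a minimal sketch of how a mean Silhouette coefficient can be computed from scratch. It assumes Euclidean distance and at least two classes; the function name `silhouette_score` and the toy points are illustrative choices, not the paper's code or data. For each point, `a` is the mean distance to its own class and `b` is the mean distance to the nearest other class, giving a per-point score of `(b - a) / max(a, b)`.

```python
import math

def silhouette_score(points, labels):
    """Mean silhouette coefficient with Euclidean distance (needs >= 2 classes)."""
    n = len(points)
    classes = sorted(set(labels))
    scores = []
    for i in range(n):
        # Mean distance from point i to the members of each class (excluding itself).
        mean_dist = {}
        for c in classes:
            members = [j for j in range(n) if labels[j] == c and j != i]
            if members:
                mean_dist[c] = sum(math.dist(points[i], points[j]) for j in members) / len(members)
        own = labels[i]
        if own not in mean_dist:
            scores.append(0.0)  # singleton class: score is taken to be 0
            continue
        a = mean_dist[own]                                      # cohesion
        b = min(d for c, d in mean_dist.items() if c != own)    # separation
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / n

# Toy data (hypothetical, not from the paper): two tight groups far apart.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
separated_score = silhouette_score(toy_points, [0, 0, 0, 1, 1, 1])  # labels follow the geometry
mixed_score = silhouette_score(toy_points, [0, 1, 0, 1, 0, 1])      # labels ignore the geometry
print(separated_score, mixed_score)
```

On this toy example, labels that follow the geometry score close to 1, while labels that cut across it score near zero. That is the sense in which values such as 0.003 and -0.006 indicate almost no class structure and 0.197 indicates modest but real structure.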

What carries the argument

Supervised latent restructuring via Linear Discriminant Analysis, which projects PCA-compressed embeddings into an 11-dimensional space that maximizes the ratio of between-class to within-class variance before quantum kernel alignment is applied.
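The "ratio of between-class to within-class variance" can be written out in its standard multi-class form. The notation below is an assumption for illustration (the paper's own symbols are not reproduced on this page): $x_i$ are the PCA-compressed embeddings, $y_i$ their labels, $\mu_c$ and $n_c$ the mean and size of class $c$, $\mu$ the global mean, and $C$ the number of classes.

```latex
\[
S_W = \sum_{c=1}^{C} \sum_{i:\, y_i = c} (x_i - \mu_c)(x_i - \mu_c)^{\top},
\qquad
S_B = \sum_{c=1}^{C} n_c\, (\mu_c - \mu)(\mu_c - \mu)^{\top},
\]
\[
W^{\star} = \arg\max_{W}\; \operatorname{tr}\!\left[ \left(W^{\top} S_W W\right)^{-1} W^{\top} S_B W \right],
\qquad
\operatorname{rank}(S_B) \le C - 1 .
\]
```

Because $S_B$ has rank at most $C - 1$, LDA can return at most $C - 1$ discriminant directions. For a 12-class task this bound is 11, which is consistent with the LDA-11 space described here, although the page does not state that this is how the authors chose the dimension.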

If this is right

Linear SVM and XGBoost achieve higher accuracy in the supervised 11-dimensional space than in the unsupervised PCA space.
RBF-SVM and Random Forest show lower accuracy after the same supervised compression to 11 dimensions.
Quantum Kernel Alignment does not produce strong trainable performance even after the geometry improvement.
Representation geometry must be treated as a central design variable when building small-data quantum learning systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

Nonlinear supervised reduction methods might preserve more structure than LDA and further improve quantum trainability.
The same compression-plus-restructuring sequence could be tested on other high-dimensional biological datasets to check generality.
Relaxing the optimization budget in future experiments might reveal whether the improved geometry can be exploited by quantum kernels.

Load-bearing premise

The 11-dimensional LDA projection preserves enough of the original class-separating structure to make downstream quantum kernel alignment trainable despite the reduction from 1280 dimensions.

What would settle it

Running quantum kernel alignment to convergence in both the LDA-11 space and the PCA-64 space with identical optimization budgets and then comparing final classification accuracy would show whether the restructuring step enables better quantum performance.

Figures

Figures reproduced from arXiv: 2605.20413 by Alakananda Mitra, Chittaranjan Ray, David H. Fleisher, Vangimalla Reddy.

**Figure 1.** Latent space geometries for the 12-class pathology task. (a) Unsupervised PCA retains variance but exhibits …

**Figure 2.** Classical Benchmarking under Dimensionality Constraints. (a) Accuracy and (b) Macro-F1 for the 12-class …

**Figure 3.** Class-wise F1 comparison across classical baselines under PCA-64 and PCA→LDA-11 representations. The figure highlights the supervised compression trade-off: the compressed supervised latent space improves some learners, but substantially degrades high-capacity nonlinear models on several minority and visually similar classes. While these results trail the classical PCA-64 benchmarks, they demonstrate that …

Original abstract

High-dimensional biological data often exhibit a severe mismatch between feature dimensionality and sample size, making reliable classification difficult in extremely small-data regimes. In these settings, kernel methods can lose discriminative power when latent compression fails to preserve class-separating structure. We study this problem in fine-grained plant phenomics and propose a hybrid workflow that compresses 1280-dimensional deep image embeddings into a 64-dimensional PCA space and then restructures them into an 11-dimensional supervised latent space using Linear Discriminant Analysis (LDA), followed by GPU-accelerated Quantum Kernel Alignment (QKA) on NVIDIA L40S hardware. Empirically, supervised latent restructuring substantially improves the geometric separability of the compressed representation, increasing the Silhouette coefficient from 0.003 in the raw embedding space and -0.006 in PCA-64 to 0.197 in the supervised LDA-11 space. However, downstream classical evaluation reveals a clear compression trade-off: Linear SVM and XGBoost improve in the restructured latent space, whereas RBF-SVM and Random Forest degrade under the same 11-dimensional bottleneck. Under a constrained optimization budget, QKA in this regime remains challenging, indicating that latent geometry alone is not sufficient for strong trainable quantum performance. These findings position representation geometry as a central design variable in small-data quantum learning and expose the practical difficulty of recovering nonlinear discriminative structure from aggressively compressed biological representations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, and this is the friction.

Desk Editor's Note (private letter to a colleague)

LDA after PCA lifts silhouette scores in these plant embeddings but leaves QKA hard to train under the given budget.

This paper runs 1280-dimensional deep embeddings from plant phenomics through PCA down to 64 dimensions and then LDA to 11 supervised dimensions, then tries quantum kernel alignment on top. The clearest result is the silhouette coefficient rising from 0.003 raw and -0.006 after PCA to 0.197 after LDA, with the added note that this geometry still does not make QKA reliably trainable under their optimization limits. They also show the expected classical trade-off: linear SVM and XGBoost improve while RBF-SVM and random forest degrade in the 11-dimensional space. That honesty about what does and does not improve is the most useful part of the write-up.

The workflow itself is straightforward and the empirical observation on geometry as a design variable lands cleanly without overclaiming quantum advantage. The stress-test note is right that there is no circularity or unsupported extrapolation in the reported numbers. The main limitation is that the method is an application of existing PCA-plus-LDA-plus-QKA steps to one new dataset rather than a new algorithm or derivation. Prior work already contains similar pipelines, so the contribution stays domain-specific. Sample sizes and error bars are not visible in the abstract, which leaves the robustness of the 0.197 figure harder to judge until the full tables are checked. The paper does not hide that latent geometry alone proved insufficient for strong quantum performance, which keeps the claims proportionate.

This is the sort of targeted empirical note that researchers working on small-data quantum kernels in biology or agriculture would find practical. It does not reshape the broader field but gives a concrete data point on representation choices. I would send it for peer review so referees can verify the experimental protocol and suggest whether additional runs or larger budgets could move the QKA results.

Referee Report

2 major / 2 minor

Summary. The paper proposes a hybrid classical-quantum workflow for small-data plant phenomics classification: 1280-dimensional deep embeddings are first compressed via PCA to 64 dimensions and then restructured via supervised LDA into an 11-dimensional latent space, after which GPU-accelerated Quantum Kernel Alignment (QKA) is attempted. The central empirical result is a measured increase in Silhouette coefficient from 0.003 (raw) and -0.006 (PCA-64) to 0.197 (LDA-11), accompanied by the observation that Linear SVM and XGBoost improve while RBF-SVM and Random Forest degrade under the same bottleneck, and that QKA remains difficult to train under the stated optimization budget. The manuscript concludes that representation geometry is a key design variable but that linear supervised restructuring alone does not suffice for reliable quantum performance in this regime.

Significance. If the reported silhouette gains and classical trade-offs are reproducible, the work supplies a concrete, falsifiable demonstration that supervised linear restructuring can materially improve geometric separability in aggressively compressed biological embeddings, while simultaneously exposing the practical limits of such restructuring for downstream quantum kernel methods. The explicit negative result on QKA trainability under constrained budget is a useful boundary condition for the field.

major comments (2)

The abstract and experimental description report silhouette coefficients (0.003, -0.006, 0.197) without error bars, standard deviations, or the number of bootstrap or cross-validation repetitions used to obtain them. Because the central claim is a quantitative improvement in separability, the absence of uncertainty quantification makes it impossible to judge whether the jump to 0.197 is statistically distinguishable from the baselines under the small-sample regime implied by the title.
The manuscript states that the 11-dimensional LDA projection is followed by QKA, yet provides no explicit description of the quantum feature map, the kernel alignment objective, or the precise optimization budget (number of epochs, learning-rate schedule, or hardware precision). Without these details it is difficult to evaluate the claim that “latent geometry alone is not sufficient” versus the possibility that the optimization procedure itself was under-powered.
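The uncertainty quantification requested in the first major comment could take the form of a bootstrap standard deviation. The sketch below is a simplified illustration under stated assumptions: it resamples a list of per-sample silhouette values with replacement and reports the spread of their mean, whereas a fuller treatment would recompute the silhouette on each resampled set of points. The helper name `bootstrap_std`, the default of 100 resamples, and the numbers in `per_sample_values` are all hypothetical and are not taken from the paper.

```python
import random
import statistics

def bootstrap_std(values, statistic, n_boot=100, seed=0):
    """Return (mean, std) of `statistic` over bootstrap resamples of `values`."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    n = len(values)
    estimates = []
    for _ in range(n_boot):
        # Draw n items with replacement from the original sample.
        resample = [values[rng.randrange(n)] for _ in range(n)]
        estimates.append(statistic(resample))
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical per-sample silhouette values, for illustration only.
per_sample_values = [0.31, 0.12, -0.05, 0.27, 0.40, 0.08, 0.22, -0.11, 0.35, 0.19]
boot_mean, boot_std = bootstrap_std(per_sample_values, statistics.mean)
print(boot_mean, boot_std)
```

Reporting the reported coefficient together with a spread of this kind would let a reader judge whether a value like 0.197 is distinguishable from the near-zero baselines at the available sample size.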

minor comments (2)

The transition from 1280-dimensional embeddings to PCA-64 to LDA-11 is described only at the level of target dimensions; the fraction of variance retained by the PCA step and the number of classes in the LDA step are not stated.
Figure captions and axis labels should explicitly indicate whether the reported silhouette values are computed on the training set, a held-out test set, or both.
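The variance fraction mentioned in the first minor comment is simple to report once the PCA eigenvalues are available. The following is a minimal sketch, assuming the eigenvalues of the covariance matrix have already been computed; the function names and the five-value `toy_spectrum` are hypothetical and stand in for the 1280-value spectrum that the paper's embeddings would produce.

```python
def retained_fraction(eigenvalues, k):
    """Fraction of total variance captured by the k largest eigenvalues."""
    ordered = sorted(eigenvalues, reverse=True)
    return sum(ordered[:k]) / sum(ordered)

def components_for_variance(eigenvalues, target):
    """Smallest number of leading components whose cumulative fraction reaches target."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    running = 0.0
    for k, value in enumerate(ordered, start=1):
        running += value
        if running / total >= target:
            return k
    return len(ordered)

# Toy spectrum (hypothetical): total variance is 8.0.
toy_spectrum = [4.0, 2.0, 1.0, 0.5, 0.5]
fraction_top2 = retained_fraction(toy_spectrum, 2)        # (4 + 2) / 8
needed_for_90 = components_for_variance(toy_spectrum, 0.9)
print(fraction_top2, needed_for_90)
```

Stating the equivalent of `retained_fraction(spectrum, 64)` for the real embeddings would tell readers how much variance the PCA-64 step discards before LDA is applied.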

Simulated Authors' Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback. We address each major comment below and will revise the manuscript accordingly to improve clarity and reproducibility.

Referee: The abstract and experimental description report silhouette coefficients (0.003, -0.006, 0.197) without error bars, standard deviations, or the number of bootstrap or cross-validation repetitions used to obtain them. Because the central claim is a quantitative improvement in separability, the absence of uncertainty quantification makes it impossible to judge whether the jump to 0.197 is statistically distinguishable from the baselines under the small-sample regime implied by the title.

Authors: We agree that uncertainty quantification is essential for evaluating the reported silhouette improvement in this small-data setting. In the revised manuscript we will add standard deviations computed via bootstrap resampling (100 iterations) of the silhouette scores on the held-out test partitions, along with the exact number of repetitions and the data-splitting protocol used. This will allow readers to assess whether the increase to 0.197 is statistically distinguishable from the near-zero baselines.
Revision: yes

Referee: The manuscript states that the 11-dimensional LDA projection is followed by QKA, yet provides no explicit description of the quantum feature map, the kernel alignment objective, or the precise optimization budget (number of epochs, learning-rate schedule, or hardware precision). Without these details it is difficult to evaluate the claim that “latent geometry alone is not sufficient” versus the possibility that the optimization procedure itself was under-powered.

Authors: We concur that the QKA implementation details must be expanded for proper evaluation. The revised methods section will specify the quantum feature map (angle embedding followed by a single layer of ZZ interactions), the kernel alignment objective (squared loss between the quantum kernel matrix and the ideal label kernel), the optimization budget (100 epochs with a cosine-annealing learning-rate schedule starting at 0.1), and the floating-point precision employed on the NVIDIA L40S. These additions will clarify that the reported training difficulties persist even under the stated budget.
Revision: yes
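The kernel alignment objective discussed in this exchange can be illustrated classically. The sketch below is not the paper's QKA implementation: it swaps in an RBF kernel as a stand-in for the quantum kernel, with `gamma` playing the role of a trainable feature-map parameter, and it assumes an "ideal" label kernel equal to 1 for same-class pairs and 0 otherwise. All function names and the toy points are hypothetical. It computes both the normalized kernel-target alignment and a squared (Frobenius) loss against the label kernel.

```python
import math

def rbf_kernel_matrix(points, gamma):
    """Classical stand-in for a quantum kernel: K_ij = exp(-gamma * ||p_i - p_j||^2)."""
    return [[math.exp(-gamma * math.dist(p, q) ** 2) for q in points] for p in points]

def label_kernel(labels):
    """Assumed ideal kernel: 1 for same-class pairs, 0 otherwise."""
    return [[1.0 if a == b else 0.0 for b in labels] for a in labels]

def frobenius_inner(A, B):
    return sum(a * b for row_a, row_b in zip(A, B) for a, b in zip(row_a, row_b))

def kernel_target_alignment(K, Y):
    """Normalized alignment <K, Y> / (||K|| * ||Y||), at most 1."""
    return frobenius_inner(K, Y) / math.sqrt(frobenius_inner(K, K) * frobenius_inner(Y, Y))

def squared_alignment_loss(K, Y):
    """Sum of squared entrywise differences between K and Y."""
    return sum((k - y) ** 2 for row_k, row_y in zip(K, Y) for k, y in zip(row_k, row_y))

# Toy data (hypothetical): two tight, well-separated classes.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
toy_labels = [0, 0, 0, 1, 1, 1]
Y = label_kernel(toy_labels)
K_sharp = rbf_kernel_matrix(toy_points, gamma=0.5)    # resolves the two classes
K_flat = rbf_kernel_matrix(toy_points, gamma=0.001)   # nearly constant kernel
alignment_sharp = kernel_target_alignment(K_sharp, Y)
alignment_flat = kernel_target_alignment(K_flat, Y)
print(alignment_sharp, alignment_flat)
```

In an alignment-based training loop, the kernel parameter would be adjusted to raise the alignment (or lower the squared loss). Whether a fixed budget of such updates suffices is exactly what the referee asks the authors to document.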

Circularity Check

0 steps flagged

No significant circularity identified

The manuscript presents a standard pipeline of PCA dimensionality reduction from 1280 to 64 dimensions followed by supervised LDA projection to 11 dimensions, with empirical evaluation of geometric separability via Silhouette coefficient and downstream classifier performance. The reported Silhouette increase (0.003 raw, -0.006 PCA-64, 0.197 LDA-11) is a direct post-projection measurement on the data points and does not reduce to any parameter fitted to produce that specific value. No equations, self-citations, or uniqueness claims are invoked that would make the central empirical observations tautological or forced by construction. The quantum kernel alignment results are presented as challenging under the given budget, without any derivation that loops back to presuppose success. The workflow is self-contained against external benchmarks and uses off-the-shelf techniques whose outputs are independently verifiable.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axiom · 0 invented entities

The workflow rests on standard linear algebra assumptions for PCA and LDA plus the empirical claim that 11 dimensions suffice for QKA; no new entities or heavily fitted constants beyond the two chosen dimensions.

free parameters (2)

PCA target dimension = 64: chosen compression target from 1280 to 64 dimensions
LDA target dimension = 11: supervised latent space size after PCA

axioms (1)

Domain assumption: LDA projection on PCA-reduced embeddings improves class separability without destroying information needed for quantum kernels. Invoked when claiming the silhouette gain enables better QKA performance.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean · reality_from_one_distinction
Relation between the paper passage and the cited Recognition theorem: unclear
Paper passage: "supervised latent restructuring substantially improves the geometric separability... Silhouette coefficient from 0.003... to 0.197 in the supervised LDA-11 space"

IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean · J_uniquely_calibrated_via_higher_derivative
Relation between the paper passage and the cited Recognition theorem: unclear
Paper passage: "Quantum Kernel Alignment (QKA) using a RealAmplitudes ansatz... 11-qubit ZZFeatureMap"

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

