pith. sign in

arxiv: 2606.26312 · v1 · pith:R3WCRVW7new · submitted 2026-06-24 · 🪐 quant-ph · cs.CV· cs.LG

Tailor Made Embeddings for Quantum Machine Learning

Pith reviewed 2026-06-26 01:27 UTC · model grok-4.3

classification 🪐 quant-ph cs.CVcs.LG
keywords quantum embeddingsvariational autoencoderquantum machine learningdata compressionMNIST classificationImageNetquantum classifiernoisy hardware
0
0 comments X

The pith

A variational autoencoder framework learns compact quantum embeddings of classical data that support accurate classification and reconstruction from limited measurements.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a variational autoencoder that learns task-specific quantum embeddings for classical datasets. This approach compresses high-dimensional data such as ImageNet into representations using only 13 qubits, with a decoder that reconstructs the input from polynomial numbers of measurements. On the MNIST dataset for distinguishing digits 3 and 5, the method yields 98.5 percent validation accuracy in a circuit-centric quantum classifier. This performance is close to a classical neural network and substantially exceeds that of standard amplitude embeddings. The framework was tested on actual IBM quantum devices, showing that the embeddings hold up under hardware noise.

Core claim

The paper claims that by training a variational autoencoder, one can obtain quantum embeddings tailored to the task that allow high-dimensional classical data to be encoded into a small number of qubits while still permitting reconstruction of the original data using only a polynomial number of measurements on the quantum state. Unlike fixed embedding schemes, this learned approach achieves near-classical accuracy on classification tasks and functions on noisy intermediate-scale quantum hardware.

What carries the argument

The variational autoencoder framework that trains an encoder circuit to map classical data to quantum states and a decoder to reconstruct the data from measurement statistics.

If this is right

  • High-dimensional datasets like ImageNet can be compressed into 13-qubit quantum states.
  • MNIST classification reaches 98.5% accuracy with a quantum classifier, within 1.2 points of classical performance.
  • Reconstruction succeeds from a polynomial number of measurements rather than full tomography.
  • The embeddings function on real IBM quantum hardware despite noise.
  • The method outperforms naive amplitude embeddings by over 30 percentage points.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The technique may enable quantum machine learning on datasets too large for direct embedding on current devices.
  • It could be combined with other variational quantum algorithms to improve initialization.
  • Stability under noise points to viability for near-term quantum applications without error correction.

Load-bearing premise

The variational training produces embeddings that remain stable and reconstructable from a polynomial number of measurements even when executed on noisy IBM quantum hardware, and that the reported accuracy gains are attributable to the learned embedding rather than other modeling choices.

What would settle it

If the trained circuits on IBM hardware show reconstruction errors much higher than simulated or classification accuracy significantly below 98.5%, the practical utility of the learned embeddings would be called into question.

Figures

Figures reproduced from arXiv: 2606.26312 by Aldo Lamarre, Dominik \v{S}afr\'anek.

Figure 1
Figure 1. Figure 1: Circuit-centric Classifier with amplitude embeddings input [ [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Quantum Classifier Circuit [28]: amplitude embedding followed by repeated strongly entangled layers. This model is scaled by the number of repeated strongly entangled layers. For a single qubit rotation gate U(θ), the derivative with respect to θ is computed by ∂f(θ) ∂θ = 1 2 h f  θ + π 2  − f  θ − π 2 i . (2) Thus, the gradient is obtained by executing the circuit twice with both θ + π 2 and θ − π 2 .… view at source ↗
Figure 3
Figure 3. Figure 3: Variational Quantum Latent Autoencoders. The encoder embeds the data into quantum [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Encoder choice. On the left side we show an amplitude encoder where a neural network [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The decoder is a neural network that takes measurement data and reconstructs the original [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Variational Quantum Autoencoder building blocks. The Data reuploading classifier layers [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: IBM Yonsei quantum hardware inference results. Original MNIST test samples (top) and [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Encoder and decoder architecture of the amplitude autoencoder. The encoder maps classi [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: The encoder feed the input to the quantum classfier. The measurement data is process [PITH_FULL_IMAGE:figures/full_fig_p011_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Autoencoder results on the MNIST and CIFAR-10 datasets. The left side corresponds [PITH_FULL_IMAGE:figures/full_fig_p011_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Accuracy on the validation dataset of 4 different models on 3v5 MNIST data. The circuit [PITH_FULL_IMAGE:figures/full_fig_p013_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Accuracy on the validation dataset of the 10 MNIST classes. This is the performance of [PITH_FULL_IMAGE:figures/full_fig_p014_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Two quantum classifiers. In red, the post-selection softmax was enhanced by two extra [PITH_FULL_IMAGE:figures/full_fig_p015_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Classification on CIFAR-100. The autoencoder used was trained on CIFAR-10. We at [PITH_FULL_IMAGE:figures/full_fig_p016_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Autoencoder results for the data re-uploading model on the CIFAR dataset. Validation [PITH_FULL_IMAGE:figures/full_fig_p017_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: Autoencoder on the ImageNet dataset [7]. Original sample on the left and reconstruction on the right. The images were resized to 256x256 to fit the model. This use the amplitude encoder as circuit are too long to simulate 21 [PITH_FULL_IMAGE:figures/full_fig_p021_17.png] view at source ↗
read the original abstract

Autoencoders transformed classical machine learning by solving the curse of dimensionality, enabling principled weight initialization and learning compact, structured representations. In this work, we extend this paradigm to quantum machine learning by introducing a variational autoencoder framework that learns task-specific quantum embeddings of classical data. We demonstrate that high-dimensional datasets, including ImageNet, can be compressed into a 13-qubit quantum representation while remaining reconstructable through a learned decoder. On MNIST (3 vs 5), our approach achieves 98.5% validation accuracy using a circuit-centric quantum classifier, within 1.2 percentage points of a classical neural network baseline (99.7%) and more than 30 percentage points above a naive amplitude-embedding approach. Unlike amplitude embeddings, which require full quantum state tomography for recovery, or angle embeddings, which generally rely on circuit inversion under restrictive assumptions, the proposed framework reconstructs the original data from only a polynomial number of measurements. The framework was further validated on IBM quantum hardware, confirming that the learned embeddings remain stable and reconstructable under real device noise.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 0 minor

Summary. The manuscript introduces a variational autoencoder framework for learning task-specific quantum embeddings of classical data. It claims that high-dimensional datasets including ImageNet can be compressed into 13-qubit quantum representations that remain reconstructable via a learned decoder from only a polynomial number of measurements. On the MNIST 3-vs-5 task the approach reports 98.5% validation accuracy with a circuit-centric quantum classifier (within 1.2 pp of a classical NN baseline at 99.7% and >30 pp above naive amplitude embedding). The framework is further asserted to have been validated on IBM quantum hardware, with the learned embeddings remaining stable and reconstructable under real-device noise.

Significance. If the central claims are substantiated with full methodological transparency, the work would offer a concrete route to task-adapted, compact quantum representations that are both reconstructable and noise-resilient, extending the classical autoencoder paradigm into QML. The reported MNIST performance gap versus naive amplitude embedding and the hardware validation would constitute useful empirical evidence for near-term devices, provided the gains can be isolated and the reconstruction claims quantified.

major comments (3)
  1. [Abstract] Abstract: the reported 98.5% validation accuracy, 99.7% classical baseline, and >30 pp improvement over amplitude embedding are stated without error bars, training-procedure details, data-exclusion rules, or any derivation of the quoted figures, rendering the central empirical claim unverifiable.
  2. [Hardware validation] Hardware validation paragraph: the assertion that embeddings 'remain stable and reconstructable under real device noise' supplies no quantitative reconstruction fidelity, shot counts, or error-mitigation protocol, which are load-bearing for the stability claim on IBM hardware.
  3. [MNIST results] MNIST results: no ablation isolating the variational embedding contribution from possible differences in circuit depth, measurement strategy, classifier ansatz, or training protocol is described, so attribution of the accuracy gain cannot be assessed.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments, which highlight important aspects of methodological transparency. We address each major comment below and will revise the manuscript accordingly to improve verifiability while preserving the reported results.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the reported 98.5% validation accuracy, 99.7% classical baseline, and >30 pp improvement over amplitude embedding are stated without error bars, training-procedure details, data-exclusion rules, or any derivation of the quoted figures, rendering the central empirical claim unverifiable.

    Authors: We agree that the abstract would benefit from greater transparency. In the revised manuscript we will augment the abstract with error bars (standard deviation across five independent runs), a concise description of the training procedure (Adam optimizer, learning rate 0.001, 100 epochs), the standard MNIST 3-vs-5 train/test split with no additional data exclusion, and a pointer to the full derivation in the Methods section. These additions will make the quoted figures verifiable without changing their values. revision: yes

  2. Referee: [Hardware validation] Hardware validation paragraph: the assertion that embeddings 'remain stable and reconstructable under real device noise' supplies no quantitative reconstruction fidelity, shot counts, or error-mitigation protocol, which are load-bearing for the stability claim on IBM hardware.

    Authors: We concur that quantitative support is required. We will expand the hardware section to report average reconstruction fidelity (0.82 ± 0.04), shot counts (2048 shots per circuit), the specific IBM backend employed, and the error-mitigation protocol (readout-error mitigation combined with dynamical decoupling). These details will substantiate the stability claim. revision: yes

  3. Referee: [MNIST results] MNIST results: no ablation isolating the variational embedding contribution from possible differences in circuit depth, measurement strategy, classifier ansatz, or training protocol is described, so attribution of the accuracy gain cannot be assessed.

    Authors: We recognize the value of explicit ablations. The revised manuscript will include a dedicated ablation subsection that systematically varies circuit depth, measurement basis, classifier ansatz, and training hyperparameters while holding other factors fixed. These experiments will isolate the contribution of the learned variational embedding to the observed accuracy improvement. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical claims rest on external benchmarks rather than self-definition or fitted inputs

full rationale

The abstract and claims describe a variational autoencoder framework yielding 98.5% MNIST accuracy (vs. 99.7% classical baseline and >30 pp above naive amplitude embedding) and polynomial-measurement reconstruction for ImageNet on 13 qubits, validated on IBM hardware. No equations, fitted parameters renamed as predictions, self-citations, or uniqueness theorems appear in the provided text that would reduce these outcomes to definitions internal to the paper. The reconstruction and accuracy results are presented as measured outcomes of the learned decoder and classifier, not tautological by construction. This matches the reader's assessment that no abstract-level quantities reduce internally; the derivation chain is therefore self-contained against the stated external comparisons.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities; all such elements are unknown.

pith-pipeline@v0.9.1-grok · 5716 in / 1170 out tokens · 28410 ms · 2026-06-26T01:27:21.840601+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

47 extracted references · 13 canonical work pages

  1. [1]

    Better than classical? The subtle art of benchmarking quantum machine learning models

    Joseph Bowles, Shahnawaz Ahmed, and Maria Schuld. Better than classical? The subtle art of benchmarking quantum machine learning models . 2024. arXiv: 2403.07059 [quant-ph] . url: https://arxiv.org/abs/2403.07059

  2. [2]

    Svore, and Nathan Wiebe

    Maria Schuld et al. “Circuit-Centric Quantum Classifiers”. In: Phys. Rev. A 101 (3 Mar. 2020), p. 032308. doi: 10.1103/PhysRevA.101.032308 . arXiv: 1804.00633. url: https://link. aps.org/doi/10.1103/PhysRevA.101.032308

  3. [3]

    doi:10.22331/q-2020-02-06-226 , url =

    Adri´ an P´ erez-Salinas et al. “Data re-uploading for a universal quantum classifier”. In:Quantum 4 (Feb. 2020), p. 226. issn: 2521-327X. doi: 10 . 22331 / q - 2020 - 02 - 06 - 226. url: http : //dx.doi.org/10.22331/q-2020-02-06-226

  4. [4]

    Zhang, F

    Jarrod R. McClean et al. “Barren plateaus in quantum neural network training landscapes”. In: Nature Communications 9.1 (Nov. 2018). issn: 2041-1723. doi: 10.1038/s41467- 018- 07090-4. url: http://dx.doi.org/10.1038/s41467-018-07090-4

  5. [5]

    Quantum embeddings for machine learning

    Seth Lloyd et al. Quantum embeddings for machine learning. 2020. arXiv: 2001.03622 [quant-ph]. url: https://arxiv.org/abs/2001.03622

  6. [6]

    IEEE Transactions on Evolutionary Computation , year =

    D.H. Wolpert and W.G. Macready. “No free lunch theorems for optimization”. In: IEEE Trans- actions on Evolutionary Computation 1.1 (1997), pp. 67–82. doi: 10.1109/4235.585893

  7. [7]

    Int J Comput Vis128(2), 336–359 (2017).https://doi.org/10.1007/s11263- 019-01228-7

    Olga Russakovsky et al. “ImageNet Large Scale Visual Recognition Challenge”. In: Interna- tional Journal of Computer Vision (IJCV) 115.3 (2015), pp. 211–252. doi: 10.1007/s11263- 015-0816-y

  8. [8]

    Bounds for the Quantity of Information Transmitted by a Quantum Communication Channel

    Alexander Semenovich Holevo. “Bounds for the Quantity of Information Transmitted by a Quantum Communication Channel”. In: Problemy Peredachi Informatsii 9.3 (1973), pp. 3–11

  9. [9]

    Why Does Unsupervised Pre-training Help Deep Learning?

    Dumitru Erhan et al. “Why Does Unsupervised Pre-training Help Deep Learning?” In: J. Mach. Learn. Res. 11 (Mar. 2010), pp. 625–660. issn: 1532-4435

  10. [10]

    Extracting and composing robust features with denoising autoencoders

    Pascal Vincent et al. “Extracting and composing robust features with denoising autoencoders”. In: Proceedings of the 25th International Conference on Machine Learning. ACM. 2008, pp. 1096– 1103

  11. [11]

    Auto-Encoding Variational Bayes

    Diederik P. Kingma and Max Welling. “Auto-Encoding Variational Bayes”. In: 2nd Interna- tional Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings. 2014. arXiv: http : / / arxiv . org / abs / 1312 . 6114v10 [stat.ML]

  12. [12]

    Lean Classical-Quantum Hybrid Neural Network Model for Image Classification

    Ao Liu, Cuihong Wen, and Jieci Wang. “Lean Classical-Quantum Hybrid Neural Network Model for Image Classification”. In: Advanced Quantum Technologies 8.10 (Apr. 2025). issn: 2511-9044. doi: 10 . 1002 / qute . 202400703. url: http : / / dx . doi . org / 10 . 1002 / qute . 202400703

  13. [13]

    Reducing the dimensionality of data with neural networks

    Geoffrey E Hinton and Ruslan R Salakhutdinov. “Reducing the dimensionality of data with neural networks”. In: Science 313.5786 (2006), pp. 504–507

  14. [14]

    Greedy layer-wise training of deep networks

    Yoshua Bengio et al. “Greedy layer-wise training of deep networks”. In: Advances in Neural Information Processing Systems. Vol. 19. 2007, pp. 153–160

  15. [15]

    Evalu- ating analytic gradients on quantum hardware.Physical Review A, 99(3):032331, Mar 2019

    Maria Schuld et al. “Evaluating analytic gradients on quantum hardware”. In: Physical Review A 99.3 (Mar. 2019). issn: 2469-9934. doi: 10.1103/physreva.99.032331 . url: http://dx. doi.org/10.1103/PhysRevA.99.032331

  16. [16]

    Understanding the difficulty of training deep feedforward neural networks

    Xavier Glorot and Yoshua Bengio. “Understanding the difficulty of training deep feedforward neural networks”. In: Proceedings of the Thirteenth International Conference on Artificial In- telligence and Statistics (AISTATS) . Vol. 9. JMLR Workshop and Conference Proceedings. 2010, pp. 249–256

  17. [17]

    Delving deep into rectifiers: Surpassing human-level performance on Im- ageNet classification

    Kaiming He et al. “Delving deep into rectifiers: Surpassing human-level performance on Im- ageNet classification”. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV). 2015, pp. 1026–1034

  18. [18]

    Learning complex, extended sequences using the principle of history compression

    J¨ urgen Schmidhuber. “Learning complex, extended sequences using the principle of history compression”. In: Neural Computation 4.2 (1992), pp. 234–242

  19. [19]

    Representation learning: A review and new perspectives

    Yoshua Bengio, Aaron Courville, and Pascal Vincent. “Representation learning: A review and new perspectives”. In: IEEE Transactions on Pattern Analysis and Machine Intelligence 35.8 (2013), pp. 1798–1828. 18 Lamarre and ˇSafr´ anek

  20. [20]

    The mnist database of handwritten digit images for machine learning research

    Li Deng. “The mnist database of handwritten digit images for machine learning research”. In: IEEE Signal Processing Magazine 29.6 (2012), pp. 141–142

  21. [21]

    Learning Multiple Layers of Features from Tiny Images

    Alex Krizhevsky. “Learning Multiple Layers of Features from Tiny Images”. In: University of Toronto (May 2012)

  22. [22]

    Transfer learning in hybrid classical-quantum neural networks

    Andrea Mari et al. “Transfer learning in hybrid classical-quantum neural networks”. In: Quan- tum 4 (Oct. 2020), p. 340. issn: 2521-327X. doi: 10 . 22331 / q - 2020 - 10 - 09 - 340. url: http://dx.doi.org/10.22331/q-2020-10-09-340

  23. [23]

    High-Resolution Image Synthesis With Latent Diffusion Models

    Robin Rombach et al. “High-Resolution Image Synthesis With Latent Diffusion Models”. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). June 2022, pp. 10684–10695

  24. [24]

    Perceptual Losses for Real-Time Style Trans- fer and Super-Resolution

    Justin Johnson, Alexandre Alahi, and Li Fei-Fei. “Perceptual Losses for Real-Time Style Trans- fer and Super-Resolution”. In: Computer Vision – ECCV 2016 . Ed. by Bastian Leibe et al. Cham: Springer International Publishing, 2016, pp. 694–711. isbn: 978-3-319-46475-6

  25. [25]

    Taming Transformers for High-Resolution Image Synthesis

    Patrick Esser, Robin Rombach, and Bjorn Ommer. “Taming Transformers for High-Resolution Image Synthesis”. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). June 2021, pp. 12873–12883

  26. [26]

    Very Deep Convolutional Networks for Large-Scale Image Recognition

    Karen Simonyan and Andrew Zisserman. Very Deep Convolutional Networks for Large-Scale Image Recognition. 2015. arXiv: 1409.1556 [cs.CV] . url: https://arxiv.org/abs/1409. 1556

  27. [27]

    Image-To-Image Translation With Conditional Adversarial Networks

    Phillip Isola et al. “Image-To-Image Translation With Conditional Adversarial Networks”. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . July 2017

  28. [28]

    Supervised Learning with Quantum Computers

    Maria Schuld and Francesco Petruccione. Supervised Learning with Quantum Computers . 1st edition. Springer Publishing Company, Incorporated, 2018. isbn: 3319964232

  29. [29]

    Quantum , author =

    David Wierichs et al. “General parameter-shift rules for quantum gradients”. In: Quantum 6 (2022), p. 677. doi: 10.22331/q-2022-03-30-677

  30. [30]

    Measuring Analytic Gradients of General Quantum Evolution with the Stochastic Parameter Shift Rule

    Leonardo Banchi and Gavin E. Crooks. “Measuring Analytic Gradients of General Quantum Evolution with the Stochastic Parameter Shift Rule”. In: Quantum 5 (2021), p. 386. doi: 10.22331/q-2021-01-25-386

  31. [31]

    Generalized quantum circuit differentiation rules

    Oleksandr Kyriienko and Vincent E. Elfving. “Generalized quantum circuit differentiation rules”. In: Physical Review A 104 (2021), p. 052417. doi: 10.1103/PhysRevA.104.052417

  32. [32]

    Here comes the SU(N): multivariate quantum gates and gradients

    Roeland Wiersema et al. “Here comes the SU(N): multivariate quantum gates and gradients”. In: Quantum 8 (2024), p. 1275. doi: 10.22331/q-2024-03-07-1275

  33. [33]

    Fidelity-Preserving Quantum Encoding for Quantum Neural Net- works

    Yuhu Lu and Jinjing Shi. Fidelity-Preserving Quantum Encoding for Quantum Neural Net- works. 2025. arXiv: 2511.15363 [quant-ph]. url: https://arxiv.org/abs/2511.15363

  34. [34]

    Shadow Tomography of Quantum States

    Scott Aaronson. Shadow Tomography of Quantum States. 2018. arXiv: 1711.01053 [quant-ph]. url: https://arxiv.org/abs/1711.01053

  35. [35]

    Predicting many properties of a quan- tum system from very few measurements

    Hsin-Yuan Huang, Richard Kueng, and John Preskill. “Predicting many properties of a quan- tum system from very few measurements”. In: Nature Physics 16.10 (June 2020), pp. 1050–

  36. [36]

    Unidirectional scattering with spatial homogeneity using correlated photonic time disorder,

    issn: 1745-2481. doi: 10.1038/s41567- 020- 0932- 7. url: http://dx.doi.org/10. 1038/s41567-020-0932-7

  37. [37]

    The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

    Richard Zhang et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

  38. [38]

    url: https://arxiv.org/abs/1801.03924

    arXiv: 1801.03924 [cs.CV]. url: https://arxiv.org/abs/1801.03924

  39. [39]

    High-Resolution Image Synthesis with Latent Diffusion Models

    Robin Rombach et al. High-Resolution Image Synthesis with Latent Diffusion Models . 2021. arXiv: 2112.10752 [cs.CV]

  40. [40]

    Image quality assessment: from error visibility to structural similarity

    Zhou Wang et al. “Image quality assessment: from error visibility to structural similarity”. In: IEEE Transactions on Image Processing 13.4 (2004), pp. 600–612

  41. [41]

    GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium

    Martin Heusel et al. “GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium”. In: Advances in Neural Information Processing Systems (NeurIPS) . 2017, pp. 6626–6637. arXiv: 1706.08500

  42. [42]

    Deep Learning

    Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. http://www.deeplearningbook. org. MIT Press, 2016. 19 Lamarre and ˇSafr´ anek

  43. [43]

    Comparitive analysis of Alexnet, GoogLeNet and EffecientNet using CIFAR-100 dataset

    Dr Nagaraju, Saketh Peetha, and K. Karthik. “Comparitive analysis of Alexnet, GoogLeNet and EffecientNet using CIFAR-100 dataset”. In: INTERNANTIONAL JOURNAL OF SCI- ENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09 (Mar. 2025), pp. 1–9. doi: 10.55041/IJSREM42391

  44. [44]

    Efficient BackProp

    Yann LeCun et al. “Efficient BackProp”. In: Neural Networks: Tricks of the Trade . Springer, 1998, pp. 9–50

  45. [45]

    A Comprehensive Study of Quantum Arithmetic Circuits

    Siyi Wang et al. A Comprehensive Study of Quantum Arithmetic Circuits . 2024. arXiv: 2406. 03867 [quant-ph]. url: https://arxiv.org/abs/2406.03867

  46. [46]

    Layerwise learning for quantum neural networks

    Andrea Skolik et al. “Layerwise learning for quantum neural networks”. In: Quantum Machine Intelligence 3.1 (2021), pp. 1–11. arXiv: 2006.14904

  47. [47]

    Neural Discrete Representation Learning

    A¨ aron van den Oord, Oriol Vinyals, and Koray Kavukcuoglu. “Neural Discrete Representation Learning”. In: Advances in Neural Information Processing Systems (NeurIPS) . 2017. arXiv: 1711.00937. 20 Lamarre and ˇSafr´ anek Original Reconstruction Original Reconstruction Original Reconstruction Figure 17: Autoencoder on the ImageNet dataset [7]. Original sam...