Hybrid Classical-Quantum Neural Networks for Multi-Characteristic Co-Optimization of Recessed-Gate AlGaN/GaN MIS-HEMTs

Doan Viet Nguyen; Nan-Yow Chen; Niall Tumilty; Pei-Jie Chang; Rushat Rai; Simon See; Tai-Yue Li; Tian-Li Wu; Wen-Jay Lee; Yuan-Chieh Chiu

arxiv: 2605.27420 · v1 · pith:6TUG2SLSnew · submitted 2026-05-19 · 🪐 quant-ph · physics.app-ph

Hybrid Classical-Quantum Neural Networks for Multi-Characteristic Co-Optimization of Recessed-Gate AlGaN/GaN MIS-HEMTs

Rushat Rai , Pei-Jie Chang , Doan Viet Nguyen , Yuan-Chieh Chiu , Niall Tumilty , Yun-Yuan Wang , Simon See , Wen-Jay Lee

show 3 more authors

Tai-Yue Li Nan-Yow Chen Tian-Li Wu

This is my paper

Pith reviewed 2026-06-30 18:20 UTC · model grok-4.3

classification 🪐 quant-ph physics.app-ph

keywords hybrid quantum neural networkAlGaN/GaN MIS-HEMTdevice modelingmulti-characteristic optimizationquantum circuit ansatzprocess variabilityneural network comparisonexperimental semiconductor data

0 comments

The pith

Hybrid quantum-classical neural network reduces modeling error for GaN transistors by 24.4 percent on experimental data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper seeks to demonstrate that a hybrid classical-quantum neural network can jointly predict six electrical characteristics of recessed-gate AlGaN/GaN MIS-HEMTs from a 24-dimensional process vector more accurately than a classical artificial neural network. It screens multiple quantum circuit templates on data from 468 fabricated devices across 17 process splits, then selects and evaluates one specific circuit. The selected model lowers overall normalized root mean square error by 24.4 percent, with measurable drops in root mean square error for threshold voltages, subthreshold swing, and drain current. A sympathetic reader would care because semiconductor process data is expensive to obtain and contains variability that pure simulations miss, so better predictive models could speed up device optimization. The work also reports that circuit depth, parameter count, and two-qubit gate count correlate with improved accuracy while expressibility correlates negatively.

Core claim

On 468 experimental fabricated devices spanning 17 process splits, the selected HQNN, Circuit (13, 5) at L = 2, reduces overall normalized root mean square error by 24.4 percent relative to ANN, with target-wise improvements including Vth,lin RMSE from 0.297 V to 0.270 V, Vth,rev from 0.278 V to 0.263 V, DeltaVth from 0.049 V to 0.045 V, SS from 22.22 mV/dec to 19.87 mV/dec, and Id from 5.75e-8 A to 4.35e-8 A, while Ion remains competitive.

What carries the argument

Hybrid classical-quantum neural network (HQNN) built from screened quantum-circuit templates that processes a 24-dimensional fabrication vector to predict six electrical targets simultaneously.

If this is right

Performance improves when circuit depth, parameter count, and two-qubit gate count are increased.
Controlled-rotation entanglers outperform static CNOT-based circuits across the screened templates.
Expressibility measured by DKL correlates negatively with accuracy on this dataset.
A depolarizing-noise study indicates that similar HQNNs remain trainable or deployable on near-term quantum hardware.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same screening approach could be applied to other semiconductor technologies that produce costly experimental datasets with process variability.
The negative correlation between expressibility and accuracy suggests that, for this task, circuits that are too expressive may overfit the limited experimental samples.
Retraining the selected circuit on new process splits held out from the original 17 would test whether the accuracy gain generalizes beyond the training distribution.

Load-bearing premise

The observed accuracy gains come from properties of the quantum circuits rather than from differences in total parameter count, network depth, or other hyperparameter choices.

What would settle it

A direct comparison in which a classical network is given exactly the same number of trainable parameters and layers as the HQNN and still shows higher error on the same 468-device dataset would falsify the claim that the quantum component supplies the advantage.

Figures

Figures reproduced from arXiv: 2605.27420 by Doan Viet Nguyen, Nan-Yow Chen, Niall Tumilty, Pei-Jie Chang, Rushat Rai, Simon See, Tai-Yue Li, Tian-Li Wu, Wen-Jay Lee, Yuan-Chieh Chiu, Yun-Yuan Wang.

**Figure 2.** Figure 2: Typical IDS–VGS transfer curve of a recessed-gate AlGaN/GaN MIS-HEMT. Electrical characterization. Transfer characteristics were measured with a gate voltage (VG) sweep from -8 V to +4 V (forward) and back (reverse), yielding both the on-to-off and off-to-on branches of the IDS–VGS curve ( [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Wafer map showing die positions encoded as Cartesian [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Strict-bottleneck HQNN backbone used for the single-template sweep and descriptor-based ablation analysis. The classical encoder compresses 24 process features to a 4-dimensional latent code, the PQC performs a structured nonlinear transformation, and a minimal linear head maps 12 Pauli expectations to 6 device targets. The dual-branch hybrid used for mixed-circuit model selection is described in the text.… view at source ↗

**Figure 5.** Figure 5: Quantum processing pipeline for a representative 4 [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗

**Figure 6.** Figure 6: Wafer-level means (left) and global parity plots (right) for the selected HQNN configuration: Circuit (13, 5) (L = 2); overall R² = 0.9229. Among the tested baselines, Circuit (13, 5) yields the lowest aggregate error, reducing overall nRMSE from 2395.6 for the ANN to 1811.4, a ~24% reduction. As summarized visually in [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 7.** Figure 7: Comparison of overall nRMSE (IQR-macro) and per-target raw RMSE for the selected HQNN and the classical baselines. Lower values indicate better predictive accuracy [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗

read the original abstract

Optimizing recessed-gate AlGaN/GaN MIS-HEMTs requires accurate multi-characteristic models, but experimental semiconductor datasets remain costly and encode process-induced variability that simulations cannot faithfully reproduce. This work proposes a hybrid classical-quantum neural network (HQNN) for joint optimization of six electrical targets from a 24-dimensional fabrication/process vector. We systematically screen quantum-circuit templates to extract circuit-design guidance, then select a final HQNN and compare it directly with classical baselines. On 468 experimental fabricated devices spanning 17 process splits, the selected HQNN, Circuit (13, 5) at L = 2, reduces overall normalized root mean square error (nRMSE) by 24.4% relative to ANN. Target-wise, the HQNN lowers Vth,lin RMSE from 0.297 V to 0.270 V, Vth,rev RMSE from 0.278 V to 0.263 V, DeltaVth RMSE from 0.049 V to 0.045 V, SS RMSE from 22.22 mV/dec to 19.87 mV/dec, and Id RMSE from 5.75 x 10^-8 A to 4.35 x 10^-8 A, while Ion RMSE remains competitive (0.053 A vs. 0.056 A). Controlled ansatz ablations further show that performance depends strongly on architecture: parameter count, depth, and two-qubit gate count correlate positively with accuracy, expressibility (DKL) correlates negatively, and controlled-rotation entanglers outperform static controlled-NOT (CNOT)-based circuits in aggregate. A depolarizing-noise study on a representative 4-qubit circuit further suggests that comparable HQNNs may be trainable or deployable on near-term quantum hardware.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

HQNN cuts error 24% vs ANN on 468 GaN devices but the classical baseline may not match parameter count.

read the letter

The paper applies existing hybrid quantum-classical networks to predicting six electrical targets for recessed-gate AlGaN/GaN MIS-HEMTs from a 24-dimensional process vector. On 468 real fabricated devices across 17 splits, their selected circuit (13,5) at depth 2 lowers overall nRMSE by 24.4% relative to an ANN, with concrete drops on Vth,lin, Vth,rev, DeltaVth, SS, and Id.

What is new is the controlled screening of multiple quantum ansatzes on this specific experimental dataset, plus ablations that tie accuracy to parameter count, depth, two-qubit gates, and entangler choice. The noise study on a 4-qubit circuit is a practical addition. Using actual device measurements rather than simulation is the clearest strength.

The soft spot is the ANN baseline. The abstract states that performance tracks strongly with parameter count and depth, yet gives no numbers showing the classical model was sized to the same total trainable parameters as the HQNN. If the quantum version simply has more capacity, the reported gains could be classical rather than from entanglement or expressibility. The stress-test note flags exactly this, and nothing in the abstract resolves it.

This is for readers working on quantum ML for device or process modeling, or anyone needing regression benchmarks on real semiconductor data. It shows clear thinking and honest engagement with the experimental constraints. It deserves peer review because the dataset and ablations are grounded, even if the capacity comparison needs tightening.

Referee Report

3 major / 1 minor

Summary. The manuscript proposes hybrid classical-quantum neural networks (HQNNs) for joint modeling of six electrical targets (Vth,lin, Vth,rev, DeltaVth, SS, Id, Ion) of recessed-gate AlGaN/GaN MIS-HEMTs from a 24-dimensional process vector. Using data from 468 experimental devices across 17 process splits, it screens quantum-circuit ansatze, selects Circuit (13,5) at depth L=2, and reports a 24.4% overall nRMSE reduction versus a classical ANN baseline, with target-wise RMSE improvements (e.g., Vth,lin: 0.297 V to 0.270 V; Id: 5.75e-8 A to 4.35e-8 A). Ablations link accuracy to parameter count, depth, and two-qubit gates, while a depolarizing-noise study suggests near-term hardware viability.

Significance. If the reported accuracy gains are shown to arise from quantum-circuit properties rather than unmatched model capacity, the work would provide concrete evidence of HQNN utility for multi-objective semiconductor device modeling on limited experimental datasets. The systematic ansatz screening, explicit correlation of expressibility/entanglement metrics with performance, and use of real fabricated-device data (rather than simulation) are strengths that could guide future quantum-classical co-design in electronics.

major comments (3)

[Abstract] Abstract (comparison paragraph): The central claim of 24.4% nRMSE reduction (and target-wise RMSE drops such as Vth,lin 0.297 V to 0.270 V) is load-bearing for asserting HQNN superiority, yet no information is given on whether the ANN baseline was matched in total trainable parameters to Circuit (13,5) at L=2. The abstract itself states that performance depends strongly on parameter count, depth, and two-qubit gate count; without an explicit capacity-matched ablation, the observed gains cannot be attributed to quantum expressibility or entanglement rather than classical model size.
[Abstract] Abstract (results and methods description): The reported numerical improvements on 468 devices lack accompanying statistical tests, confidence intervals, or error bars on the RMSE values, and provide no details on data splits, cross-validation procedure, or hyperparameter tuning protocol for either model. These omissions directly affect verifiability of whether the target-wise reductions are robust or could arise from random variation or overfitting.
[Abstract] Abstract (ablations paragraph): The positive correlation of accuracy with parameter count is noted, but the manuscript does not report the actual parameter counts for the selected HQNN versus the ANN, nor does it include a controlled experiment at fixed parameter budget. This leaves open the possibility that the 24.4% aggregate improvement is explained by capacity differences rather than the quantum-circuit features highlighted in the ablations.

minor comments (1)

[Abstract] The notation 'Circuit (13, 5)' is used without an explicit definition of what the two numbers index (e.g., qubit count and entangling-gate count); a brief parenthetical clarification would improve readability.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments emphasizing the need for capacity-matched baselines and improved statistical reporting. We address each point below and will revise the manuscript accordingly.

read point-by-point responses

Referee: The central claim of 24.4% nRMSE reduction is load-bearing for asserting HQNN superiority, yet no information is given on whether the ANN baseline was matched in total trainable parameters to Circuit (13,5) at L=2. Without an explicit capacity-matched ablation, the observed gains cannot be attributed to quantum expressibility or entanglement rather than classical model size.

Authors: We agree that the absence of explicit parameter counts and a capacity-matched comparison weakens the attribution of gains to quantum features. Although the manuscript reports ablations correlating accuracy with parameter count, depth, and two-qubit gates, it does not tabulate the trainable parameters of the final HQNN versus ANN. We will revise the abstract and main text to report these counts and add a controlled ablation at matched parameter budgets. revision: yes
Referee: The reported numerical improvements on 468 devices lack accompanying statistical tests, confidence intervals, or error bars on the RMSE values, and provide no details on data splits, cross-validation procedure, or hyperparameter tuning protocol for either model.

Authors: The manuscript describes the 468-device dataset and 17 process splits but does not detail the train/test protocol, cross-validation, hyperparameter search, or provide error bars/CIs. We will expand the methods section with these details and add statistical measures (e.g., standard deviation across runs or bootstrap CIs) to the reported RMSE values. revision: yes
Referee: The manuscript does not report the actual parameter counts for the selected HQNN versus the ANN, nor does it include a controlled experiment at fixed parameter budget. This leaves open the possibility that the 24.4% aggregate improvement is explained by capacity differences rather than the quantum-circuit features.

Authors: This overlaps with the first comment. We will explicitly report parameter counts for both models and include a fixed-budget comparison to isolate quantum-circuit contributions (e.g., entanglement type) from capacity effects, building on the existing architecture ablations. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical ML comparison on held-out experimental data

full rationale

The paper trains HQNN and ANN models via supervised learning on 468 fabricated devices to predict six electrical targets from 24 process parameters. Reported nRMSE reductions are direct empirical outcomes of that training and comparison, not algebraic identities or self-definitions. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from prior author work are invoked to force the result. Architecture ablations are presented as controlled experiments, not as derivations that collapse to the inputs. This is standard non-circular supervised modeling.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on empirical fitting of a neural network with many trainable parameters to a finite experimental dataset; no new physical entities are introduced. Since only the abstract is available, the ledger is necessarily incomplete.

free parameters (2)

HQNN trainable parameters
Weights, rotation angles, and classical layer coefficients are fitted to the 468-device dataset to minimize the multi-target loss.
Circuit depth L
Selected value of 2 after screening; affects expressibility and parameter count.

axioms (1)

domain assumption The 468 experimental measurements across 17 process splits are representative and sufficient to train and evaluate the model without severe overfitting or selection bias.
Implicit in reporting aggregate and target-wise performance metrics on the collected data.

pith-pipeline@v0.9.1-grok · 5920 in / 1577 out tokens · 44685 ms · 2026-06-30T18:20:52.449521+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

3 extracted references · 2 canonical work pages · 1 internal anchor

[1]

Table-Based Nonlinear HEMT Model Extracted from Time-Domain Large-Signal Measurements

M. C. Curras-Francos, “Table-Based Nonlinear HEMT Model Extracted from Time-Domain Large-Signal Measurements”, IEEE Transactions on Microwave Theory and Techniques, vol. 53, no. 5, pp. 1593–1600, 2005. [4] J. Xu, D. Gunyan, M. Iwamoto, A. Cognata, D. E. Root, “Measurement-Based Non-Quasi-Static Large-Signal FET Model Using Artificial Neural Networks”, in ...

work page arXiv 2005
[2]

Exponential quantum advantage in processing massive classical data

V. Havlíček, A. D. Córcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, J. M. Gambetta, “Supervised Learning with Quantum-Enhanced Feature Spaces”, Nature, vol. 567, no. 7747, pp. 209–212, 2019. [15] J. R. Glick, T. P. Gujarati, A. D. Córcoles, Y. Kim, A. Kandala, J. M. Gambetta, K. Temme, “Covariant Quantum Kernels for Data with Group Structure”, Na...

work page internal anchor Pith review Pith/arXiv arXiv 2019
[3]

CUDA Quantum: The Platform for Integrated Quantum-Classical Computing

J. Kim, A. McCaskey, B. Heim, M. Modani, S. Stanwyck, T. Costa, “CUDA Quantum: The Platform for Integrated Quantum-Classical Computing”, 2023 60th ACM/IEEE Design Automation Conference (DAC), pp. 1–4, 2023. Appendix A Table 3: Variational ansatz template inventory for Q = 4 qubits [18]. RX(i), RY(i), RZ(i), H(i): single-qubit gates on qubit i. CX(c,t), CZ...

2023

[1] [1]

Table-Based Nonlinear HEMT Model Extracted from Time-Domain Large-Signal Measurements

M. C. Curras-Francos, “Table-Based Nonlinear HEMT Model Extracted from Time-Domain Large-Signal Measurements”, IEEE Transactions on Microwave Theory and Techniques, vol. 53, no. 5, pp. 1593–1600, 2005. [4] J. Xu, D. Gunyan, M. Iwamoto, A. Cognata, D. E. Root, “Measurement-Based Non-Quasi-Static Large-Signal FET Model Using Artificial Neural Networks”, in ...

work page arXiv 2005

[2] [2]

Exponential quantum advantage in processing massive classical data

V. Havlíček, A. D. Córcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, J. M. Gambetta, “Supervised Learning with Quantum-Enhanced Feature Spaces”, Nature, vol. 567, no. 7747, pp. 209–212, 2019. [15] J. R. Glick, T. P. Gujarati, A. D. Córcoles, Y. Kim, A. Kandala, J. M. Gambetta, K. Temme, “Covariant Quantum Kernels for Data with Group Structure”, Na...

work page internal anchor Pith review Pith/arXiv arXiv 2019

[3] [3]

CUDA Quantum: The Platform for Integrated Quantum-Classical Computing

J. Kim, A. McCaskey, B. Heim, M. Modani, S. Stanwyck, T. Costa, “CUDA Quantum: The Platform for Integrated Quantum-Classical Computing”, 2023 60th ACM/IEEE Design Automation Conference (DAC), pp. 1–4, 2023. Appendix A Table 3: Variational ansatz template inventory for Q = 4 qubits [18]. RX(i), RY(i), RZ(i), H(i): single-qubit gates on qubit i. CX(c,t), CZ...

2023