
arxiv: 2503.19002 · v2 · pith:H2VBK6F2new · submitted 2025-03-24 · 🪐 quant-ph · cs.LG

Quantum Complex-Valued Self-Attention Model

classification 🪐 quant-ph cs.LG
keywords quantum · self-attention · complex-valued · qcsam · states · attention · fashion-mnist · framework

Self-attention has revolutionized classical machine learning, yet existing quantum self-attention models underutilize quantum states' potential due to oversimplified or incomplete mechanisms. To address this limitation, we introduce the Quantum Complex-Valued Self-Attention Model (QCSAM), the first framework to leverage complex-valued similarities, which captures amplitude and phase relationships between quantum states more comprehensively. To achieve this, QCSAM extends the Linear Combination of Unitaries (LCUs) into the Complex LCUs (CLCUs) framework, enabling precise complex-valued weighting of quantum states and supporting quantum multi-head attention. Experiments on MNIST and Fashion-MNIST show that QCSAM outperforms recent quantum self-attention models, including QKSAN, QSAN, and GQHAN. With only 4 qubits, QCSAM achieves 100% and 99.2% test accuracies on MNIST and Fashion-MNIST, respectively. Furthermore, we evaluate scalability across 3-8 qubits and 2-4 class tasks, while ablation studies validate the advantages of complex-valued attention weights over real-valued alternatives. This work advances quantum machine learning by enhancing the expressiveness and precision of quantum self-attention in a way that aligns with the inherent complexity of quantum mechanics.
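The abstract's contrast between complex-valued and real-valued attention weights can be illustrated classically. The NumPy sketch below is not the paper's CLCU circuit; the function names `complex_similarity` and `complex_weighted_sum` are hypothetical, and state vectors are simulated as plain arrays. It only shows that the complex inner product between two states retains a phase that the real-valued overlap discards, and that such complex numbers can serve as weights in a normalized linear combination of states.

```python
import numpy as np

def complex_similarity(psi, phi):
    """Complex inner product <psi|phi>; np.vdot conjugates its first argument."""
    return np.vdot(psi, phi)

def complex_weighted_sum(weights, states):
    """Normalized complex linear combination sum_j w_j |v_j> (illustrative only)."""
    out = sum(w * v for w, v in zip(weights, states))
    return out / np.linalg.norm(out)

# A query state and two key states that differ from each other only in relative phase.
query = np.array([1, 1], dtype=complex) / np.sqrt(2)
key_a = np.array([1, 1j], dtype=complex) / np.sqrt(2)
key_b = np.array([1, -1j], dtype=complex) / np.sqrt(2)

s_a = complex_similarity(query, key_a)
s_b = complex_similarity(query, key_b)

# The real-valued overlaps |<q|k>|^2 coincide, so they cannot tell the keys apart.
print("real overlaps:", abs(s_a) ** 2, abs(s_b) ** 2)
# The complex similarities differ in phase, which keeps that information.
print("phases:", np.angle(s_a), np.angle(s_b))

# Using the complex similarities as weights on the two key states.
mixed = complex_weighted_sum([s_a, s_b], [key_a, key_b])
print("mixed state norm:", np.linalg.norm(mixed))
```

On quantum hardware a non-unitary combination like this would have to be realized through a scheme such as a linear combination of unitaries, which is the role the abstract assigns to the CLCUs framework; the sketch makes no claim about how the paper implements it.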



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Quantum Parameterized Self-Attention Network for Image Classification

    quant-ph 2026-05 unverdicted novelty 6.0

    QPSAN implements self-attention via PQCs with 5 parameters, establishes a theoretical framework for its scoring properties, and reports outperformance over ViT on four vision datasets that grows with data complexity.