Recognition: 1 theorem link
· Lean TheoremQuPort: Topology-, Port-, and Congestion-Aware Compilation for Modular Multi-QPU Quantum Systems
Pith reviewed 2026-05-14 20:41 UTC · model grok-4.3
The pith
QuPort's TPCCAP finds circuit partitions for multi-QPU systems that jointly minimize cut distance, port overflow, and link congestion on a three-level graph.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that an explicit three-level model together with the TPCCAP objective (weighted cut distance plus port overflow plus routed link load) produces mappings that respect both intra-QPU topology and inter-QPU communication limits, and that the supporting steps of heavy-edge clustering, balanced greedy partitioning, simulated-annealing refinement, port-aware layout, and remote-gate extraction are sufficient to realize those mappings.
What carries the argument
TPCCAP, the partitioning routine that optimizes the combined objective of weighted cut distance, communication-port overflow, and routed link-load congestion on the three-level graph model.
If this is right
- Fewer remote two-qubit operations need to be scheduled across QPUs.
- Communication ports are less likely to exceed their capacity.
- Interconnect links carry more balanced traffic loads.
- Local routing inside each QPU can proceed independently once the partition is fixed.
- Schedule estimates can incorporate the actual topology of both intra- and inter-QPU links.
Where Pith is reading between the lines
- The same three-level objective could be reused for dynamic re-partitioning if hardware allows mid-circuit qubit movement.
- Weight tuning experiments on synthetic benchmarks with varying port counts would reveal which term dominates for realistic interconnect densities.
- Extending the model to include qubit reset or measurement costs could further tighten the link between compiler output and hardware runtime.
- Classical distributed-memory compilers face analogous port and congestion constraints; the TPCCAP structure might transfer directly.
Load-bearing premise
That minimizing the weighted combination of cut distance, port overflow, and congestion on the abstract three-level graphs produces mappings that improve actual execution on modular hardware.
What would settle it
Compile the same circuit with TPCCAP and with a standard single-graph partitioner, then execute both versions on a calibrated multi-QPU simulator or hardware testbed and compare measured total runtime or error rate.
Figures
read the original abstract
Modular quantum processors require a compiler to reason about two resources at the same time: local device connectivity and communication across QPUs. A mapping that is acceptable on a single coupling graph may be unsuitable for a modular machine if it creates excessive cross-QPU traffic, concentrates that traffic on a small number of interconnect links, or assigns many boundary qubits to a QPU with few communication ports. This paper presents QuPort, a Python and Qiskit-based compilation framework that studies this setting through an explicit three-level model: a weighted logical interaction graph, a directed physical coupling map, and an undirected QPU-level interconnect graph. The main partitioning method, TPCCAP, optimizes the implemented objective formed by weighted cut distance, communication-port overflow, and routed link-load congestion. The framework also includes heavy-edge clustering, balanced greedy partitioning, simulated-annealing refinement, communication-port-aware layout, extraction of remote two-qubit operations, local-only routing of per-QPU circuits, and topology-aware schedule estimation. The model is a compiler-level abstraction. It does not claim a calibrated hardware runtime or an implementation of a physical remote-gate protocol.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces QuPort, a Qiskit-based compilation framework for modular multi-QPU quantum systems. It defines a three-level graph model (weighted logical interaction graph, directed physical coupling map, undirected QPU interconnect graph) and presents TPCCAP as the main partitioning algorithm that optimizes an explicit objective combining weighted cut distance, communication-port overflow, and routed link-load congestion. The framework additionally incorporates heavy-edge clustering, balanced greedy partitioning, simulated-annealing refinement, port-aware layout, remote two-qubit operation extraction, local-only per-QPU routing, and topology-aware schedule estimation. The work is framed strictly as a compiler-level abstraction without claims of calibrated hardware runtimes or physical remote-gate protocols.
Significance. The explicit three-level model and the joint optimization of cut distance, port overflow, and congestion address a timely compiler challenge for scaling quantum systems via modularity. The parameter-free formulation of the objective in terms of directly measurable graph quantities is a strength that could serve as a reusable foundation for future modular compilers. However, the absence of any quantitative benchmarks or baseline comparisons limits the demonstrated impact to the design description alone.
major comments (1)
- [Evaluation section] Evaluation section: the manuscript reports no quantitative benchmarks, comparisons to existing partitioners, or metrics (e.g., achieved objective values, port utilization, or congestion levels) on any circuit suite. Because the central claim is that TPCCAP optimizes the stated objective, the lack of even synthetic-graph results or ablation studies on the three components makes it impossible to verify that the implemented algorithm produces the intended improvements.
minor comments (2)
- [Abstract] Abstract: the phrase 'the implemented objective' is used without an equation or explicit weight values; adding a compact definition (e.g., min w1·cut + w2·overflow + w3·load) would improve immediate readability.
- [Model definition] Notation: the three-level graph model is introduced with descriptive names but without a single consolidated table or figure that lists all symbols (G_L, G_P, G_Q, etc.) and their edge-weight interpretations; a dedicated notation table would reduce ambiguity for readers.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We agree that the absence of quantitative benchmarks limits the ability to verify the effectiveness of TPCCAP and will revise the manuscript to include a dedicated Evaluation section with benchmarks, comparisons, and ablation studies.
read point-by-point responses
-
Referee: [Evaluation section] Evaluation section: the manuscript reports no quantitative benchmarks, comparisons to existing partitioners, or metrics (e.g., achieved objective values, port utilization, or congestion levels) on any circuit suite. Because the central claim is that TPCCAP optimizes the stated objective, the lack of even synthetic-graph results or ablation studies on the three components makes it impossible to verify that the implemented algorithm produces the intended improvements.
Authors: We acknowledge that this observation is correct and that the current manuscript is limited to a design description of the three-level model and TPCCAP without empirical results. In the revised version we will add a full Evaluation section containing: (i) experiments on synthetic interaction graphs that report the achieved values of the composite objective (weighted cut distance + port overflow + routed congestion), (ii) direct comparisons against baseline partitioners (e.g., METIS and a greedy multi-level variant) using the same three-level graph model, (iii) per-metric breakdowns of port utilization and link-load congestion, and (iv) ablation studies that isolate the contribution of each term in the objective. These additions will allow readers to verify that the implemented algorithm produces the intended improvements. revision: yes
Circularity Check
No significant circularity identified
full rationale
The paper defines TPCCAP as a partitioning method that directly optimizes an explicitly constructed objective (weighted cut distance + port overflow + routed link-load congestion) on a three-level graph model. This is a standard algorithmic definition with no reduction of any claimed prediction or result to a fitted parameter, self-citation chain, or self-referential definition. No equations or steps are shown that rename a known result, smuggle an ansatz via prior work, or import uniqueness from the authors' own citations. The framework description (heavy-edge clustering, simulated annealing, etc.) is self-contained as a compiler abstraction and does not rely on external benchmarks or prior self-citations to establish its central claim.
Axiom & Free-Parameter Ledger
free parameters (1)
- objective weights
axioms (1)
- domain assumption Communication costs in modular quantum systems can be accurately captured by graph-cut distance, port counts, and link-load congestion.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The implemented TPCCAP objective is J(π) = α Σ w_ij d(π(i),π(j)) + β Σ max(0,b_q−P)^2 + η Σ L_e^2
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Computer Networks254, 110672 (2024).https://doi.org/10.1016/j.comnet.2024.110672
Caleffi, M., Amoretti, M., Ferrari, D., Illiano, J., Manzalini, A., Cacciapuoti, A.S.: Distributed quantum computing: A survey. Computer Networks254, 110672 (2024).https://doi.org/10.1016/j.comnet.2024.110672
-
[2]
Physical Review A59, 4249–4254 (1999).https: //doi.org/10.1103/PhysRevA.59.4249
Cirac, J.I., Ekert, A.K., Huelga, S.F., Macchiavello, C.: Distributed quantum com- putation over noisy channels. Physical Review A59, 4249–4254 (1999).https: //doi.org/10.1103/PhysRevA.59.4249
-
[3]
Scientific Reports12, 15421 (2022).https://doi
Davarzani, Z., Zomorodi, M., Houshmand, M.: A hierarchical approach for building distributed quantum systems. Scientific Reports12, 15421 (2022).https://doi. org/10.1038/s41598-022-18989-w
-
[4]
IEEE Transactions on Quantum Engineering2, 1–20 (2021).https://doi.org/10.1109/TQE.2021.3053921
Ferrari, D., Cacciapuoti, A.S., Amoretti, M., Caleffi, M.: Compiler design for dis- tributed quantum computing. IEEE Transactions on Quantum Engineering2, 1–20 (2021).https://doi.org/10.1109/TQE.2021.3053921
-
[5]
Ferrari,D.,Carretta,S.,Amoretti,M.:Amodularquantumcompilationframework for distributed quantum computing. IEEE Transactions on Quantum Engineering 4, 1–13 (2023).https://doi.org/10.1109/TQE.2023.3303935 14 S. Sarkar, S. Bhattacharjee
-
[6]
IBM Quantum: Couplingmap. Qiskit API Documentation (2026),https: //quantum.cloud.ibm.com/docs/en/api/qiskit/2.3/qiskit.transpiler. CouplingMap
work page 2026
-
[7]
IBM Quantum Documentation (2026),https://quantum.cloud.ibm.com/docs/en/guides/transpile
IBM Quantum: Introduction to transpilation. IBM Quantum Documentation (2026),https://quantum.cloud.ibm.com/docs/en/guides/transpile
work page 2026
-
[8]
SIAM Journal on Scientific Computing20(1), 359–392 (1998)
Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM Journal on Scientific Computing20(1), 359–392 (1998). https://doi.org/10.1137/S1064827595287997
-
[9]
Li, G., Ding, Y., Xie, Y.: Tackling the qubit mapping problem for NISQ-era quan- tum devices. In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems. pp. 1001–1014 (2019).https://doi.org/10.1145/3297858.3304023
-
[10]
Nature638, 383–388 (2025).https://doi.org/10.1038/ s41586-024-08404-x
Main, D., Drmota, P., Nadlinger, D.P., Ainley, E.M., Agrawal, A., Nichol, B.C., Srinivas, R., Araneda, G., Lucas, D.M.: Distributed quantum computing across an optical network link. Nature638, 383–388 (2025).https://doi.org/10.1038/ s41586-024-08404-x
work page 2025
-
[11]
Nature Elec- tronics8, 610–619 (2025).https://doi.org/10.1038/s41928-025-01404-3
Mollenhauer, M., Irfan, A., Cao, X., Mandal, S., Pfaff, W.: A high-efficiency ele- mentary network of interchangeable superconducting qubit devices. Nature Elec- tronics8, 610–619 (2025).https://doi.org/10.1038/s41928-025-01404-3
-
[12]
Physical Review A89, 022317 (2014).https://doi
Monroe, C., Raussendorf, R., Ruthven, A., Brown, K.R., Maunz, P., Duan, L.M., Kim, J.: Large-scale modular quantum-computer architecture with atomic memory and photonic interconnects. Physical Review A89, 022317 (2014).https://doi. org/10.1103/PhysRevA.89.022317
-
[13]
GitHub repository (2026),https://github.com/neuralsorcerer/ quport
Sarkar, S.: Quport: multi-qpu circuit mapping, routing, splitting, scheduling, and benchmarking. GitHub repository (2026),https://github.com/neuralsorcerer/ quport
work page 2026
-
[14]
Sivarajah, S., Dilkes, S., Cowtan, A., Simmons, W., Edgington, A., Duncan, R.: t|ket>: A retargetable compiler for NISQ devices. Quantum Science and Technol- ogy6(1), 014003 (2020).https://doi.org/10.1088/2058-9565/ab8e92 A Notation T able 1.Notation used in the manuscript. Symbol Meaning QL logical-qubit set GL weighted logical interaction graph GQ undir...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.