pith. sign in

arxiv: 2605.20287 · v1 · pith:OVPOIUKPnew · submitted 2026-05-19 · 💻 cs.LG · cs.AI· cs.CV

FusionCell: Cross-Attentive Fusion of Layout Geometry and Netlist Topology for Standard-Cell Performance Prediction

Pith reviewed 2026-05-21 08:18 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.CV
keywords standard cell performance predictionlayout geometrynetlist topologycross-attention fusionmachine learning for EDA7nm cell characterizationdelay and power predictiongraph transformer
0
0 comments X

The pith

By fusing routed layout geometry and netlist topology through cross-attention, a model predicts standard-cell delay and power with 0.92 percent average error.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Standard cells determine the speed and energy use of every digital chip, yet accurate characterization still depends on slow circuit simulations that cannot keep up with modern design flows. Many fast machine-learning predictors ignore the physical layout, missing how nearby wires and devices create coupling effects that change performance. FusionCell builds a single model that takes both the three-layer routed layout image and the heterogeneous device-net graph as inputs. A vision transformer encodes the layout while a graph transformer encodes the topology, and a topology-guided attention step lets the netlist actively query relevant spatial regions so the two views reinforce each other. On more than 19,500 automatically generated cells spanning 149 types in a 7 nm process, the fused model reports low regression error and stronger ranking agreement than prior methods while running orders of magnitude faster than simulation.

Core claim

FusionCell treats routed layout geometry and netlist topology as dual modalities that are fused explicitly inside one model. A DeiT encoder processes the three-layer routed layout while a graph transformer models the heterogeneous device and net graph. These representations are integrated by a topology-guided mechanism in which the netlist acts as a structural map that actively queries the most relevant physical regions in the layout for joint geometric and topological reasoning. The resulting predictor targets six metrics—signal rise and fall delay, transition times, and power—on a 7 nm dataset of over 19,500 cells from 149 types generated automatically from the ASAP7 PDK.

What carries the argument

Topology-guided cross-attention fusion in which netlist nodes serve as queries that selectively attend to spatial regions inside the encoded layout representation.

If this is right

  • Library development teams can evaluate thousands of cell variants in minutes rather than days of simulation time.
  • Design-space exploration gains reliable ranking of cell options, improving choices for timing and power closure.
  • The same fused representation can serve as a fast surrogate inside larger place-and-route or optimization loops.
  • Characterization throughput increases by orders of magnitude, allowing more exhaustive library coverage within fixed compute budgets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the cross-modal fusion proves robust, similar topology-guided attention could be applied to predict full-chip timing or power from floorplan and netlist data.
  • The approach opens a route to closed-loop layout optimization where performance feedback is obtained in seconds instead of hours.
  • Industrial adoption would require direct comparison against measured silicon data from multiple foundry nodes to confirm transfer beyond the training PDK.

Load-bearing premise

The automatically generated 19.5k cells from the ASAP7 PDK already contain enough variety of layout-dependent coupling and parasitics for the learned fusion to generalize to unseen cells and real design flows.

What would settle it

Evaluating the trained model on standard cells drawn from a different process design kit or from silicon measurements and observing average MAPE well above 1 percent on the same six metrics would show that the learned fusion does not generalize as claimed.

Figures

Figures reproduced from arXiv: 2605.20287 by Bojie Zhang, Haoyi Zhang, Kairong Guo, Runsheng Wang, Yibo Lin.

Figure 1
Figure 1. Figure 1: Comparison between traditional characterization, prior ML approaches, and FusionCell. Traditional flow (R/C extrac￾tion+simulation) is accurate but slow. Previous ML methods (e.g., vision-only or netlist-only) sacrifice accuracy for speed, often fail￾ing to capture layout effects or topological constraints. FusionCell achieves both high speed (milliseconds) and high correlation with golden tools via topolo… view at source ↗
Figure 2
Figure 2. Figure 2: Overview of FusionCell and topology-guided multimodal fusion. FusionCell predicts standard-cell performance (including delay and power) from routed layout geometry and netlist topology. (Middle Up) A DeiT encoder processes three-layer routed layouts (M0/M1/M2) into layout tokens ZL, including patch, class, and distillation tokens. (Middle Down) A graph transformer encodes the heterogeneous netlist—comprisi… view at source ↗
Figure 4
Figure 4. Figure 4: Heterogeneous netlist construction. Netlists are parsed into heterogeneous graphs with distinct device/net nodes. Edges are typed to distinguish physical connectivity (Device-Net) from structural correlation (Net-Net). bias is crucial for encoding the heterogeneity of the graph. We define: ϕ(u, v) = B mask u,v , where Bmask u,v enforces graph connectivity, allowing attention only between connected nodes (i… view at source ↗
Figure 6
Figure 6. Figure 6: Directed Cross-Attention Map. Graph-query → image￾key attention for the OA22x1 cell, where rows denote graph nodes and columns denote layout tokens. capture global signal flow context and potential crosstalk interference that local message passing would miss. Notes on topology-only baselines. A purely topology-only predictor is ill-posed for within-family layout variants that share identical netlists but d… view at source ↗
Figure 5
Figure 5. Figure 5: Ranking Correlation Distribution. Boxplots of Spear￾man’s ρ (top) and Kendall’s τ (bottom) for FusionCell vs. Proto￾CellLayout. FusionCell consistently maintains high correlation and low variance. A1: Topology-Guided vs. Symmetrical Fusion. This ab￾lation validates our core hypothesis: EDA fusion must be asymmetric. Tables 2 and 3 show that Symmetrical Fusion (MAPE 2.19%) offers negligible improvement over… view at source ↗
read the original abstract

Standard cells form the building blocks of digital circuits, so their delay and power critically influence chip-level performance; yet characterization still relies on slow simulation sweeps, and many fast predictors ignore layout geometry, missing coupling and layout-dependent effects. The challenge is to jointly represent layout geometry and netlist topology so models capture fine-grained spatial details together with structural connectivity for accurate performance prediction. We introduce FusionCell, a dual-modality predictor that treats routed layout geometry and netlist topology as inputs and fuses them explicitly in a unified model. A DeiT encoder processes three-layer routed layouts, while a graph transformer models heterogeneous device/net graphs. The modalities are integrated through a topology-guided mechanism, where the netlist acts as a structural "map" to actively query relevant physical regions in the layout for joint geometric and topological reasoning. We build a 7nm dataset based on the ASAP7 PDK with over 19.5k cells spanning 149 types using automatic tools, targeting six metrics: signal rise/fall delay, transition, and power. Experimental results demonstrate that FusionCell reduces regression error, with an average MAPE of 0.92 percent, and improves Spearman/Kendall ranking over baselines, while accelerating the characterization process by orders of magnitude compared to circuit simulation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces FusionCell, a dual-modality architecture that fuses routed layout geometry (via DeiT encoder on three-layer layouts) with netlist topology (via graph transformer on heterogeneous device/net graphs) through a topology-guided cross-attentive mechanism. The model is trained to regress six performance metrics (rise/fall delay, transition time, power) on a dataset of 19.5k automatically generated standard cells spanning 149 types from the ASAP7 7nm PDK. It claims an average MAPE of 0.92%, improved Spearman/Kendall ranking correlations over baselines, and orders-of-magnitude speedup relative to full circuit simulation.

Significance. If the central performance claims hold under proper validation, the work could meaningfully accelerate standard-cell library characterization in VLSI flows by replacing slow simulation sweeps with fast, layout-aware predictions. The explicit cross-attentive fusion addresses a recognized gap where pure netlist or pure geometry models miss coupling and layout-dependent parasitics. The scale of the generated dataset (19.5k cells) provides a useful training resource, and the reproducible use of open PDK tools is a positive attribute.

major comments (2)
  1. [§4 and Table 2] §4 (Experimental Setup) and Table 2: the reported average MAPE of 0.92% and ranking improvements are presented without specifying baseline implementations, train/test split strategy, statistical significance tests, or error bars; this information is load-bearing for the claim that the fusion mechanism itself drives the gains rather than dataset artifacts or implementation differences.
  2. [§3.2 and §5] §3.2 (Dataset) and §5 (Discussion): the generalizability claim rests on the untested assumption that automatically generated ASAP7 cells contain representative layout-dependent coupling, parasitics, and routing patterns; no ablation or hold-out experiments on foundry-specific DRCs, multi-patterning, or irregular placements are shown, which directly affects whether the cross-attentive fusion captures the intended physics beyond the synthetic distribution.
minor comments (2)
  1. [§3.3] The description of the topology-guided query mechanism in §3.3 would benefit from a diagram or pseudocode to clarify how netlist nodes actively attend to layout regions.
  2. [§2] Missing references to prior work on layout-aware timing models (e.g., recent graph-neural or vision-transformer approaches in EDA) in the related-work section.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback on our manuscript. The comments highlight important aspects of experimental rigor and generalizability that we address point by point below. We indicate where revisions will be incorporated to strengthen the paper.

read point-by-point responses
  1. Referee: [§4 and Table 2] §4 (Experimental Setup) and Table 2: the reported average MAPE of 0.92% and ranking improvements are presented without specifying baseline implementations, train/test split strategy, statistical significance tests, or error bars; this information is load-bearing for the claim that the fusion mechanism itself drives the gains rather than dataset artifacts or implementation differences.

    Authors: We agree that these details are essential to substantiate our performance claims. In the revised manuscript, we will expand §4 to provide: complete descriptions of all baseline implementations (including architectures, hyperparameters, and training protocols); the exact train/test split strategy (including ratio, stratification by cell type, and any cross-validation procedure); results from statistical significance tests (e.g., paired t-tests or Wilcoxon signed-rank tests with p-values); and error bars (standard deviation across runs) in Table 2 and associated figures. These additions will be supported by updated supplementary material to demonstrate that gains arise from the topology-guided cross-attention rather than implementation or data artifacts. revision: yes

  2. Referee: [§3.2 and §5] §3.2 (Dataset) and §5 (Discussion): the generalizability claim rests on the untested assumption that automatically generated ASAP7 cells contain representative layout-dependent coupling, parasitics, and routing patterns; no ablation or hold-out experiments on foundry-specific DRCs, multi-patterning, or irregular placements are shown, which directly affects whether the cross-attentive fusion captures the intended physics beyond the synthetic distribution.

    Authors: We acknowledge this as a valid limitation of the current evaluation. Our dataset leverages the open ASAP7 PDK and automated generation for reproducibility, yet it cannot fully replicate proprietary foundry variations. In the revised §5 we will add an explicit limitations paragraph discussing this assumption and outlining future validation paths. We will also incorporate a new ablation in the experiments that holds out cell subsets with simulated routing irregularities and varied placement densities to probe the fusion mechanism's sensitivity to layout patterns. Experiments involving actual foundry DRCs or multi-patterning rules remain outside the present scope due to access constraints, but the proposed additions will better bound the claims. revision: partial

Circularity Check

0 steps flagged

No significant circularity; supervised regressor trained on external simulation labels

full rationale

The paper introduces a neural architecture (DeiT for layouts + graph transformer for netlists + topology-guided cross-attention) and trains it end-to-end as a supervised regressor on 19.5k simulation-generated labels for delay/power metrics. No equations or claims reduce a prediction to a fitted parameter by construction, no self-citation chain justifies a uniqueness result, and no ansatz is smuggled in. The reported MAPE and ranking gains are empirical outcomes on held-out cells from the same generated distribution; the derivation chain remains self-contained against the external simulation oracle.

Axiom & Free-Parameter Ledger

1 free parameters · 0 axioms · 0 invented entities

The performance claim rests on the representativeness of the automatically generated 7nm dataset and on the assumption that the cross-attention fusion captures the dominant layout-dependent effects; no explicit free parameters or invented physical entities are described in the abstract.

free parameters (1)
  • DeiT and graph transformer training hyperparameters
    Model weights and optimization settings are fitted to the simulation labels on the 19.5k-cell dataset.

pith-pipeline@v0.9.0 · 5775 in / 1205 out tokens · 37650 ms · 2026-05-21T08:18:34.321194+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages · 3 internal anchors

  1. [1]

    doi: 10.1145/3658617.3703638

    ISBN 979-8-4007-0635-6. doi: 10.1145/3658617.3703638. URL https://dl.acm. org/doi/10.1145/3658617.3703638. Cheng, X., Ye, Y ., He, G., Song, Q., and Cao, P. Hetero- geneous graph attention network based statistical timing library characterization with parasitic RC reduction. In 2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC), pp. 1...

  2. [2]

    doi: 10.1109/ASP-DAC58780.2024

    ISBN 979-8-3503-9354-5. doi: 10.1109/ASP-DAC58780.2024. 10473881. URL https://ieeexplore.ieee. org/document/10473881/. Clark, L. T., Vashishtha, V ., Shifren, L., Gujja, A., Sinha, S., Cline, B., Ramamurthy, C., and Yeric, G. Asap7: A 7-nm finfet predictive process design kit. Microelectronics Journal, 53:105–115,

  3. [3]

    doi: https://doi.org/10.1016/j.mejo.2016.04

    ISSN 1879-2391. doi: https://doi.org/10.1016/j.mejo.2016.04

  4. [4]

    Dong, Z., Cao, W., Zhang, M., Tao, D., Chen, Y ., and Zhang, X

    URL https://www.sciencedirect.com/ science/article/pii/S002626921630026X. Dong, Z., Cao, W., Zhang, M., Tao, D., Chen, Y ., and Zhang, X. CktGNN: Circuit graph neural network for electronic design automation. URL http://arxiv.org/abs/ 2308.16406. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer...

  5. [5]

    URL https: //arxiv.org/abs/2010.11929. Guo, K. and Lin, Y . Multi-row standard cell layout synthesis with enhanced scalability. In2025 International Sympo- sium of Electronics Design Automation (ISEDA), pp. 317– 323,

  6. [6]

    He, K., Zhang, X., Ren, S., and Sun, J

    doi: 10.1109/ISEDA65950.2025.11101126. He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learning for image recognition,

  7. [7]

    Deep Residual Learning for Image Recognition

    URL https:// arxiv.org/abs/1512.03385. Kang, S.-M. and Leblebici, Y .CMOS Digital Integrated Circuits: Analysis and Design. McGraw-Hill, New York, NY , USA, 3rd edition,

  8. [8]

    doi: 10.1109/TCSI.2021.3069664

    ISSN 1549-8328, 1558-0806. doi: 10.1109/TCSI.2021.3069664. URL https:// ieeexplore.ieee.org/document/9394564/. Klemme, F., Chauhan, Y ., Henkel, J., and Amrouch, H. Cell library characterization using machine learning for design technology co-optimization. InProceedings of the 39th International Conference on Computer-Aided Design, pp. 1–9. ACM,

  9. [9]

    doi: 10.1145/3400302.3415713

    ISBN 978-1-4503-8026-3. doi: 10.1145/3400302.3415713. URL https://dl. acm.org/doi/10.1145/3400302.3415713. Liu, L., Li, Y ., Lu, B., Shang, L., and Yang, F. GTN-cell: Efficient standard cell characterization using graph trans- former network. In2025 Design, Automation & Test in Europe Conference (DATE), pp. 1–7. IEEE,

  10. [10]

    doi: 10.23919/DATE64628

    ISBN 978-3-9826741-0-0. doi: 10.23919/DATE64628. 2025.10992976. URL https://ieeexplore. ieee.org/document/10992976/. Luo, Z., Zhou, L., Zhang, Z., Zhou, J., Li, Z., You, H., Yao, F., and Zhao, Z. Protocelllayout: Prototype-guided graph learning for accurate and generalizable standard cell layout ppa estimation. InProceedings of the 44th International Conf...

  11. [11]

    11240788

    doi: 10.1109/ICCAD66269.2025. 11240788. Ma, T., Fan, G., Deng, Z., Sun, X., Low, K., and Shao, L. Fast cell library characterization for design technology co-optimization based on graph neural networks,

  12. [12]

    Mallik, A., Zuber, P., Liu, T.-T., Chava, B., Ballal, B., Del Bario, P

    URLhttp://arxiv.org/abs/2312.12784. Mallik, A., Zuber, P., Liu, T.-T., Chava, B., Ballal, B., Del Bario, P. R., Baert, R., Croes, K., Ryckaert, J., Badaroglu, M., Mercha, A., and Verkest, D. TEASE: a systematic analysis framework for early evaluation of FinFET-based advanced technology nodes. InProceed- ings of the 50th Annual Design Automation Conference...

  13. [13]

    Multiple chip planning for chip- interposer codesign,

    ISBN 978-1-4503-2071-9. doi: 10.1145/2463209.2488764. URL https://dl.acm. org/doi/10.1145/2463209.2488764. Pei, Z., Zhen, H.-L., Yuan, M., Huang, Y ., and Yu, B. Bet- terV: Controlled verilog generation with discriminative guidance. 10 FusionCell: Cross-Attentive Fusion of Layout Geometry and Netlist Topology for Standard-Cell Performance Prediction Shehz...

  14. [14]

    Shi, Z., Ma, C., Zheng, Z., Zhou, L., Pan, H., Jiang, W., Yang, F., Yang, X., Chu, Z., and Xu, Q

    URLhttp://arxiv.org/abs/2407.09777. Shi, Z., Ma, C., Zheng, Z., Zhou, L., Pan, H., Jiang, W., Yang, F., Yang, X., Chu, Z., and Xu, Q. Deep- Cell: Self-supervised multiview fusion for circuit repre- sentation learning. URL http://arxiv.org/abs/ 2502.06816. Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., and J ´egou, H. Training data-efficien...

  15. [15]

    Training data-efficient image transformers & distillation through attention

    URL https://arxiv.org/abs/2012.12877. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., and Polosukhin, I. Attention is all you need,

  16. [16]

    Attention Is All You Need

    URL https://arxiv.org/ abs/1706.03762. Wang, M., Wen, Y ., Sun, B., Mu, J., Li, J., Wang, X., Ye, J., Yu, B., and Li, H. Bridging layout and RTL: Knowledge distillation based timing prediction. Weste, N. H. E. and Harris, D. M.CMOS VLSI De- sign: A Circuits and Systems Perspective. Addison- Wesley, Boston, MA, USA, 4th edition,

  17. [17]

    DeepLayout: Learning neural representations of circuit placement layout

    Zhao, Y ., Chai, Z., Jiang, X., Xu, Q., Wang, R., and Lin, Y . DeepLayout: Learning neural representations of circuit placement layout. Zhu, K., Chen, H., Turner, W. J., Kokai, G. F., Wei, P.-H., Pan, D. Z., and Ren, H. TAG: Learning circuit spatial embedding from layouts. URL http://arxiv.org/ abs/2209.03465. 11