Building Reliable Arithmetic Multipliers Under NBTI Aging and Process Variations
Pith reviewed 2026-05-19 23:30 UTC · model grok-4.3
The pith
Selective 2s complement transformations on multiplier inputs redistribute NBTI stress to extend circuit lifetime.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By exploiting the sign-invariance property of multiplication, selectively applying 2s complement transformations to the inputs redistributes NBTI stress across the transistors inside the multiplier. The result of the multiplication stays identical, so no functional errors are introduced. When the same technique is placed inside systolic arrays used for AI workloads, the hardware exhibits longer lifetime than the natural-aging baseline while adding negligible area and delay overhead.
What carries the argument
The sign-invariance property of multiplication, which lets selected inputs be converted to 2s complement form to move NBTI stress among transistors without changing the product.
If this is right
- Multipliers inside AI accelerators can operate longer before aging forces replacement or throttling.
- The same input-selection logic can be added to multipliers used in CPUs, GPUs, and FPGAs.
- Systolic-array designs gain reliability at almost zero extra silicon area or critical-path delay.
- Designers can apply the method during standard synthesis flows without new cell libraries.
Where Pith is reading between the lines
- The same invariance idea might be tested on other arithmetic blocks such as adders or MAC units that share sign-flip properties.
- Pairing the stress-redistribution logic with existing process-variation compensation could address both aging and manufacturing spread at once.
- Running the technique on real FPGA prototypes under controlled temperature and voltage stress would provide an independent check beyond the reported Cadence simulations.
Load-bearing premise
Converting inputs to 2s complement form moves stress to different transistors yet leaves the multiplication result exactly the same.
What would settle it
Fabricated multiplier test chips that apply the selective transformations and show no measurable reduction in threshold-voltage shift or delay degradation under accelerated NBTI stress would falsify the claim.
Figures
read the original abstract
Hardware aging poses a significant challenge for integrated circuits (ICs), leading to performance degradation and eventual failure. In this work, we focus on the aging of arithmetic multipliers, which are a cornerstone of modern computing systems including in CPUs, GPUs, and FPGAs, as well as AI accelerators like systolic arrays. In particular, AI workloads, which rely predominantly on multiplications, can accelerate Negative Bias Temperature Instability (NBTI) effects in multipliers. This paper presents a novel aging mitigation technique that leverages the signinvariance property of multiplication. By selectively applying 2s complement transformations to inputs, the method redistributes stress across transistors, reducing the effects of NBTI aging. The proposed method is also integrated into systolic arrays, a common AI accelerator, to demonstrate its efficiency in a high-throughput AI accelerator. Experimental evaluations using Cadence tools show better lifetime compared to natural aging (with no mitigation) baseline, while introducing negligible area and delay overheads.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that selectively applying 2's complement transformations to multiplier inputs, exploiting the sign-invariance property of multiplication, redistributes NBTI stress across transistors without changing the result. Cadence-based evaluations on multipliers and their integration into systolic arrays are reported to yield improved lifetime versus a no-mitigation baseline while incurring negligible area and delay overhead.
Significance. If the stress-reduction claim holds, the approach would supply a low-cost, functionally transparent aging mitigation technique for multipliers that dominate AI accelerator workloads. Credit is due for the systolic-array integration and for grounding the evaluation in Cadence tool flows rather than purely abstract models.
major comments (2)
- [§3] §3 (Proposed Method): The sign-invariance property guarantees that (-x)×(-y) = x×y but supplies no guarantee that the resulting bit patterns lower average NBTI stress on partial-product generators or the adder tree. The manuscript must supply either transistor-level duty-cycle analysis or workload-specific stress simulations showing that the chosen transformations are less stressful on average; without this, the central lifetime claim rests on an unverified assumption.
- [§5] §5 (Experimental Results): The abstract and evaluation sections assert better lifetime than the natural-aging baseline, yet the text provides no quantitative figures (e.g., percentage lifetime extension, ΔVth values, error bars, or workload statistics). This absence prevents verification of the claimed improvement and of the “negligible overhead” assertion.
minor comments (2)
- [Abstract] Abstract: the compound term 'signinvariance' should be written 'sign-invariance' for readability.
- The manuscript would benefit from a brief comparison table against prior NBTI mitigation techniques (e.g., input reordering or guard-banding) to clarify the novelty of the sign-invariance approach.
Simulated Author's Rebuttal
We thank the referee for the constructive comments and for recognizing the potential significance of a low-overhead, functionally transparent NBTI mitigation technique for multipliers in AI accelerators. We address each major comment below with clarifications and indicate where revisions will be made to strengthen the manuscript.
read point-by-point responses
-
Referee: [§3] §3 (Proposed Method): The sign-invariance property guarantees that (-x)×(-y) = x×y but supplies no guarantee that the resulting bit patterns lower average NBTI stress on partial-product generators or the adder tree. The manuscript must supply either transistor-level duty-cycle analysis or workload-specific stress simulations showing that the chosen transformations are less stressful on average; without this, the central lifetime claim rests on an unverified assumption.
Authors: We appreciate this observation. The sign-invariance property is used only to ensure functional equivalence while permitting a choice of input representation. Our selection heuristic is designed to balance the occurrence of logic-0 and logic-1 values at internal nodes of the partial-product generators and adder tree, thereby reducing average NBTI stress. The Cadence-based evaluations reported in §5 already embed the NBTI aging model and demonstrate lifetime gains; however, to make the stress-reduction mechanism explicit, we will add a new subsection to §3 containing transistor-level duty-cycle histograms and average stress-factor comparisons for representative input patterns before and after transformation. revision: yes
-
Referee: [§5] §5 (Experimental Results): The abstract and evaluation sections assert better lifetime than the natural-aging baseline, yet the text provides no quantitative figures (e.g., percentage lifetime extension, ΔVth values, error bars, or workload statistics). This absence prevents verification of the claimed improvement and of the “negligible overhead” assertion.
Authors: We agree that the current text presents lifetime and overhead results primarily through figures without extracting numerical values. The Cadence simulations do produce concrete metrics (lifetime extension, threshold-voltage shift, area/delay overheads, and workload statistics for the systolic-array case). We will revise §5 to report these quantities explicitly in the text, including percentage lifetime improvement relative to the baseline, mean ΔVth values, simulation error bars, and the input-distribution statistics used for the AI workloads. revision: yes
Circularity Check
No significant circularity: sign-invariance is an external mathematical fact applied to a simulation-validated mitigation
full rationale
The paper's core technique applies the standard algebraic identity (-x)×(-y)=x×y to permit selective 2's-complement input transformations that aim to redistribute NBTI stress. This identity is independent of the aging model and is not derived from any fitted parameter or prior result by the same authors. Lifetime improvement is asserted via Cadence-based experimental comparison against a natural-aging baseline, not by algebraic reduction or self-citation. No equations, ansatzes, or uniqueness theorems are invoked that collapse the claimed benefit back into the method's own inputs. The derivation chain therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Multiplication results remain unchanged under selective 2s complement transformations of inputs (sign-invariance property).
Reference graph
Works this paper leans on
-
[1]
Device aging: A reliability and security concern,
D. Kraak et al. , “Device aging: A reliability and security concern,” in Proceedings of the European Test Workshop , 2018
work page 2018
-
[2]
Modeling the Interdependences between Voltage Fluctuation and BTI Aging,
S. Salamin et al. , “Modeling the Interdependences between Voltage Fluctuation and BTI Aging,” IEEE Trans VLSI Syst, vol. 27, no. 7, 2019
work page 2019
-
[3]
System -level modeling of microprocessor reliability degradation due to BTI and HCI,
C. C. Chen, S. Cha, T. Liu, and L. Milor, “System -level modeling of microprocessor reliability degradation due to BTI and HCI,” in IEEE International Reliability Physics Symposium Proceedings , 2014
work page 2014
-
[4]
Q. Zhang et al. , “Three -Dimensional Mechanistic Modeling of Time - Dependent Dielectric Breakdown in Polycrystalline Thin Films,” Phys Rev Appl, vol. 19, no. 2, 2023
work page 2023
-
[5]
Understanding and Mitigating Hardware Failures in Deep Learning Training Accelerator Systems,
Y. He et al., “Understanding and Mitigating Hardware Failures in Deep Learning Training Accelerator Systems,” in Proceedings - International Symposium on Computer Architecture , 2023
work page 2023
-
[6]
P. Rech, “Artificial Neural Networks for Space and Safety -Critical Applications: Reliability Issues and Potential Solutions,” IEEE Trans Nucl Sci, vol. 71, no. 4, 2024
work page 2024
-
[7]
NBTI: Experimental investigation, physical modelling, circuit aging simulations and verification,
C. Schlünder, K. Puschkarsky, G. A. Rott, W. Gustin, and H. Reisinger, “NBTI: Experimental investigation, physical modelling, circuit aging simulations and verification,” Microelectronics Reliability, vol. 82, 2018
work page 2018
-
[8]
MAGIC: Malicious aging in Circuits/Cores,
N. Karimi et al. , “MAGIC: Malicious aging in Circuits/Cores,” ACM Transactions on Architecture and Code Optimization, vol. 12, no. 1, 2015
work page 2015
-
[9]
Y. Xue et al. , “On the Understanding of pMOS NBTI Degradation in Advance Nodes: Characterization, Modeling, and Exploration on the Physical Origin of Defects,” IEEE TED, vol. 70, no. 9, 2023
work page 2023
-
[10]
An energy - efficient approximate systolic array based on timing error prediction and prevention,
N. C. Huang, W. K. Tseng, H. J. Chou, and K. C. Wu, “An energy - efficient approximate systolic array based on timing error prediction and prevention,” in Proceedings of the IEEE VLSI Test Symposium , 2021
work page 2021
-
[11]
Optimizing Modular Multiplication for NVIDIA’s Maxwell GPUs,
N. Emmart, J. Luitjens, C. Weems, and C. Woolley, “Optimizing Modular Multiplication for NVIDIA’s Maxwell GPUs,” in Proceedings - Symposium on Computer Arithmetic , 2016
work page 2016
-
[12]
Design of Low-Power Wallace Tree Multiplier Architecture Using Modular Approach,
V. Solanki, A. D. Darji, and H. Singapuri, “Design of Low-Power Wallace Tree Multiplier Architecture Using Modular Approach,” Circuits Syst Signal Process, vol. 40, no. 9, 2021
work page 2021
-
[13]
N. I. Deligiannis et al., “A Reliability Evaluation Flow for Assessing the Impact of Permanent Hardware Faults on Integer Arithmetic Circuits,” IEEE Access, vol. 13, pp. 32177–32196, 2025
work page 2025
-
[14]
Modeling and Predicting Transistor Aging under Workload Dependency Using Machine Learning,
P. R. Genssler, H. E. Barkam, K. Pandaram et al. , “Modeling and Predicting Transistor Aging under Workload Dependency Using Machine Learning,” IEEE TCAS-I: Regular Papers, vol. 70, no. 9, 2023
work page 2023
-
[15]
Communication and aging aware application mapping for multicore based edge computing servers,
J. Ali, T. Maqsood, N. Khalid, and S. A. Madani, “Communication and aging aware application mapping for multicore based edge computing servers,” Cluster Comput, vol. 26, no. 1, 2023
work page 2023
-
[16]
Aging Mitigation in Systolic Array Accelerators: Balancing PE Loads for Enhanced Reliability,
Y. G. Chen, Y. C. Ho, and J. Y. Jou, “Aging Mitigation in Systolic Array Accelerators: Balancing PE Loads for Enhanced Reliability,” in International System on Chip Conference, IEEE Computer Society, 2024
work page 2024
-
[17]
Asymmetric aging effect on modern microprocessors,
F. Gabbay and A. Mendelson, “Asymmetric aging effect on modern microprocessors,” Microelectronics Reliability, vol. 119, 2021
work page 2021
-
[18]
Aging Attack on Systolic Array-Based AI Accelerators via NBTI -Induced Aging,
M. Heidary and B. K. Joardar, “Aging Attack on Systolic Array-Based AI Accelerators via NBTI -Induced Aging,” in Proceedings - ISQED, IEEE Computer Society, 2025
work page 2025
-
[19]
Ring-DVFS: Reliability -Aware reinforcement learning -based dvfs for real -Time embedded systems,
A. Yeganeh -Khaksar, M. Ansari et al., “Ring-DVFS: Reliability -Aware reinforcement learning -based dvfs for real -Time embedded systems,” IEEE Embed Syst Lett, vol. 13, no. 3, 2021
work page 2021
-
[20]
X. Garros et al., “Detrimental impact of technological processes on b ti reliability of advanced high -K/metal gate stacks,” in IEEE International Reliability Physics Symposium Proceedings , 2009
work page 2009
-
[21]
Y. Zhan, S. V. Kumar et al., “Thermally aware design,” Foundations and Trends in Electronic Design Automation , vol. 2, no. 3, 2007
work page 2007
-
[22]
Dependable DNN Accelerato r for Safety-Critical Systems: A Review on the Aging Perspective,
I. Moghaddasi, S. Gorgin, and J. A. Lee, “Dependable DNN Accelerato r for Safety-Critical Systems: A Review on the Aging Perspective,” 2023
work page 2023
-
[23]
Predictive modeling of the NBTI effect for reliabl e design,
S. Bhardwaj et al., “Predictive modeling of the NBTI effect for reliabl e design,” in Proceedings of CICC, 2006
work page 2006
-
[24]
Modeling manufacturing process variation for design and test,
S. Kundu and A. Sreedhar, “Modeling manufacturing process variation for design and test,” in Proceedings -DATE, 2011
work page 2011
-
[25]
Threshold voltage distribution in MLC NAND flash memory: Characterization, analysis, and modeling,
Y. Cai, E. F. Haratsch, O. Mutlu, and K. Mai, “Threshold voltage distribution in MLC NAND flash memory: Characterization, analysis, and modeling,” in Proceedings -DATE, 2013
work page 2013
-
[26]
R. Zhang, T. Liu, K. Yang, and L. Milor, “Analysis of time -dependent dielectric breakdown induced aging of SRAM cache with different configurations,” Microelectronics Reliability, vol. 76–77, 2017
work page 2017
-
[27]
A. Jafari, M. Raji, and B. Ghavami, “Impacts of Process Variations and Aging on Lifetime Reliability of Flip -Flops: A Comparative Analysis,” IEEE TDMR, vol. 19, no. 3, 2019
work page 2019
-
[28]
M. Turgeon, A. M. Wing, and L. W. Taylor, “Timing and Aging: Slowing of Fastest Regular Tapping Rate With Preserved Timing Error Detecti on and Correction,” Psychol Aging, vol. 26, no. 1, 2011
work page 2011
-
[29]
Lifetime reliability-aware task allocation and scheduling for MPSoC platforms,
H. Lin, Y. Feng, and X. Qiang, “Lifetime reliability-aware task allocation and scheduling for MPSoC platforms,” in Proceedings -Design, Automation and Test in Europe, DATE , 2009
work page 2009
-
[30]
ArISE: Aging -aware instruction set encoding for lifetime improvement,
F. Oboril and M. Tahoori, “ArISE: Aging -aware instruction set encoding for lifetime improvement,” in Proceedings of the Asia and South Pacific Design Automation Conference, ASP -DAC, 2014
work page 2014
-
[31]
S. Rehman, F. Kriebel, D. Sun, M. Shafique, and J. Henkel, “DTune: Leveraging reliable code generation for adaptive dependability tuning under process variation and aging -induced effects,” in Proceedings - Design Automation Conference , 2014
work page 2014
-
[32]
Texas Instruments on Automotive Reliability
Paul McLellan, “Texas Instruments on Automotive Reliability.” Accessed: Dec. 16, 2025. [Online]
work page 2025
-
[33]
Integrated Circuit Intrinsic Reliability,
D. Eaton, “Integrated Circuit Intrinsic Reliability,” 2005, IEEE SSC
work page 2005
-
[34]
P. S. Kathribail and T. Vijayakumar, “Comprehensive Study of MO SFET Degradation in Power Converters and Prognostic Failure Detection Using Physical Model,” J. Inst. Eng. India Ser. B 104, 305 –317 (2023)
work page 2023
-
[35]
Approximate Logic Synthesis: A Survey,
I. Scarabottolo, G. Ansaloni, G. A. Constantinides, L. Pozzi, and S. Reda, “Approximate Logic Synthesis: A Survey,” in Proc. of the IEEE, 2020
work page 2020
-
[36]
C. Xu, Y. Zheng, D. Niu, X. Zhu, S. H. Kang, and Y. Xie, “Impact of Write Pulse and Process Variation on 22 nm FinFET -Based STT- RAM Design: A Device -Architecture Co -Optimization Approach,” IEEE Transactions on Multi-Scale Computing Systems, vol. 1, no. 4, 2015
work page 2015
-
[37]
A survey on ensemble learning,
X. Dong, Z. Yu, W. Cao, Y. Shi, and Q. Ma, “A survey on ensemble learning,” 2020
work page 2020
-
[38]
Aging-aware instruction cache design by duty cycle balancing,
T. Jin and S. Wang, “Aging-aware instruction cache design by duty cycle balancing,” in Proceedings - 2012 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2012 , 2012
work page 2012
-
[39]
Analysis and mitigation of NBTI aging in register file: An end -to-end approach,
S. Kothawade, K. Chakraborty, and S. Roy, “Analysis and mitigation of NBTI aging in register file: An end -to-end approach,” in Proceedings of the 12th ISQED, 2011
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.