Graph Rewiring in GNNs to Mitigate Over-Squashing and Over-Smoothing: A Survey
Pith reviewed 2026-05-23 16:36 UTC · model grok-4.3
The pith
Graph rewiring changes input topology to let GNNs propagate information farther without excessive compression or node indistinguishability.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Over-squashing and over-smoothing both originate in the interaction between message passing and the given graph topology; rewiring methods modify that topology to improve long-range information flow while preserving task-relevant structure.
What carries the argument
Graph rewiring techniques: a family of methods that add, remove, or reweight edges to change the input graph's connectivity before or during GNN message passing.
If this is right
- Rewiring can reduce the exponential compression of signals from distant nodes.
- It can slow the convergence of node representations across layers.
- Different rewiring strategies carry distinct theoretical guarantees and computational overheads.
- The choice of rewiring must balance added connectivity against preservation of original task signals.
Where Pith is reading between the lines
- Rewiring might generalize as a preprocessing step for any message-passing architecture on graphs.
- Combining rewiring with learned edge predictors could reduce the need for manual hyper-parameter search.
- Domain-specific graphs such as molecular or citation networks may require tailored rewiring rules that respect known constraints.
Load-bearing premise
The two performance bottlenecks are caused mainly by the fixed input topology rather than by other aspects of the model or training procedure.
What would settle it
A controlled experiment in which a rewiring method is applied yet measured long-range signal strength and node distinguishability remain unchanged or worsen.
Figures
read the original abstract
Graph Neural Networks are powerful models for learning from graph-structured data, yet their effectiveness is often limited by two critical challenges: over-squashing, where information from distant nodes is excessively compressed, and over-smoothing, where repeated propagation makes node representations indistinguishable. Both phenomena stem from the interaction between message passing and the input topology, ultimately degrading information flow and limiting the performance of GNNs. In this survey, we examine graph rewiring techniques, a class of methods designed to modify the graph topology to enhance information propagation in GNNs. We provide a comprehensive review of state-of-the-art rewiring approaches, delving into their theoretical underpinnings, practical implementations, and performance trade-offs.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper is a survey reviewing graph rewiring techniques in GNNs designed to mitigate over-squashing (excessive compression of distant node information) and over-smoothing (indistinguishable node representations after repeated propagation). It covers the theoretical underpinnings of these phenomena as arising from message-passing interactions with input topology, then examines state-of-the-art rewiring methods, their implementations, and performance trade-offs.
Significance. A well-structured survey in this area would organize the rapidly growing literature on topology modification for better information flow, potentially aiding researchers in selecting methods and identifying gaps. The central premise aligns with established GNN literature on topology-message passing interactions; no novel derivations or empirical claims are made by the survey itself.
minor comments (2)
- [Abstract] Abstract: The scope is stated clearly, but the manuscript should explicitly state the search methodology, inclusion criteria, number of papers reviewed, and time frame covered to allow assessment of completeness and potential selection bias.
- The survey would benefit from a consistent taxonomy or comparison table (e.g., by rewiring type, computational cost, or empirical metrics used across cited works) to make trade-offs more transparent.
Simulated Author's Rebuttal
We thank the referee for the positive summary of our survey and the recommendation of minor revision. No specific major comments were provided in the report.
Circularity Check
No circularity: survey reports external literature without internal derivations
full rationale
This is a survey paper whose purpose is to review and organize existing published work on graph rewiring for GNNs. It contains no original equations, fitted parameters, predictions, or derivation chain that could reduce to its own inputs. The central premise (interaction of message passing with topology causing over-squashing and over-smoothing) is presented as established background from the broader GNN literature, not as a novel result derived or justified within the paper itself. No self-citation load-bearing steps, ansatzes, or renamings of results occur because the paper does not advance new technical claims that require such justification.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 2 Pith papers
-
From Model to Data (M2D): Shifting Complexity from GNNs to Graphs for Transparent Graph Learning
M2D distillation augments input graphs with model-derived features and structure, letting simple student GNNs match teacher performance while exposing mechanisms such as attention and fairness directly in the data.
-
Ollivier-Ricci Curvature of Riemannian Manifolds and Directed Graphs with Applications to Graph Neural Networks
Ollivier-Ricci curvature is extended from manifolds and undirected graphs to directed graphs with applications to graph neural networks.
Reference graph
Works this paper leans on
-
[1]
Mixhop: Higher-order graph convolutional architectures via sparsified neighborhood mix- ing
Sami Abu-El-Haija, Bryan Perozzi, Amol Kapoor, Nazanin Alipourfard, Kristina Lerman, Hrayr Harutyunyan, Greg Ver Steeg, and Aram Galstyan. Mixhop: Higher-order graph convolutional architectures via sparsified neighborhood mix- ing. In ICML, pages 21–29, 2019. 16 H. Attali et al
work page 2019
-
[2]
Eigenvalues, expanders and superconcentrators
Noga Alon and Vitali D Milman. Eigenvalues, expanders and superconcentrators. In 25th Annual Symposium onFoundations of Computer Science., pages 320–322. IEEE, 1984
work page 1984
-
[3]
On the bottleneck of graph neural networks and its practical implications
Uri Alon and Eran Yahav. On the bottleneck of graph neural networks and its practical implications. In International Conference on Learning Representations, 2021
work page 2021
-
[4]
Diffwire: Inductive graph rewiring via the lov \’asz bound
Adrián Arnaiz-Rodríguez, Ahmed Begga, Francisco Escolano, and Nuria Oliver. Diffwire: Inductive graph rewiring via the lov \’asz bound. arXiv preprint arXiv:2206.07369, 2022
-
[5]
Curvature constrained mpnns: Improving message passing with local structural properties
Hugo Attali, Davide Buscaldi, and Nathalie Pernelle. Curvature constrained mpnns: Improving message passing with local structural properties. 2024
work page 2024
-
[6]
Delaunay graph: Addressing over-squashing and over-smoothing using delaunay triangulation
Hugo Attali, Davide Buscaldi, and Nathalie Pernelle. Delaunay graph: Addressing over-squashing and over-smoothing using delaunay triangulation. In Forty-first International Conference on Machine Learning, 2024
work page 2024
-
[7]
Transductive legal judgment prediction combin- ing BERT embeddings with delaunay-based GNNs
Hugo Attali and Nadi Tomeh. Transductive legal judgment prediction combin- ing BERT embeddings with delaunay-based GNNs. In Nikolaos Aletras, Ilias Chalkidis, Leslie Barrett, Cătălina Goant,ă, Daniel Preot,iuc-Pietro, and Gerasimos Spanakis, editors,Proceedings of the Natural Legal Language Processing Workshop 2024, pages 187–193, Miami, FL, USA, November...
work page 2024
-
[8]
Oversquashing in gnns through the lens of information contraction and graph expansion
Pradeep Kr Banerjee, Kedar Karhadkar, Yu Guang Wang, Uri Alon, and Guido Montúfar. Oversquashing in gnns through the lens of information contraction and graph expansion. In 2022 58th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pages 1–8. IEEE, 2022
work page 2022
-
[9]
Bronstein, and Francesco Di Giovanni
Federico Barbero, Ameya Velingker, Amin Saberi, Michael M. Bronstein, and Francesco Di Giovanni. Locality-aware graph rewiring in GNNs. InThe Twelfth International Conference on Learning Representations, 2024
work page 2024
-
[10]
Graph neural networks use graphs when they shouldn’t
Maya Bechler-Speicher, Ido Amos, Ran Gilad-Bachrach, and Amir Glober- son. Graph neural networks use graphs when they shouldn’t. arXiv preprint arXiv:2309.04332, 2023
-
[11]
Understanding oversquashing in gnns through the lens of effective resistance
Mitchell Black, Zhengchao Wan, Amir Nayyeri, and Yusu Wang. Understanding oversquashing in gnns through the lens of effective resistance. InInternational Conference on Machine Learning, pages 2528–2547. PMLR, 2023
work page 2023
-
[12]
Beyond low-frequency infor- mation in graph convolutional networks
Deyu Bo, Xiao Wang, Chuan Shi, and Huawei Shen. Beyond low-frequency infor- mation in graph convolutional networks. InProceedings of the AAAI conference on artificial intelligence, volume 35, pages 3950–3957, 2021
work page 2021
-
[13]
Scaling graphneuralnetworkswithapproximatepagerank
Aleksandar Bojchevski, Johannes Gasteiger, Bryan Perozzi, Amol Kapoor, Martin Blais, Benedek Rózemberczki, Michal Lukasik, and Stephan Günnemann. Scaling graphneuralnetworkswithapproximatepagerank. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 2464–2473, 2020
work page 2020
-
[14]
How Attentive are Graph Attention Networks?
Shaked Brody, Uri Alon, and Eran Yahav. How attentive are graph attention networks? arXiv preprint arXiv:2105.14491, 2021
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[15]
Spectral net- works and locally connected networks on graphs
Joan Bruna, Wojciech Zaremba, Arthur Szlam, and Yann LeCun. Spectral net- works and locally connected networks on graphs. InICLR, 2014
work page 2014
-
[16]
A note on over-smoothing for graph neural networks
Chen Cai and Yusu Wang. A note on over-smoothing for graph neural networks. Graph Representation Learning, 2020
work page 2020
-
[17]
A lower bound for the smallest eigenvalue of the laplacian
Jeff Cheeger. A lower bound for the smallest eigenvalue of the laplacian. In Problems in Analysis: A Symposium in Honor of Salomon Bochner (PMS-31). Princeton University Press, 1970. Graph Rewiring: A Survey 17
work page 1970
-
[18]
Deli Chen, Yankai Lin, Wei Li, Peng Li, Jie Zhou, and Xu Sun. Measuring and re- lieving the over-smoothing problem for graph neural networks from the topological view. In Proceedings of the AAAI Conference, pages 3438–3445, 2020
work page 2020
-
[19]
American Mathe- matical Soc., 1997
Fan RK Chung and Fan Chung Graham.Spectral graph theory. American Mathe- matical Soc., 1997
work page 1997
-
[20]
Andreea Deac, Marc Lackenby, and Petar Veličković. Expander graph propagation. In Learning on Graphs Conference, pages 38–1. PMLR, 2022
work page 2022
-
[21]
On over-squashing in message passing neural networks: The impact of width, depth, and topology
Francesco Di Giovanni, Lorenzo Giusti, Federico Barbero, Giulia Luise, Pietro Lio, and Michael M Bronstein. On over-squashing in message passing neural networks: The impact of width, depth, and topology. InICML, pages 7865–7885. PMLR, 2023
work page 2023
-
[22]
Long range graph benchmark.Advances in Neural Information Processing Systems, 35:22326–22340, 2022
Vijay Prakash Dwivedi, Ladislav Rampášek, Michael Galkin, Ali Parviz, Guy Wolf, Anh Tuan Luu, and Dominique Beaini. Long range graph benchmark.Advances in Neural Information Processing Systems, 35:22326–22340, 2022
work page 2022
-
[23]
On class distributions induced by nearest neighbor graphs for node classification of tabular data
Federico Errica. On class distributions induced by nearest neighbor graphs for node classification of tabular data. InThirty-seventh Conference on Neural Information Processing Systems, 2023
work page 2023
-
[24]
Mitigating over-smoothing and over-squashing using augmentations of forman-ricci curvature
Lukas Fesser and Melanie Weber. Mitigating over-smoothing and over-squashing using augmentations of forman-ricci curvature. InThe Second Learning on Graphs Conference, 2023
work page 2023
-
[25]
Bochner’s method for cell complexes and combinatorial ricci cur- vature
Robin Forman. Bochner’s method for cell complexes and combinatorial ricci cur- vature. 2003
work page 2003
-
[26]
On the trade-off between over-smoothing and over-squashing in deep graph neural networks
Jhony H Giraldo, Konstantinos Skianis, Thierry Bouwmans, and Fragkiskos D Malliaros. On the trade-off between over-smoothing and over-squashing in deep graph neural networks. In Proceedings of the 32nd ACM international CIKM, pages 566–576, 2023
work page 2023
-
[27]
Learning task-dependent distributed rep- resentations by backpropagation through structure
Christoph Goller and Andreas Kuchler. Learning task-dependent distributed rep- resentations by backpropagation through structure. InProceedings of International Conference on Neural Networks (ICNN’96), volume 1, pages 347–352. IEEE, 1996
work page 1996
-
[28]
Diana Gomes, Frederik Ruelens, Kyriakos Efthymiadis, Ann Nowe, and Peter Vrancx. When are graph neural networks better than structure-agnostic meth- ods? In I Can’t Believe It’s Not Better Workshop: Understanding Deep Learning Through Empirical Falsification, 2022
work page 2022
-
[29]
A new model for learning in graph domains
Marco Gori, Gabriele Monfardini, and Franco Scarselli. A new model for learning in graph domains. InProceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005., volume 2, pages 729–734. IEEE, 2005
work page 2005
-
[30]
Drew: Dynamically rewired message passing with delay
Benjamin Gutteridge, Xiaowen Dong, Michael M Bronstein, and Francesco Di Gio- vanni. Drew: Dynamically rewired message passing with delay. InInternational Conference on Machine Learning, pages 12252–12267. PMLR, 2023
work page 2023
-
[31]
Jürgen Jost and Shiping Liu. Ollivier’s ricci curvature, local clustering and curvature-dimension inequalities on graphs.Discrete & Computational Geometry, 51(2):300–322, 2014
work page 2014
-
[32]
Fosr: First-order spectral rewiring for addressing oversquashing in gnns
Kedar Karhadkar, Pradeep Kr Banerjee, and Guido Montúfar. Fosr: First-order spectral rewiring for addressing oversquashing in gnns. InInternational Conference on Learning Representations, ICLR, 2023
work page 2023
-
[33]
Diffusion im- proves graph learning
Johannes Klicpera, Stefan Weißenberger, and Stephan Günnemann. Diffusion im- proves graph learning. In Advances in neural information processing systems, NeurIPS, 2019
work page 2019
-
[34]
Soo Yong Lee, Sunwoo Kim, Fanchen Bu, Jaemin Yoo, Jiliang Tang, and Kijung Shin. Feature distribution on graph topology mediates the effect of graph convo- lution: Homophily perspective.arXiv preprint arXiv:2402.04621, 2024. 18 H. Attali et al
-
[35]
Evennet: Ignoring odd-hop neighbors improves robustness of graph neural networks
Runlin Lei, Zhen Wang, Yaliang Li, Bolin Ding, and Zhewei Wei. Evennet: Ignoring odd-hop neighbors improves robustness of graph neural networks. Advances in Neural Information Processing Systems, 35:4694–4706, 2022
work page 2022
-
[36]
Large scale learning on non-homophilous graphs: New benchmarks and strong simple methods
Derek Lim, Felix Hohne, Xiuyu Li, Sijia Linda Huang, Vaishnavi Gupta, Omkar Bhalerao, and Ser Nam Lim. Large scale learning on non-homophilous graphs: New benchmarks and strong simple methods. InAdvances in neural information processing systems, 2021
work page 2021
-
[37]
Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Mingde Zhao, Shuyuan Zhang, Xiao-Wen Chang, and Doina Precup. Is heterophily a real nightmare for graph neural networks to do node classification?arXiv preprint arXiv:2109.05641, 2021
-
[38]
TUDataset: A collection of benchmark datasets for learning with graphs
Christopher Morris, Nils M Kriege, Franka Bause, Kristian Kersting, Petra Mutzel, and Marion Neumann. Tudataset: A collection of benchmark datasets for learning with graphs. arXiv preprint arXiv:2007.08663, 2020
work page internal anchor Pith review arXiv 2007
-
[39]
Revisiting over-smoothing and over-squashing using ollivier- ricci curvature
Khang Nguyen, Nong Minh Hieu, Vinh Duc Nguyen, Nhat Ho, Stanley Osher, and Tan Minh Nguyen. Revisiting over-smoothing and over-squashing using ollivier- ricci curvature. InICML, pages 25956–25979. PMLR, 2023
work page 2023
-
[40]
K-hop graph neural networks.Neural Networks, 130:195–205, 2020
Giannis Nikolentzos, George Dasoulas, and Michalis Vazirgiannis. K-hop graph neural networks.Neural Networks, 130:195–205, 2020
work page 2020
-
[41]
Ricci curvature of metric spaces.Comptes Rendus Mathematique, 345(11):643–646, 2007
Yann Ollivier. Ricci curvature of metric spaces.Comptes Rendus Mathematique, 345(11):643–646, 2007
work page 2007
-
[42]
Kenta Oono and Taiji Suzuki. Graph neural networks exponentially lose expressive power for node classification.Proceedings of the ICLR, 2020
work page 2020
-
[43]
The pager- ank citation ranking: Bring order to the web
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The pager- ank citation ranking: Bring order to the web. Technical report, Technical report, stanford University, 1998
work page 1998
-
[44]
Oleg Platonov, Denis Kuznedelev, Michael Diskin, Artem Babenko, and Liudmila Prokhorenkova. A critical look at the evaluation of gnns under heterophily: are we really making progress? InInternational Conference on Learning Representations, 2023
work page 2023
-
[45]
Probabilistically rewired message- passing neural networks.arXiv preprint arXiv:2310.02156, 2023
Chendi Qian, Andrei Manolache, Kareem Ahmed, Zhe Zeng, Guy Van den Broeck, Mathias Niepert, and Christopher Morris. Probabilistically rewired message- passing neural networks.arXiv preprint arXiv:2310.02156, 2023
-
[46]
Dropedge: To- wards deep graph convolutional networks on node classification
Yu Rong, Wenbing Huang, Tingyang Xu, and Junzhou Huang. Dropedge: To- wards deep graph convolutional networks on node classification. InInternational conference on learning representations, ICLR, 2019
work page 2019
-
[47]
Multi-scale attributed node embedding
Benedek Rozemberczki, Carl Allen, and Rik Sarkar. Multi-scale attributed node embedding. Journal of Complex Networks, 9(2):cnab014, 2021
work page 2021
-
[48]
T Konstantin Rusch, Michael M Bronstein, and Siddhartha Mishra. A survey on oversmoothing in graph neural networks.arXiv preprint arXiv:2303.10993, 2023
-
[49]
Comparativeanalysisoftwodiscretizationsofriccicurvatureforcomplexnetworks
Areejit Samal, RP Sreejith, Jiao Gu, Shiping Liu, Emil Saucan, and Jürgen Jost. Comparativeanalysisoftwodiscretizationsofriccicurvatureforcomplexnetworks. Scientific reports, 8(1):8650, 2018
work page 2018
-
[50]
The graph neural network model
Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. The graph neural network model. Intransactions on neural networks. IEEE, 2008
work page 2008
-
[51]
Ordered gnn: Ordering message passing to deal with heterophily and over-smoothing
Yunchong Song, Chenghu Zhou, Xinbing Wang, and Zhouhan Lin. Ordered gnn: Ordering message passing to deal with heterophily and over-smoothing. arXiv preprint arXiv:2302.01524, 2023
-
[52]
Understanding virtual nodes: Oversmoothing, oversquashing, and node heterogeneity
Joshua Southern, Francesco Di Giovanni, Michael Bronstein, and Johannes F Lutzeyer. Understanding virtual nodes: Oversmoothing, oversquashing, and node heterogeneity. arXiv preprint arXiv:2405.13526, 2024. Graph Rewiring: A Survey 19
-
[53]
Social influence analysis in large- scale networks
Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. Social influence analysis in large- scale networks. InProceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 807–816, 2009
work page 2009
-
[54]
Understanding over-squashing and bottlenecks on graphs via curvature
JakeTopping,FrancescoDiGiovanni,BenjaminPaulChamberlain,XiaowenDong, and Michael M Bronstein. Understanding over-squashing and bottlenecks on graphs via curvature. Proceedings of the International Conference on Learning Representations, 2022
work page 2022
-
[55]
Floriano Tori, Vincent Holst, and Vincent Ginis. The effectiveness of curvature- based rewiring and the role of hyperparameters in gnns revisited.arXiv preprint arXiv:2407.09381, 2024
-
[56]
Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. Graph Attention Networks. InProceedings of the Inter- national Conference on Learning Representations, ICLR, 2018
work page 2018
-
[57]
Treedecomposedgraphneuralnetwork
YuWangandTylerDerr. Treedecomposedgraphneuralnetwork. In Proceedings of the 30th ACM international conference on information & knowledge management, pages 2040–2049, 2021
work page 2040
-
[58]
Graph Neural Networks for Graphs with Heterophily: A Survey
Xin Zheng, Yi Wang, Yixin Liu, Ming Li, Miao Zhang, Di Jin, Philip S Yu, and Shirui Pan. Graph neural networks for graphs with heterophily: A survey.arXiv preprint arXiv:2202.07082, 2022
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[59]
Graph neural networks: A review of methods and applications.AI open, 2020
Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, and Maosong Sun. Graph neural networks: A review of methods and applications.AI open, 2020
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.