super hub Canonical reference

Graph Attention Networks

Adriana Romero, Arantxa Casanova, Guillem Cucurull, Yoshua Bengio · 2017 · stat.ML · arXiv 1710.10903

Canonical reference. 70% of citing Pith papers cite this work as background.

208 Pith papers citing it

Background 70% of classified citations

open full Pith review browse 208 citing papers more from Adriana Romero arXiv PDF

abstract

We present graph attention networks (GATs), novel neural network architectures that operate on graph-structured data, leveraging masked self-attentional layers to address the shortcomings of prior methods based on graph convolutions or their approximations. By stacking layers in which nodes are able to attend over their neighborhoods' features, we enable (implicitly) specifying different weights to different nodes in a neighborhood, without requiring any kind of costly matrix operation (such as inversion) or depending on knowing the graph structure upfront. In this way, we address several key challenges of spectral-based graph neural networks simultaneously, and make our model readily applicable to inductive as well as transductive problems. Our GAT models have achieved or matched state-of-the-art results across four established transductive and inductive graph benchmarks: the Cora, Citeseer and Pubmed citation network datasets, as well as a protein-protein interaction dataset (wherein test graphs remain unseen during training).

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 15 method 3 baseline 2

citation-polarity summary

background 14 use method 3 baseline 2 support 1

claims ledger

abstract We present graph attention networks (GATs), novel neural network architectures that operate on graph-structured data, leveraging masked self-attentional layers to address the shortcomings of prior methods based on graph convolutions or their approximations. By stacking layers in which nodes are able to attend over their neighborhoods' features, we enable (implicitly) specifying different weights to different nodes in a neighborhood, without requiring any kind of costly matrix operation (such as inversion) or depending on knowing the graph structure upfront. In this way, we address several key

authors

Adriana Romero Arantxa Casanova Guillem Cucurull Petar Veli\v{c}kovi\'c Pietro Li\`o Yoshua Bengio

co-cited works

representative citing papers

A document is worth a structured record: Principled inductive bias design for document recognition

cs.CV · 2025-07-11 · unverdicted · novelty 8.0

Introduces a method to design structure-specific relational inductive biases for a base transformer architecture, enabling end-to-end transcription of documents with intrinsic structures, demonstrated on sheet music, shape drawings, and mechanical engineering drawings.

PromptGNN-sim: Deep Fusion and Alignment of GNN and LLMs for Text-Attributed Graph Learning

cs.AI · 2026-06-29 · unverdicted · novelty 7.0

PromptGNN-sim uses GAT-based semantically aware neighborhood selection and structure-aware LLM prompts with bi-directional contrastive alignment to outperform prior GNN, LLM, and fusion methods on text-attributed graph datasets.

Redefining Maritime Anomaly Detection via Equation-Grounded Synthetic Anomalies

cs.LG · 2026-06-29 · unverdicted · novelty 7.0

Proposes equation-grounded taxonomy (unexpected AIS activity, route deviation, close approach) and LLM-guided synthesis pipeline to generate timestamp-labeled anomalies for evaluating maritime detection models.

Learning to Adaptively Allocate Gaussians for Arbitrary-Scale Image Super-Resolution

cs.CV · 2026-06-28 · unverdicted · novelty 7.0

QuADA-GS learns to predict local complexity-driven Gaussian densification from low-resolution inputs and uses Hierarchical Pointer Convolution for efficient arbitrary-scale super-resolution.

Grounded Iterative Language Planning: How Parameterized World Models Reduce Hallucination Propagation in LLM Agents

cs.AI · 2026-06-26 · unverdicted · novelty 7.0 · 2 refs

GILP trains a parameterized backbone for valid actions and state predictions, then uses a consistency gate with LLM drafts to reduce hallucinated-state rate from 0.176 to 0.035 on GPT-4o-mini while raising success from 0.668 to 0.838.

Dark Matter in Draco and Bo\"otes I: Hints of a Core in an Ultra-Faint Dwarf from Simulation-Based Inference

astro-ph.GA · 2026-06-24 · unverdicted · novelty 7.0

GraphNPE recovers a significantly lower central density for Boötes I consistent with a core while Draco remains marginally cuspy, and demonstrates that higher-order velocity moments reduce bias in dynamical modeling.

AGDN: Learning to Solve Traveling Salesman Problem with Anisotropic Graph Diffusion Network

cs.LG · 2026-06-17 · unverdicted · novelty 7.0

AGDN is a new GNN framework using a MixScore matrix and anisotropic graph diffusion to outperform prior methods on TSP instances across sizes and distributions.

Timestamp-Aware Spatio-Temporal Graph Contrastive Learning for Network Intrusion Detection

cs.CR · 2026-06-15 · unverdicted · novelty 7.0

A timestamp-aware spatio-temporal graph contrastive learning model for network intrusion detection outperforms other self-supervised methods on four datasets while matching supervised GNN performance.

Contrastive learning of dynamical representations for enhanced molecular sampling

physics.comp-ph · 2026-06-13 · unverdicted · novelty 7.0

SelfTICA reformulates collective-variable discovery as contrastive dynamical representation learning on time-lagged data, decoupling feature learning from slow-mode extraction to produce reusable collective variables from limited or biased trajectories.

Beyond Convolution: Advancing Hypergraph Neural Networks with Hypergraph U-Nets

cs.LG · 2026-06-08 · unverdicted · novelty 7.0

Introduces Hypergraph U-Nets with PHPool and PHUnpool operators derived from hierarchical clustering dendrograms for hypergraph reconstruction, classification, and anomaly detection.

Agentic multi-fidelity learning of quasiparticle and excitonic properties

cond-mat.mtrl-sci · 2026-06-05 · unverdicted · novelty 7.0

An agentic multi-fidelity learning method corrects numerical artifacts in GW-BSE excited-state calculations for 2D bilayers and improves quasiparticle gaps and exciton binding energies.

TRAPS: Therapeutic Response Analysis via Pathway-informed Stratification

cs.LG · 2026-06-05 · unverdicted · novelty 7.0

Benchmark of BINN, GraphPath, and PATH on 2622 TCGA patients shows PATH best for targeted therapy, BINN for survival, none useful for radiation, with GraphPath at 0.92 AUROC on prostate targeted therapy.

EpiFormer: Learning Antigen-Antibody Interactions for Epitope Prediction via Geometric Deep Learning

q-bio.QM · 2026-06-02 · unverdicted · novelty 7.0

EpiFormer improves epitope prediction F1 score by over 40% via early-fusion cross-attention in GNN layers and sparsity-aware objectives, while recovering known biology as emergent behavior.

Forecasting Conceptual Diffusion in Science: The Case of Quantum Computing

cs.SI · 2026-06-02 · unverdicted · novelty 7.0

LightGBM models on citation and diversity features predict exogenous diffusion of quantum computing concepts with R² up to 0.78 while endogenous reinforcement remains largely unpredictable after growth controls, with replications in other fields.

AbstainGNN: Teaching Graph Neural Networks to Abstain for Graph Classification

cs.LG · 2026-05-29 · unverdicted · novelty 7.0

AbstainGNN is a framework that jointly models prediction and abstention in GNNs for graph classification, using a PAC-Bayesian-derived unified objective and two-stage training to achieve better accuracy at given rejection rates than prior abstention methods.

Contrast to Detect: Dynamic Graph Contrastive Regularization for Unsupervised Anomaly Detection in Multivariate Time Series

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

ContrastAD achieves highest mean F1 on all five MTS benchmarks and highest AUC on three by building DTW-based sparse graph snapshots and contrasting divergent pairs with a stable anchor instead of enforcing invariance.

Gaussian Sheaf Neural Networks

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Gaussian Sheaf Neural Networks derive a sheaf Laplacian for Gaussian node features on graphs to preserve their geometric structure during message passing.

NeighborDiv: Training-free Zero-shot Generalist Graph Anomaly Detection via Neighbor Diversity

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

NeighborDiv detects graph anomalies via variance of inter-neighbor feature similarities under a new Neighbor-to-Neighbor Diversity Paradigm, achieving SOTA results with zero volatility in zero-shot cross-domain settings.

Learning over Positive and Negative Edges with Contrastive Message Passing

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

Contrastive Message Passing lets GNNs apply similarity-preserving transforms to positive edges and dissimilarity-inducing transforms to negative edges via soft positive semidefinite constraints on weights, yielding gains in low-label high-homophily regimes.

GraphIP-Bench: How Hard Is It to Steal a Graph Neural Network, and Can We Stop It?

cs.CR · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

GraphIP-Bench is a new unified benchmark showing GNN model extraction succeeds at moderate query budgets while most defenses fail to prevent it or retain verification signals on surrogates.

TopoU-Net: a U-Net architecture for topological domains

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

TopoU-Net is a rank-path U-Net for combinatorial complexes that encodes by lifting cochains upward along incidences, decodes by transporting downward, and merges via skip connections at matched ranks.

CTQWformer: A CTQW-based Transformer for Graph Classification

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

CTQWformer fuses continuous-time quantum walks into a graph transformer and recurrent module to outperform standard GNNs and graph kernels on classification benchmarks.

Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

SoftBlobGIN combines ESM-2 representations with protein contact graphs via a lightweight GNN and differentiable substructure pooling to achieve 92.8% accuracy on enzyme classification, raise binding-site AUROC to 0.983, and generate auditable structural explanations without retraining the language模型

SGC-RML: A reliable and interpretable longitudinal assessment for PD in real-world DNS

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

SGC-RML creates an 8D symptom atlas from multimodal PD data and integrates conformal calibration to deliver reliable, rejectable longitudinal assessments.

citing papers explorer

Showing 18 of 18 citing papers after filters.

A document is worth a structured record: Principled inductive bias design for document recognition cs.CV · 2025-07-11 · unverdicted · none · ref 52 · internal anchor
Introduces a method to design structure-specific relational inductive biases for a base transformer architecture, enabling end-to-end transcription of documents with intrinsic structures, demonstrated on sheet music, shape drawings, and mechanical engineering drawings.
Learning to Adaptively Allocate Gaussians for Arbitrary-Scale Image Super-Resolution cs.CV · 2026-06-28 · unverdicted · none · ref 59 · internal anchor
QuADA-GS learns to predict local complexity-driven Gaussian densification from low-resolution inputs and uses Hierarchical Pointer Convolution for efficient arbitrary-scale super-resolution.
Hierarchical Mesh Transformers with Topology-Guided Pretraining for Morphometric Analysis of Brain Structures cs.CV · 2026-04-06 · unverdicted · none · ref 7 · internal anchor
A hierarchical mesh transformer using topology-guided pretraining on simplicial complexes achieves state-of-the-art results on Alzheimer's classification, amyloid prediction, and focal cortical dysplasia detection from brain meshes.
EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction cs.CV · 2026-03-25 · unverdicted · none · ref 22 · internal anchor
EndoVGGT uses a dynamic DeGAT graph attention module to improve depth estimation and non-rigid 3D reconstruction in surgery, reporting 24.6% PSNR and 9.1% SSIM gains on SCARED with zero-shot generalization to new domains.
GRAPE: Graph-Augmented Prototype Explanations for Interactive Medical Image Diagnosis cs.CV · 2026-06-29 · unverdicted · none · ref 22 · 2 links · internal anchor
GRAPE augments prototype medical image classifiers with graph attention for co-occurrence, a mismatch safety check, and open-vocabulary anchoring to support incremental addition of findings from single examples.
fMRI-Diffusion: Generating fMRI Time Series Via a Temporal Transformer Diffusion Model for Major Depressive Disorder Diagnosis cs.CV · 2026-05-22 · unverdicted · none · ref 34 · internal anchor
fMRI-Diffusion generates synthetic ROI-level fMRI time series via a temporal transformer diffusion model with supervised pretraining, improving MDD diagnostic accuracy by up to 3.7 percentage points over prior FC-matrix synthesis methods on the REST-meta-MDD dataset.
Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery cs.CV · 2026-05-12 · unverdicted · none · ref 55 · 2 links · internal anchor
SkyPart achieves state-of-the-art single-pass cross-view geo-localization on SUES-200, University-1652, and DenseUAV by using prototype-based part discovery, altitude-conditioned modulation, and Kendall-weighted loss, with widening gains under weather corruptions.
Region-Affinity Attention for Whole-Slide Breast Cancer Classification in Deep Ultraviolet Imaging cs.CV · 2026-04-19 · unverdicted · none · ref 23 · internal anchor
A novel Region-Affinity Attention mechanism classifies breast cancer on whole deep ultraviolet slides, achieving 92.67% accuracy and 95.97% AUC on 136 samples while outperforming standard attention methods.
Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention cs.CV · 2019-07-20 · unverdicted · none · ref 35 · internal anchor
DG-STA builds dynamic graphs from hand skeletons, applies spatial-temporal self-attention to learn features, and uses a mask to cut cost by 99%, outperforming prior methods on DHG-14/28 and SHREC'17.
Graph Neural Based End-to-end Data Association Framework for Online Multiple-Object Tracking cs.CV · 2019-07-11 · unverdicted · none · ref 78 · internal anchor
A graph neural network framework learns affinities from appearance and motion then solves bipartite matching for online multiple-object tracking.
Attention U-Net: Learning Where to Look for the Pancreas cs.CV · 2018-04-11 · unverdicted · none · ref 31 · internal anchor
Attention gates added to U-Net automatically focus on target organs in CT images and improve segmentation performance on abdominal datasets.
Case-Aware Medical Image Classification with Multimodal Knowledge Graphs and Reliability-Guided Refinement cs.CV · 2026-05-21 · unverdicted · none · ref 36 · 2 links · internal anchor
The paper presents a case-aware multimodal knowledge graph approach for medical image classification that retrieves similar cases, propagates knowledge via graph attention, and refines predictions with reliability estimates.
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks cs.CV · 2019-07-04 · unverdicted · none · ref 22 · internal anchor
Social-BiGAT is a graph-based generative adversarial network using GAT for social interaction features and Bicycle-GAN for multimodal outputs that reports state-of-the-art results on pedestrian trajectory forecasting benchmarks.
MolSight: A Graph-Aware Vision-Language Model for Unified Chemical Image Understanding cs.CV · 2026-07-02 · unverdicted · none · ref 82 · internal anchor
MolSight integrates a Molecular Topology Module and Molecular Grounding Module into VLMs to enhance molecular image understanding and claims to outperform prior models on chemical visual tasks.
A Dual Cross-Attention Graph Learning Framework For Multimodal MRI-Based Major Depressive Disorder Detection cs.CV · 2026-04-11 · unverdicted · none · ref 42 · internal anchor
Dual cross-attention fusion of sMRI and rs-fMRI data achieves 84.71% accuracy in MDD detection on the REST-meta-MDD dataset, outperforming concatenation on functional atlases.
Bridging the Dimensionality Gap: A Taxonomy and Survey of 2D Vision Model Adaptation for 3D Analysis cs.CV · 2026-04-03 · unverdicted · none · ref 6 · internal anchor
The paper offers a taxonomy of 2D-to-3D adaptation strategies divided into data-centric projection, architecture-centric 3D networks, and hybrid methods that combine both.
Leveraging Medical Foundation Model Features in Graph Neural Network-Based Retrieval of Breast Histopathology Images cs.CV · 2024-05-07 · unverdicted · none · ref 55 · internal anchor
A graph autoencoder model using foundation model features achieves high retrieval accuracy (mAP 96.7-97.6%, mMV 91.5-94.2%) on BreakHis and BACH breast cancer histopathology datasets.
PH-GCN: Person Re-identification with Part-based Hierarchical Graph Convolutional Network cs.CV · 2019-07-20 · unverdicted · none · ref 26 · internal anchor
PH-GCN constructs a hierarchical graph of person parts and performs local/global feature learning via message passing in an end-to-end network for person re-identification.

Graph Attention Networks

hub tools

citation-role summary

citation-polarity summary

claims ledger

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer