Spherical CNNs

Jonas Koehler; Mario Geiger; Max Welling; Taco S. Cohen

arxiv: 1801.10130 · v3 · pith:VFP3KHCUnew · submitted 2018-01-30 · cs.LG · stat.ML

Taco S. Cohen, Mario Geiger, Jonas Koehler, Max Welling

classification: cs.LG, stat.ML

keywords: spherical, cnns, problems, convolutional, fourier, generalized, images, networks

Convolutional Neural Networks (CNNs) have become the method of choice for learning problems involving 2D planar images. However, a number of problems of recent interest have created a demand for models that can analyze spherical images. Examples include omnidirectional vision for drones, robots, and autonomous cars, molecular regression problems, and global weather and climate modelling. A naive application of convolutional networks to a planar projection of the spherical signal is destined to fail, because the space-varying distortions introduced by such a projection will make translational weight sharing ineffective. In this paper we introduce the building blocks for constructing spherical CNNs. We propose a definition for the spherical cross-correlation that is both expressive and rotation-equivariant. The spherical correlation satisfies a generalized Fourier theorem, which allows us to compute it efficiently using a generalized (non-commutative) Fast Fourier Transform (FFT) algorithm. We demonstrate the computational efficiency, numerical accuracy, and effectiveness of spherical CNNs applied to 3D model recognition and atomization energy regression.
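The space-varying distortion mentioned in the abstract can be illustrated with a small sketch. It assumes an equirectangular (latitude-longitude) grid, which is one common planar projection and is chosen here only for illustration; the function name `equirect_row_solid_angles` is hypothetical and does not come from the paper. The code computes the solid angle covered by one pixel in each row of the grid, showing that pixels near the poles cover far less of the sphere than pixels at the equator, so a fixed planar filter sees differently scaled content at different latitudes.

```python
import math


def equirect_row_solid_angles(n_lat, n_lon):
    """Solid angle (in steradians) of one pixel in each row of an
    n_lat x n_lon equirectangular grid (illustrative assumption)."""
    d_lon = 2 * math.pi / n_lon   # longitude width of a pixel
    d_lat = math.pi / n_lat       # latitude height of a pixel
    areas = []
    for i in range(n_lat):
        lat_lo = -math.pi / 2 + i * d_lat
        lat_hi = lat_lo + d_lat
        # Exact area of a latitude-longitude cell on the unit sphere.
        areas.append(d_lon * (math.sin(lat_hi) - math.sin(lat_lo)))
    return areas


n_lat, n_lon = 64, 128
row_areas = equirect_row_solid_angles(n_lat, n_lon)

# Sanity check: all pixels together cover the whole sphere (4*pi steradians).
total_area = n_lon * sum(row_areas)

# Ratio between the largest (equatorial) and smallest (polar) pixel footprint.
distortion_ratio = max(row_areas) / min(row_areas)

print(f"total solid angle: {total_area:.6f} (4*pi = {4 * math.pi:.6f})")
print(f"equator-to-pole pixel area ratio: {distortion_ratio:.1f}")
```

The ratio grows as the grid gets finer, which is one way to see why translating a planar filter across such an image does not correspond to a single consistent operation on the sphere.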

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Fast contracted Clebsch-Gordan tensor products for equivariant graph neural networks
physics.comp-ph 2026-05 unverdicted novelty 7.0

An O(L^3) algorithm computes contracted Clebsch-Gordan tensor products for equivariant ML potentials using a structured angular grid and spherical Poisson bracket to handle parity-odd terms at fixed CP rank.

Mapped Convolutions
cs.CV 2019-06 unverdicted novelty 7.0

Mapped convolutions generalize standard convolutions by decoupling sampling and weighting, enabling direct convolution on spherical and mesh data with a 17% improvement in spherical depth estimation.

SurReal: Fréchet Mean and Distance Transform for Complex-Valued Deep Learning
cs.CV 2019-06 unverdicted novelty 7.0

SurReal architecture applies weighted Fréchet mean convolution and distance-based FC layers to complex data, improving accuracy on MSTAR (94% to 98%) and RadioML with 8-10% of baseline model size.

Sphere-Depth: A Benchmark for Depth Estimation Methods with Varying Spherical Camera Orientations
cs.CV 2026-04 unverdicted novelty 6.0

Sphere-Depth benchmark shows substantial performance degradation in both general and spherical-aware depth estimation models under simulated camera pose variations.

Generalized Spherical Neural Operators: Green's Function Formulation
cs.LG 2025-12 unverdicted novelty 6.0

GSNO uses position-dependent spherical Green's functions to create flexible neural operators that adapt to non-equivariant systems on spheres while keeping spectral efficiency and grid invariance.

TetraSphere: A Neural Descriptor for O(3)-Invariant Point Cloud Analysis
cs.CV 2022-11 unverdicted novelty 6.0

TetraSphere integrates a TetraTransform based on steerable spherical neurons into VN-DGCNN to produce an O(3)-equivariant descriptor that reports new SOTA results on rotated ScanObjectNN, ModelNet40 classification, an...

Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges
cs.LG 2021-04 accept novelty 6.0

Geometric deep learning provides a unified mathematical framework based on grids, groups, graphs, geodesics, and gauges to explain and extend neural network architectures by incorporating physical regularities.

Cortical Surface Parcellation using Spherical Convolutional Neural Networks
q-bio.NC 2019-07 unverdicted novelty 6.0

Spherical CNNs with deformation-augmented training data achieve faster and more accurate cortical parcellation than multi-atlas or naive U-Net methods on 427 adult brains.