Introduces ChronoEarth-492K, a 492K-patch temporally calibrated hyperspectral dataset from the EO-1 Hyperion archive spanning 2001-2017, plus a benchmark for static, short-horizon, and long-horizon spatiotemporal tasks using open geospatial products.
hub
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
28 Pith papers cite this work. Polarity classification is still indexing.
abstract
Unprecedented volumes of Earth observation data are continually collected around the world, but high-quality labels remain scarce given the effort required to make physical measurements and observations. This has led to considerable investment in bespoke modeling efforts translating sparse labels into maps. Here we introduce AlphaEarth Foundations, an embedding field model yielding a highly general, geospatial representation that assimilates spatial, temporal, and measurement contexts across multiple sources, enabling accurate and efficient production of maps and monitoring systems from local to global scales. The embeddings generated by AlphaEarth Foundations are the only to consistently outperform a suite of other well-known/widely accepted featurization approaches tested on a diverse set of mapping evaluations without re-training. We have released a dataset of global, annual, analysis-ready embedding field layers from 2017 through 2024.
hub tools
citation-role summary
citation-polarity summary
representative citing papers
Fusing embeddings from four Earth models (AlphaEarth, Tessera, GeoCLIP, SatCLIP) outperforms the best single model on four of six tasks, with gains depending on task and location.
TrajGANR learns continuous neural representations of trajectories to enable fine-grained alignment with street-view images and locations in a joint multimodal self-supervised objective, outperforming prior geospatial MSSL methods on urban mobility and road tasks.
UNIGEOCLIP creates a unified embedding for aerial imagery, street views, elevation, text, and coordinates via all-to-all contrastive alignment plus a scaled lat-long encoder, outperforming single-modality and coordinate baselines on geospatial tasks.
A new spatial affinity component for self-supervised pretraining leverages high-resolution imagery to enhance mid-resolution satellite image representations and segmentation performance.
SpectralEarth-FM is a multisensor hierarchical transformer pretrained on a 40TB co-located HSI-MSI-SAR dataset using a JEPA-style objective and reports state-of-the-art results on hyperspectral and standard EO benchmarks.
FLUXtrapolation is a benchmark for domain generalization in ecosystem flux upscaling using temporal, spatial, and temperature-based extrapolation scenarios, with pilot results showing model separation on tail and multi-scale metrics.
A transformer-based in-context learning model predicts continental-scale subsurface temperatures from sparse borehole observations, outperforming physics and interpolation baselines while adapting to new regions with 20 examples.
Introduces WILDFIRE-FM and a fixed-contract evaluation framework demonstrating that wildfire model transfer conclusions depend strongly on evaluation design and task formulation.
An audit of 152 papers reveals that geospatial foundation models lack standardized evaluations, training controls, and weight releases, so no one knows the state of the art.
A new optimization algorithm with double machine learning for wildfire spread estimation enables better crew assignments that reduce total area burned.
A proxy consistency loss trains location encoders on proxy geographic data to outperform direct input fusion or frozen embeddings for air quality and poverty mapping with sparse labels.
EFDiff conditions a diffusion model with Prithvi-EO-2.0 geospatial embeddings via cross-attention to achieve 32x LST super-resolution, outperforming baselines on a global Landsat dataset.
TESSERA embeddings achieve the highest IoU (0.77-0.82) for 10m LCZ mapping across Swiss cities and outperform Sentinel-1/2 and AlphaEarth, though year-to-year transfer remains challenging.
Linear classifier on Clay v1.5 embeddings produces continuous biome probabilities that raise mean per-species AUC for occurrence prediction from 0.570 (discrete labels) to 0.618 on 10,015 Brazilian forest plots.
A fleet of sensor-specialized 22M-parameter JEPA models routed by an LLM improves LLM-as-judge scores on hydrologic questions over AlphaEarth alone with Cohen's d of 1.10.
A visual analytics workbench enables scientists to explore, query, and verify embedding-based similarity searches on weather and climate data by tracing results back to physical evidence.
neuroGravity reconstructs transferable human mobility networks from basic urban data via physics-informed deep learning, with transferability predicted by a spatial income segregation index.
A prompting-based adaptation technique lets RGB-trained LMMs process multi-spectral inputs and deliver strong zero-shot gains on remote-sensing benchmarks.
SSDM decouples global geospatial embeddings into structural modulation and semantic injection pathways to improve accuracy and consistency in high-resolution remote sensing land cover mapping.
HuiYanEarth-SAR is a foundation model that generates realistic global SAR imagery from geographic coordinates alone by integrating geospatial semantics and implicit scattering characteristics.
LIANet encodes multi-temporal Earth observation data into a coordinate-based neural field that supports label-only fine-tuning for downstream tasks without access to raw imagery.
Earth embeddings from satellite images predict neighborhood-level urban indicators with higher accuracy for built-environment outcomes than for behavior-driven ones, showing city-specific variation but year-to-year stability.
FireScope trains a VLM on US data to output wildfire risk rasters with reasoning traces and shows improved cross-continental performance on European events compared with prior approaches.
citing papers explorer
No citing papers match the current filters.