Wavjepa: Semantic learning unlocks robust audio foundation models for raw waveforms

· 2025 · arXiv 2509.23238

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Probing Spatial Structure in Pretrained Audio Representations

cs.SD · 2026-06-04 · unverdicted · novelty 7.0

Introduces SARL benchmark showing pretrained audio encoders encode source-level spatial factors more readily than room-level factors, with patterns shaped by input configuration and training paradigm.

OLIVE: View-Augmented Latent Prediction with Waveform Reconstruction for Speech SSL

cs.CL · 2026-06-29 · unverdicted · novelty 4.0

OLIVE is a new self-supervised speech representation framework that unifies view-augmented masked latent prediction with waveform reconstruction under one objective.

citing papers explorer

Probing Spatial Structure in Pretrained Audio Representations cs.SD · 2026-06-04 · unverdicted · none · ref 25
