Unsupervised Morphological Characterization of Gravitational-Wave Glitches in LIGO O4a Using Frozen DINOv2 Features
Pith reviewed 2026-06-29 09:54 UTC · model grok-4.3
The pith
No morphologically novel glitches appear in LIGO O4a data beyond known classes.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Application of frozen DINOv2 embeddings to Q-transform spectrograms of O4a strain data, followed by PCA-UMAP projection and Dirichlet Process Mixture Model clustering, yields no clusters that deviate from known Gravity Spy morphologies; all anomalous clusters achieve cosine similarity greater than 0.98 to reference classes, establishing a null result for novel glitch morphologies in the examined dataset.
What carries the argument
Frozen DINOv2 Vision Transformer (ViTS/14 with register tokens) embeddings from Q-transform spectrograms, used to drive unsupervised PCA-UMAP-DPMM clustering and similarity checks against a Gravity Spy O3b reference index.
If this is right
- The pipeline supplies a validated zero-shot baseline that can be reapplied to later observing runs without retraining.
- L1 embeddings maintain high cluster stability (ARI above 0.90) across all four sessions examined.
- H1 embeddings exhibit lower and more variable stability, pointing to detector-specific structure in the noise manifold.
- Time-slide tests show no statistically significant H1-L1 coincidences in any session.
Where Pith is reading between the lines
- If the embedding space proves complete, any future cluster falling outside known classes would indicate a genuine change in detector behavior rather than an analysis artifact.
- The observed difference in robustness between H1 and L1 suggests separate per-detector reference sets may improve sensitivity in follow-up work.
- The same embedding-plus-clustering approach could be tested on other transient signals represented as time-frequency images.
Load-bearing premise
The frozen DINOv2 embeddings extracted from the spectrograms already contain enough information to reveal any new glitch morphologies that might exist.
What would settle it
Discovery of at least one cluster whose embeddings have cosine similarity below 0.98 to every known Gravity Spy class would falsify the claim that no novel morphologies are present.
Figures
read the original abstract
A central open question in gravitational-wave detector characterization is whether the O4a observing run has introduced glitch morphologies not present in earlier runs. We present gravi-signal-ml, an open-source pipeline for unsupervised morphological characterization of instrumental noise transients (glitches) in LIGO gravitational-wave data, applied to 1,277 hours of public O4a strain data from the Hanford and Livingston detectors. The pipeline extracts 384-dimensional visual embeddings from Q-transform spectrograms using a frozen DINOv2 Vision Transformer with register tokens (ViTS/14), requiring no labeled training data. Embeddings are projected via PCA and UMAP with cosine metric, then clustered using a Dirichlet Process Mixture Model (DPMM). Cluster robustness is systematically assessed through ablation studies, stability analysis across hyperparameter perturbations, and morphological cross-check against an in-domain Gravity Spy O3b reference index. A time-slide background test excludes statistically significant H1--L1 coincidences ($p \geq 0.1$) in all sessions. Across 188,000+ spectrograms, no morphologically novel glitch candidates were identified -- all anomalous clusters map to known Gravity Spy classes with cosine similarity $> 0.98$. L1 embeddings show consistently high robustness (ablation ARI $> 0.90$ in all four sessions), while H1 exhibits lower and more variable grayscale ablation ARI ($\sim 0.68$--$0.90$), suggesting a structural difference in the H1 noise manifold under DINOv2 feature extraction. This null result, obtained with a fully validated pipeline, establishes a reproducible baseline for zero-shot glitch morphology characterization in O4a data. The pipeline and all results are publicly available at https://github.com/lucacirfeta/dante-gravi-signal-ml DOI: https://doi.org/10.5281/zenodo.20121860.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an open-source pipeline (gravi-signal-ml) for unsupervised characterization of LIGO glitches in 1,277 hours of O4a data. It extracts 384-dimensional embeddings from Q-transform spectrograms using a frozen DINOv2 ViT-S/14 model, projects them via PCA+UMAP, and clusters with a Dirichlet Process Mixture Model. Supported by ablation studies, stability analysis, Gravity Spy O3b cross-validation, and time-slide background tests, the central claim is a null result: across 188,000+ spectrograms, all anomalous clusters map to known Gravity Spy classes with cosine similarity >0.98, with no morphologically novel candidates identified. L1 shows high robustness while H1 exhibits lower and more variable performance.
Significance. If the embedding space is shown to be sensitive to potential new morphologies, the work supplies a reproducible, label-free baseline for glitch morphology searches in future runs, strengthened by the fully public code, data, and results at the cited GitHub and Zenodo links. This is a concrete contribution to detector characterization in the gravitational-wave instrumentation community.
major comments (2)
- [Abstract and pipeline description] Abstract and pipeline description: The null result (no new morphologies identified) is load-bearing on the assumption that any genuinely novel glitch morphology would produce a distinct DPMM cluster with cosine similarity ≤0.98 to the Gravity Spy O3b reference. The manuscript reports no test (e.g., synthetic injection of time-frequency patterns outside the Gravity Spy span) confirming that frozen DINOv2 embeddings on spectrograms would separate such cases rather than collapsing them into existing clusters.
- [Ablation studies] Ablation studies: The grayscale ablation ARI drop for H1 (0.68–0.90) versus L1 (>0.90) demonstrates that the feature extractor is sensitive to input representation, yet no analysis quantifies how this variability impacts the detection threshold for novel morphologies or whether the reported cross-validation remains valid under the lower-ARI regime.
minor comments (1)
- The manuscript would benefit from an explicit limitations subsection discussing the domain shift between DINOv2's natural-image pretraining and Q-transform spectrograms.
Simulated Author's Rebuttal
We thank the referee for their constructive comments, which help clarify the scope and limitations of our null result. We address each major comment below and will revise the manuscript accordingly to improve transparency.
read point-by-point responses
-
Referee: The null result (no new morphologies identified) is load-bearing on the assumption that any genuinely novel glitch morphology would produce a distinct DPMM cluster with cosine similarity ≤0.98 to the Gravity Spy O3b reference. The manuscript reports no test (e.g., synthetic injection of time-frequency patterns outside the Gravity Spy span) confirming that frozen DINOv2 embeddings on spectrograms would separate such cases rather than collapsing them into existing clusters.
Authors: We agree this assumption is central to interpreting the null result. The pipeline's validity for known classes is supported by the Gravity Spy O3b cross-validation (high ARI and cosine similarities >0.98) and the fact that DINOv2 was pretrained on a broad image corpus, enabling zero-shot feature extraction. However, we did not conduct synthetic injections of morphologies outside the Gravity Spy taxonomy. In the revised manuscript we will add an explicit limitations subsection discussing this point, including a qualitative assessment of embedding sensitivity based on existing ablation stability and a recommendation for future controlled-injection studies. This addition will not change the reported O4a findings but will better bound the claim. revision: yes
-
Referee: The grayscale ablation ARI drop for H1 (0.68–0.90) versus L1 (>0.90) demonstrates that the feature extractor is sensitive to input representation, yet no analysis quantifies how this variability impacts the detection threshold for novel morphologies or whether the reported cross-validation remains valid under the lower-ARI regime.
Authors: The referee is correct that the reported H1 ARI variability indicates greater sensitivity to input representation. We already highlight this difference in the manuscript as evidence of a structural distinction in the H1 noise manifold. We did not, however, propagate the ARI variation into a quantitative estimate of its effect on novel-morphology detection thresholds or re-evaluate cross-validation robustness specifically in the lower-ARI regime. In revision we will extend the ablation section with a short analysis (e.g., stability of cluster assignments under ARI-thresholded subsampling) and a brief statement on how this affects interpretation of the H1 null result. This will be added without altering the core results or conclusions. revision: yes
Circularity Check
No circularity: external reference index and time-slide tests provide independent validation
full rationale
The paper's central null result (no novel morphologies across 188k+ spectrograms) is obtained by clustering DINOv2 embeddings with DPMM and then mapping clusters to an external Gravity Spy O3b reference index via cosine similarity >0.98, supplemented by time-slide background tests (p>=0.1) and ablation studies. No step reduces by construction to a fitted parameter, self-definition, or self-citation chain; the pipeline is unsupervised with a frozen pretrained extractor, and the comparison benchmark is independent of the O4a data under analysis. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Frozen DINOv2 embeddings on Q-transform spectrograms capture sufficient morphological information to detect novel glitch classes if present
Forward citations
Cited by 2 Pith papers
-
Patch-Level DINOv2 Scoring for Gravitational-Wave Glitch Detection: Breaking the Signal Dilution Barrier via Vector-Quantized Local Feature Indexing
Patch-level top-k similarity scoring against a vector-quantized DINOv2 reference index yields KS=0.963 separation for extended glitch morphologies on LIGO O4a data, addressing global CLS token dilution.
-
Sensitivity Limits and Operational Threshold Calibration for DINOv2-based Gravitational-Wave Glitch Characterization: A Strain-Domain Mock Data Challenge on LIGO O4a
Mock data challenge shows DINOv2 pipeline recovers high-SNR anisotropic glitches under dynamic thresholding but yields zero recall for all morphologies under a low-FPR operational threshold due to global average pooling.
Reference graph
Works this paper leans on
-
[1]
Campello, R. J. G. B., Moulavi, D., & Sander, J. 2013, Advances in Knowledge Discovery and Data Mining, Lecture Notes in Computer Science, 7819, 160, https://doi.org/10.1007/978-3-642-37456-2 14
-
[2]
Cabero, M., Lundgren, A., Nitz, A. H., et al. 2019, Classical and Quantum Gravity, 36, 15, https://doi.org/10.1088/1361-6382/ab2e14
-
[3]
W., Bahaadini, S., Rohani, N., et al
Coughlin, M. W., Bahaadini, S., Rohani, N., et al. 2019, Physical Review D, 99, 082002, https://doi.org/10.1103/PhysRevD.99.082002
-
[4]
Vision Transformers Need Registers
Darcet, T., Oquab, M., Doup´ e, E., & Bourdoukan, R. 2024, International Conference on Learning Representations (ICLR), arXiv:2309.16588
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[5]
2020,Machine Learning: Science and Technology, 2, 015004, https://doi.org/10.1088/2632-2153/abab5f
Essick, R., Godwin, P., Hanna, C., et al. 2020,Machine Learning: Science and Technology, 2, 015004, https://doi.org/10.1088/2632-2153/abab5f
-
[6]
Ferguson, T. S. 1973,The Annals of Statistics, 1, 209, https://doi.org/10.1214/aos/1176342360
-
[7]
Glanzer, J., Banagiri, S., Coughlin, S. B., et al. 2023, Classical and Quantum Gravity, 40, 065004, https://doi.org/10.1088/1361-6382/acb633
-
[8]
2026,gravi-signal-ml: Unsupervised Morphological Characterization of Gravitational-Wave
Cirfeta, L. 2026,gravi-signal-ml: Unsupervised Morphological Characterization of Gravitational-Wave
2026
-
[9]
Glitches, Zenodo, https://doi.org/10.5281/zenodo.20121860 LIGO Scientific Collaboration, Virgo Collaboration, & KAGRA Collaboration 2023,Gravitational Wave Open Science Center O4a Data Release, https://gwosc.org
-
[10]
Macas, R., Ashton, G., & the gwpy Developers 2021,gwpy: A Python package for gravitational-wave astrophysics, https://doi.org/10.5281/zenodo.5504573
-
[11]
Li, T., et al. 2024,arXiv e-prints, arXiv:2404.15552 LIGO Scientific Collaboration 2015,Classical and Quantum Gravity, 32, 074001, https://doi.org/10.1088/0264-9381/32/7/074001
-
[12]
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction
McInnes, L., Healy, J., & Melville, J. 2018,arXiv e-prints, arXiv:1802.03426 Nu˜ nez, J. R., Anderton, C. R., & Renslow, R. S. 2018, PLOS ONE, 13, e0199239, https://doi.org/10.1371/journal.pone.0199239
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1371/journal.pone.0199239 2018
-
[13]
DINOv2: Learning Robust Visual Features without Supervision
Oquab, M., Darcet, T., Moutakanni, T., et al. 2023,arXiv e-prints, arXiv:2304.07193 8Cirfeta
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[14]
2019,Advances in Neural Information Processing Systems, 32, https://pytorch.org
Paszke, A., Gross, S., Massa, F., et al. 2019,Advances in Neural Information Processing Systems, 32, https://pytorch.org
2019
-
[15]
2011, Journal of Machine Learning Research, 12, 2825
Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, Journal of Machine Learning Research, 12, 2825
2011
-
[16]
2025,arXiv e-prints, arXiv:2409.02831
Soni, S., et al. 2025,arXiv e-prints, arXiv:2409.02831
-
[17]
Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020,Nature, 585, 357, https://doi.org/10.1038/s41586-020-2649-2
-
[18]
2017,Classical and Quantum Gravity, 34, 064003, https://doi.org/10.1088/1361-6382/aa5cea
Zevin, M., Coughlin, S., Bahaadini, S., et al. 2017,Classical and Quantum Gravity, 34, 064003, https://doi.org/10.1088/1361-6382/aa5cea
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.