Zimtohrli: An efficient psychoacoustic audio similarity metric

2025 · arXiv 2509.26133

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Neural Audio Codec with Adjustable Token Temporal Resolution Using Sampling-Frequency-Independent Convolutional Layers

eess.AS · 2026-07-02 · unverdicted · novelty 6.0

A single neural audio codec can operate at multiple token temporal resolutions by generating TTR-dependent convolutional kernels from shared parameters while adjusting kernel size and stride.

DTT-BSR+: A Generative-Regression Cascade for Music Source Restoration

eess.AS · 2026-06-23 · unverdicted · novelty 4.0

DTT-BSR+ is a generative-then-regression cascade for music source restoration that reports MMSNR gains over single-stage DTT-BSR and X-LANCE on most stems while noting a distribution-vs-reconstruction trade-off via FAD.

citing papers explorer

Showing 2 of 2 citing papers.

Neural Audio Codec with Adjustable Token Temporal Resolution Using Sampling-Frequency-Independent Convolutional Layers

eess.AS · 2026-07-02 · unverdicted · none · ref 29

A single neural audio codec can operate at multiple token temporal resolutions by generating TTR-dependent convolutional kernels from shared parameters while adjusting kernel size and stride.

DTT-BSR+: A Generative-Regression Cascade for Music Source Restoration

eess.AS · 2026-06-23 · unverdicted · none · ref 32

DTT-BSR+ is a generative-then-regression cascade for music source restoration that reports MMSNR gains over single-stage DTT-BSR and X-LANCE on most stems while noting a distribution-vs-reconstruction trade-off via FAD.
