Hearing between the lines: Unlocking the reasoning power of llms for speech evaluation,

· 2026 · arXiv 2601.13742

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

ParaPairAudioBench: Paralinguistic Pairwise Audio Benchmark for LALM-as-a-Judge

cs.SD · 2026-06-23 · unverdicted · novelty 7.0

ParaPairAudioBench is a new pairwise benchmark showing LALM judges lag human paralinguistic judgments by 32 percentage points with poor tie calibration across style, rate, emphasis, age, and gender.

citing papers explorer

Showing 1 of 1 citing paper.

ParaPairAudioBench: Paralinguistic Pairwise Audio Benchmark for LALM-as-a-Judge cs.SD · 2026-06-23 · unverdicted · none · ref 27
ParaPairAudioBench is a new pairwise benchmark showing LALM judges lag human paralinguistic judgments by 32 percentage points with poor tie calibration across style, rate, emphasis, age, and gender.

Hearing between the lines: Unlocking the reasoning power of llms for speech evaluation,

fields

years

verdicts

representative citing papers

citing papers explorer