Perceptually-Weighted Video Quality Metric for Asymmetric Encoded Sports Videos

Alexander Kopte; Andr\'e Kaup; Anna Meyer; Diwakara Reddy; Jonas Janzen; Marc Windsheimer; Paul Wawerek-L\'opez; Simon Deniffel

read the original abstract

Objective video quality metrics commonly assume uniform spatial attention, an assumption that conflicts with the selective nature of human visual perception, particularly in sports videos. Here, allocating more bits for salient regions through semantic encoding can lead to significant bitrate savings. We present a Perceptually-Weighted Video Quality Metric (PW-VQM), a full-reference metric that accounts for the unequal perceptual importance of spatial regions and therefore targets quality evaluation for asymmetrically encoded content. SSIM maps computed in a multiscale wavelet domain are weighted by differentiating between foreground and background regions. Perceptually salient foreground regions are identified by combining open-vocabulary object detection with optical flow analysis, and are assigned higher weight during quality aggregation. Evaluated on sports video content, PW-VQM achieves a Spearman Rank Order Correlation Coefficient of 0.9511, outperforming established metrics including SSIM, VMAF, FUNQUE, and LPIPS. An ablation study confirms the individual contributions of the components of the perceptual weighting.

Perceptually-Weighted Video Quality Metric for Asymmetric Encoded Sports Videos

discussion (0)