Conditional Perceptual Quality Preserving Image Compression

Dailan He; Hongwei Qin; Jingjing Liu; Qian Zhang; Tongda Xu; Yanghao Li; Yan Wang; Ya-Qin Zhang; Yuanyuan Wang; Zhe Wang

arxiv: 2308.08154 · v1 · pith:QY2RK7ZCnew · submitted 2023-08-16 · 📡 eess.IV · cs.CV

Conditional Perceptual Quality Preserving Image Compression

Tongda Xu , Qian Zhang , Yanghao Li , Dailan He , Zhe Wang , Yuanyuan Wang , Hongwei Qin , Yan Wang

show 2 more authors

Jingjing Liu Ya-Qin Zhang

This is my paper

classification 📡 eess.IV cs.CV

keywords qualityperceptualconditionalcompressiondefinedimageinformationoriginal

0 comments

read the original abstract

We propose conditional perceptual quality, an extension of the perceptual quality defined in \citet{blau2018perception}, by conditioning it on user defined information. Specifically, we extend the original perceptual quality $d(p_{X},p_{\hat{X}})$ to the conditional perceptual quality $d(p_{X|Y},p_{\hat{X}|Y})$, where $X$ is the original image, $\hat{X}$ is the reconstructed, $Y$ is side information defined by user and $d(.,.)$ is divergence. We show that conditional perceptual quality has similar theoretical properties as rate-distortion-perception trade-off \citep{blau2019rethinking}. Based on these theoretical results, we propose an optimal framework for conditional perceptual quality preserving compression. Experimental results show that our codec successfully maintains high perceptual quality and semantic quality at all bitrate. Besides, by providing a lowerbound of common randomness required, we settle the previous arguments on whether randomness should be incorporated into generator for (conditional) perceptual quality compression. The source code is provided in supplementary material.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Semantics Disentanglement and Composition for Universal Image Coding with Efficiently LLM Reasoning and Generative Diffusion
cs.CV 2024-12 unverdicted novelty 6.0

UniCodec uses LLM-driven semantic disentanglement at the encoder and diffusion-based compositional generation at the decoder to enable one codec for both human perception and machine vision tasks without task-specific...