Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation

Anqi Li; Feng Li; Huihui Bai; Runmin Cong; Yao Zhao; Yuxi Liu

arxiv: 2406.00758 · v4 · pith:P3DJZJ4Cnew · submitted 2024-06-02 · 📡 eess.IV · cs.CV· cs.MM

Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation

Anqi Li , Feng Li , Yuxi Liu , Runmin Cong , Yao Zhao , Huihui Bai This is my paper

classification 📡 eess.IV cs.CVcs.MM

keywords compressionimagecontrol-gicadaptioncontrollablegenerativebitratecapable

0 comments

read the original abstract

Although recent generative image compression methods have demonstrated impressive potential in optimizing the rate-distortion-perception trade-off, they still face the critical challenge of flexible rate adaption to diverse compression necessities and scenarios. To overcome this challenge, this paper proposes a Controllable Generative Image Compression framework, termed Control-GIC, the first capable of fine-grained bitrate adaption across a broad spectrum while ensuring high-fidelity and generality compression. Control-GIC is grounded in a VQGAN framework that encodes an image as a sequence of variable-length codes (i.e. VQ-indices), which can be losslessly compressed and exhibits a direct positive correlation with the bitrates. Drawing inspiration from the classical coding principle, we correlate the information density of local image patches with their granular representations. Hence, we can flexibly determine a proper allocation of granularity for the patches to achieve dynamic adjustment for VQ-indices, resulting in desirable compression rates. We further develop a probabilistic conditional decoder capable of retrieving historic encoded multi-granularity representations according to transmitted codes, and then reconstruct hierarchical granular features in the formalization of conditional probability, enabling more informative aggregation to improve reconstruction realism. Our experiments show that Control-GIC allows highly flexible and controllable bitrate adaption where the results demonstrate its superior performance over recent state-of-the-art methods. Code is available at https://github.com/lianqi1008/Control-GIC.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MoECodec: Image Compression for joint human and machine perception via Mixture-of-Experts
eess.IV 2026-06 unverdicted novelty 6.0

MoECodec replaces FFN layers with token-wise MoE plus stable routing and GShMLP experts to support multiple downstream tasks in a single image compression model.