EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

1 Technical University of Munich, Germany
2 University of Applied Sciences Landshut, Germany
3 TU Wien, Austria
ECCV 2024
MY ALT TEXT

Distortion-perception comparison (top left is best).

Abstract

We introduce EGIC, an enhanced generative image compression method that allows traversing the distortion-perception curve efficiently from a single model. EGIC is based on two novel building blocks: i) OASIS-C, a conditional pre-trained semantic segmentation-guided discriminator, which provides both spatially and semantically-aware gradient feedback to the generator, conditioned on the latent image distribution, and ii) Output Residual Prediction (ORP), a retrofit solution for multi-realism image compression that allows control over the synthesis process by adjusting the impact of the residual between an MSE-optimized and GAN-optimized decoder output on the GAN-based reconstruction. Together, EGIC forms a powerful codec, outperforming state-of-the-art diffusion and GAN-based methods (e.g., HiFiC, MS-ILLM, and DIRAC-100), while performing almost on par with VTM-20.0 on the distortion end. EGIC is simple to implement, very lightweight, and provides excellent interpolation characteristics, which makes it a promising candidate for practical applications targeting the low bit range.

Interactive Demo - EGIC vs JPEG

Zoom-in for more details

0.159bpp vs 0.260bpp (1.64x)

Compare EGIC to other methods

BibTeX

@article{körber2024egic,
  title={EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation},
  author={Nikolai Körber and Eduard Kromer and Andreas Siebert and Sascha Hauke and Daniel Mueller-Gritschneder and Björn Schuller},
  journal={arXiv preprint arXiv:2309.03244},
  year={2024}
}