LoC-LIC: Low Complexity Learned Image Coding Using Hierarchical Feature Transforms

1 Friedrich-Alexander University Erlangen-Nürnberg, Germany 2 Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany

Abstract

In recent years, learned image compression has demonstrated superior rate-distortion performance compared to traditional standards. Despite these advances, legacy formats like JPEG remain dominant. The primary barrier to adoption is not quality but computational complexity. Real-time applications demand low decoding latency for a smooth user experience, whereas current learned models often require significantly longer processing times, rendering them impractical on standard hardware. We identify that the majority of this computational burden lies in the initial high-resolution convolutional layers that transform pixels into feature maps. To address this, we propose an innovative hierarchical feature extraction transform. By using fewer channels for high spatial resolution inputs and increasing channel depth only as spatial dimensions are reduced toward the latent space, we significantly cut the computational load without sacrificing bit rate reduction efficiency. This strategy reduces forward pass complexity from 1256 kMAC/Pixel to just 270 kMAC/Pixel. This architectural shift offers an immediate solution for deploying efficient learned compression on existing devices without relying on future hardware acceleration.
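As a rough check on these figures, the per-pixel cost of a single convolutional layer and the overall reduction factor can be written out as below. The symbols k (kernel size), C_in and C_out (input and output channels), and s (downsampling factor of the layer output relative to the input image) are notation introduced here for illustration, assuming a plain dense convolution; they are not taken from the paper.

```latex
% MACs per input-image pixel for one k x k convolution whose output
% grid is downsampled by a factor s relative to the input image
\text{MAC/pixel} = \frac{k^{2}\, C_{\text{in}}\, C_{\text{out}}}{s^{2}}

% Reduction implied by the reported totals
\frac{1256}{270} \approx 4.65,
\qquad
1 - \frac{270}{1256} \approx 78.5\,\%
```

Because the cost scales with 1/s^2, layers operating at or near full resolution dominate the total unless their channel counts are kept small.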

Overview

Our novel approach utilizes hierarchical feature extraction transforms to map images from the pixel domain to the latent domain and vice versa, reducing both memory and computational complexity. The key features of our approach include:

  • A low-complexity autoencoder built on our novel hierarchical feature extraction: feature representations become progressively deeper, with fewer feature maps at larger spatial sizes and more feature maps at smaller spatial sizes, reducing forward pass complexity from 1256 kMAC/Pixel to only 270 kMAC/Pixel.
  • A hyper-autoencoder with a multi-reference entropy model that maintains performance competitive with state-of-the-art models.
  • A large dataset that spans a large part of the image space manifold.
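To make the channel-versus-resolution trade-off in the first item concrete, the following minimal sketch counts multiply-accumulates per input pixel for a stack of stride-2 convolutions. Everything in it is an assumption for illustration: the function names, the 5x5 kernel, the four-stage layout, and both channel schedules are hypothetical and are not the configuration used in LoC-LIC, so the totals do not reproduce the reported 1256 and 270 kMAC/Pixel.

```python
def conv_kmacs_per_pixel(c_in, c_out, kernel, out_stride):
    """kMAC per input-image pixel for one dense conv layer whose output
    grid is downsampled by `out_stride` relative to the input image."""
    return kernel * kernel * c_in * c_out / (out_stride ** 2) / 1000.0


def transform_kmacs_per_pixel(channels, kernel=5, in_channels=3):
    """Total kMAC/pixel for a stack of stride-2 convs with the given
    output channel widths (hypothetical analysis transform)."""
    total, c_in, stride = 0.0, in_channels, 1
    for c_out in channels:
        stride *= 2  # each layer halves the spatial resolution
        total += conv_kmacs_per_pixel(c_in, c_out, kernel, stride)
        c_in = c_out
    return total


# Hypothetical channel schedules, chosen only to illustrate the idea.
uniform = [192, 192, 192, 320]    # same width at every resolution
hierarchical = [24, 48, 96, 320]  # narrow at high resolution, wide near the latent

u = transform_kmacs_per_pixel(uniform)
h = transform_kmacs_per_pixel(hierarchical)
print(f"uniform:      {u:.2f} kMAC/pixel")
print(f"hierarchical: {h:.2f} kMAC/pixel")
print(f"reduction:    {u / h:.1f}x")
```

Both schedules end with the same latent width, so the saving in this toy setup comes entirely from the narrower high-resolution layers. A real model also contains nonlinearities, the synthesis transform, and the entropy model, none of which are counted here.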

Method Diagram
Overall view of the proposed architecture with hierarchical feature encoder and decoder.

Results

Our proposed method demonstrates significant improvements in computational efficiency while maintaining competitive compression performance. The following figures illustrate the performance of our model compared to state-of-the-art methods.

The compression efficiency vs complexity of different learned image compression models.

BibTeX


      @article{ameen2026loclic,
        title={LoC-LIC: Low Complexity Learned Image Coding Using Hierarchical Feature Transforms},
        author={Ameen, Ayman A and Richter, Thomas and Kaup, Andr{\'e}},
        year={2026}
      }