Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 80% Match Research Paper Researchers in computer vision and image processing,Developers of image editing software,Engineers working on image compression standards 3 days ago

FIPER: Factorized Features for Robust Image Super-Resolution and Compression

computer-vision › scene-understanding
📄 Abstract

Abstract: In this work, we propose using a unified representation, termed Factorized Features, for low-level vision tasks, where we test on Single Image Super-Resolution (SISR) and \textbf{Image Compression}. Motivated by the shared principles between these tasks, they require recovering and preserving fine image details, whether by enhancing resolution for SISR or reconstructing compressed data for Image Compression. Unlike previous methods that mainly focus on network architecture, our proposed approach utilizes a basis-coefficient decomposition as well as an explicit formulation of frequencies to capture structural components and multi-scale visual features in images, which addresses the core challenges of both tasks. We replace the representation of prior models from simple feature maps with Factorized Features to validate the potential for broad generalizability. In addition, we further optimize the compression pipeline by leveraging the mergeable-basis property of our Factorized Features, which consolidates shared structures on multi-frame compression. Extensive experiments show that our unified representation delivers state-of-the-art performance, achieving an average relative improvement of 204.4% in PSNR over the baseline in Super-Resolution (SR) and 9.35% BD-rate reduction in Image Compression compared to the previous SOTA. Project page: https://jayisaking.github.io/FIPER/
Authors (5)
Yang-Che Sun
Cheng Yu Yeo
Ernie Chu
Jun-Cheng Chen
Yu-Lun Liu
Submitted
October 23, 2024
arXiv Category
eess.IV
arXiv PDF

Key Contributions

Proposes Factorized Features, a unified representation for low-level vision tasks like Single Image Super-Resolution (SISR) and Image Compression. This representation uses basis-coefficient decomposition and explicit frequency formulation to capture structural and multi-scale features, addressing core challenges in both tasks and demonstrating broad generalizability.

Business Value

Enables higher quality image reconstruction for both upscaling low-resolution images and compressing images for efficient storage and transmission. This has broad applications in media, communication, and imaging devices.