Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: In this work, we propose using a unified representation, termed Factorized
Features, for low-level vision tasks, where we test on Single Image
Super-Resolution (SISR) and \textbf{Image Compression}. Motivated by the shared
principles between these tasks, they require recovering and preserving fine
image details, whether by enhancing resolution for SISR or reconstructing
compressed data for Image Compression. Unlike previous methods that mainly
focus on network architecture, our proposed approach utilizes a
basis-coefficient decomposition as well as an explicit formulation of
frequencies to capture structural components and multi-scale visual features in
images, which addresses the core challenges of both tasks. We replace the
representation of prior models from simple feature maps with Factorized
Features to validate the potential for broad generalizability. In addition, we
further optimize the compression pipeline by leveraging the mergeable-basis
property of our Factorized Features, which consolidates shared structures on
multi-frame compression. Extensive experiments show that our unified
representation delivers state-of-the-art performance, achieving an average
relative improvement of 204.4% in PSNR over the baseline in Super-Resolution
(SR) and 9.35% BD-rate reduction in Image Compression compared to the previous
SOTA. Project page: https://jayisaking.github.io/FIPER/
Authors (5)
Yang-Che Sun
Cheng Yu Yeo
Ernie Chu
Jun-Cheng Chen
Yu-Lun Liu
Submitted
October 23, 2024
Key Contributions
Proposes Factorized Features, a unified representation for low-level vision tasks like Single Image Super-Resolution (SISR) and Image Compression. This representation uses basis-coefficient decomposition and explicit frequency formulation to capture structural and multi-scale features, addressing core challenges in both tasks and demonstrating broad generalizability.
Business Value
Enables higher quality image reconstruction for both upscaling low-resolution images and compressing images for efficient storage and transmission. This has broad applications in media, communication, and imaging devices.