Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 90% Match Research Paper Computer Vision Researchers,AI Artists,Graphic Designers,Image Processing Engineers 2 days ago

SRAGAN: Saliency Regularized and Attended Generative Adversarial Network for Chinese Ink-wash Painting Style Transfer

generative-ai › gans
📄 Abstract

Abstract: Recent style transfer problems are still largely dominated by Generative Adversarial Network (GAN) from the perspective of cross-domain image-to-image (I2I) translation, where the pivotal issue is to learn and transfer target-domain style patterns onto source-domain content images. This paper handles the problem of translating real pictures into traditional Chinese ink-wash paintings, i.e., Chinese ink-wash painting style transfer. Though a wide range of I2I models tackle this problem, a notable challenge is that the content details of the source image could be easily erased or corrupted due to the transfer of ink-wash style elements. To remedy this issue, we propose to incorporate saliency detection into the unpaired I2I framework to regularize image content, where the detected saliency map is utilized from two aspects: (\romannumeral1) we propose saliency IOU (SIOU) loss to explicitly regularize object content structure by enforcing saliency consistency before and after image stylization; (\romannumeral2) we propose saliency adaptive normalization (SANorm) which implicitly enhances object structure integrity of the generated paintings by dynamically injecting image saliency information into the generator to guide stylization process. Besides, we also propose saliency attended discriminator which harnesses image saliency information to focus generative adversarial attention onto the drawn objects, contributing to generating more vivid and delicate brush strokes and ink-wash textures. Extensive qualitative and quantitative experiments demonstrate superiority of our approach over related advanced image stylization methods in both GAN and diffusion model paradigms.
Authors (2)
Xiang Gao
Yuqi Zhang
Submitted
April 24, 2024
arXiv Category
cs.CV
Pattern Recognition, 2025, 162: 111344
arXiv PDF

Key Contributions

SRAGAN addresses the challenge of content detail corruption during style transfer by incorporating saliency detection into an unpaired I2I framework. It uses Saliency IOU (SIOU) loss to explicitly regularize object content structure, ensuring that the original image's important features are preserved while applying the target style.

Business Value

Enables the creation of high-quality digital art and stylized images, useful for graphic design, advertising, and personalized content generation, particularly for traditional art styles.