Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Artistic styles are defined by both their structural and appearance elements.
Existing neural stylization techniques primarily focus on transferring
appearance-level features such as color and texture, often neglecting the
equally crucial aspect of structural stylization. To address this gap, we
introduce \textbf{DiffArtist}, the first 2D stylization method to offer
fine-grained, simultaneous control over both structure and appearance style
strength. This dual controllability is achieved by representing structure and
appearance generation as separate diffusion processes, necessitating no further
tuning or additional adapters. To properly evaluate this new capability of dual
stylization, we further propose a Multimodal LLM-based stylization evaluator
that aligns significantly better with human preferences than existing metrics.
Extensive analysis shows that DiffArtist achieves superior style fidelity and
dual-controllability compared to state-of-the-art methods. Its text-driven,
training-free design and unprecedented dual controllability make it a powerful
and interactive tool for various creative applications. Project homepage:
https://diffusionartist.github.io.