Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper AI Artists,Generative AI Developers,Hobbyists,Researchers in Diffusion Models 1 week ago

FreeFuse: Multi-Subject LoRA Fusion via Auto Masking at Test Time

generative-ai › diffusion
📄 Abstract

Abstract: This paper proposes FreeFuse, a novel training-free approach for multi-subject text-to-image generation through automatic fusion of multiple subject LoRAs. In contrast to existing methods that either focus on pre-inference LoRA weight merging or rely on segmentation models and complex techniques like noise blending to isolate LoRA outputs, our key insight is that context-aware dynamic subject masks can be automatically derived from cross-attention layer weights. Mathematical analysis shows that directly applying these masks to LoRA outputs during inference well approximates the case where the subject LoRA is integrated into the diffusion model and used individually for the masked region. FreeFuse demonstrates superior practicality and efficiency as it requires no additional training, no modification to LoRAs, no auxiliary models, and no user-defined prompt templates or region specifications. Alternatively, it only requires users to provide the LoRA activation words for seamless integration into standard workflows. Extensive experiments validate that FreeFuse outperforms existing approaches in both generation quality and usability under the multi-subject generation tasks. The project page is at https://future-item.github.io/FreeFuse/
Authors (3)
Yaoli Liu
Yao-Xiang Ding
Kun Zhou
Submitted
October 27, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Introduces FreeFuse, a novel training-free method for fusing multiple subject LoRAs in text-to-image generation. It automatically derives subject masks from cross-attention weights at inference time, enabling efficient and practical multi-subject generation without additional training or model modifications.

Business Value

Democratizes advanced image generation capabilities by making it easier and more efficient for users to combine multiple personalized subjects into a single image, fostering creativity in digital art and design.