Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
Introduces FreeFuse, a novel training-free method for fusing multiple subject LoRAs in text-to-image generation. It automatically derives subject masks from cross-attention weights at inference time, enabling efficient and practical multi-subject generation without additional training or model modifications.
Democratizes advanced image generation capabilities by making it easier and more efficient for users to combine multiple personalized subjects into a single image, fostering creativity in digital art and design.