Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
SheafAlign is a novel sheaf-theoretic framework for decentralized multimodal alignment that replaces single-space alignment with multiple comparison spaces. It models pairwise modality relations using sheaf structures and employs decentralized contrastive learning. This approach overcomes the limitation of requiring mutual redundancy across all modalities, preserves both shared and unique information, and achieves superior zero-shot generalization and robustness to missing modalities with significantly lower communication costs.
Enables more efficient and robust multimodal data fusion in decentralized systems, such as sensor networks or collaborative robotics. This is crucial for applications where data is distributed and modalities may be incomplete or unreliable.