Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper Cybersecurity Professionals,Media Forensics Experts,AI Researchers,Content Moderation Teams 2 days ago

Referee: Reference-aware Audiovisual Deepfake Detection

ai-safety › robustness
📄 Abstract

Abstract: Since deepfakes generated by advanced generative models have rapidly posed serious threats, existing audiovisual deepfake detection approaches struggle to generalize to unseen forgeries. We propose a novel reference-aware audiovisual deepfake detection method, called Referee. Speaker-specific cues from only one-shot examples are leveraged to detect manipulations beyond spatiotemporal artifacts. By matching and aligning identity-related queries from reference and target content into cross-modal features, Referee jointly reasons about audiovisual synchrony and identity consistency. Extensive experiments on FakeAVCeleb, FaceForensics++, and KoDF demonstrate that Referee achieves state-of-the-art performance on cross-dataset and cross-language evaluation protocols. Experimental results highlight the importance of cross-modal identity verification for future deepfake detection. The code is available at https://github.com/ewha-mmai/referee.
Authors (3)
Hyemin Boo
Eunsang Lee
Jiyoung Lee
Submitted
October 31, 2025
arXiv Category
cs.CV
arXiv PDF Code

Key Contributions

Referee is a novel reference-aware audiovisual deepfake detection method that leverages speaker-specific cues from one-shot examples. By matching identity-related queries across modalities, it jointly reasons about audiovisual synchrony and identity consistency, achieving state-of-the-art performance on cross-dataset and cross-language evaluations.

Business Value

Enhances the reliability of digital media by providing advanced tools to detect sophisticated audiovisual deepfakes, crucial for combating misinformation and ensuring secure communication.

View Code on GitHub