Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

📄 Abstract

Vision-Language Models (VLMs) excel at visual understanding but often suffer from visual hallucinations, where they generate descriptions of nonexistent objects, actions, or concepts, posing significant risks in safety-critical applications. Existing hallucination mitigation methods typically follow one of two paradigms: generation adjustment, which modifies decoding behavior to align text with visual inputs, and post-hoc verification, where external models assess and correct outputs. While effective, generation adjustment methods often rely on heuristics and lack correction mechanisms, while post-hoc verification is complicated, typically requiring multiple models and tending to reject outputs rather than refine them. In this work, we introduce REVERSE, a unified framework that integrates hallucination-aware training with on-the-fly self-verification. By leveraging a new hallucination-verification dataset containing over 1.3M semi-synthetic samples, along with a novel inference-time retrospective resampling technique, our approach enables VLMs to both detect hallucinations during generation and dynamically revise those hallucinations. Our evaluations show that REVERSE achieves state-of-the-art hallucination reduction, outperforming the best existing methods by up to 12% on CHAIR-MSCOCO and 34% on HaloQuest. Our dataset, model, and code are available at: https://reverse-vlm.github.io.
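
The abstract describes retrospective resampling only at a high level: the model is trained to flag likely hallucinations as it generates, and at inference time flagged spans are rewound and regenerated. The sketch below is a minimal illustration of such a detect-then-rewind decoding loop; the interface (generate_step, is_hallucination), the backtrack_span and max_retries parameters, and the temperature schedule are all assumptions made for this example and are not taken from the REVERSE paper or code.

```python
# Illustrative sketch of a retrospective-resampling decoding loop.
# The callables and hyperparameters here are hypothetical stand-ins,
# not the interfaces used by the REVERSE implementation.

from typing import Callable, List


def retrospective_resample(
    generate_step: Callable[[List[str], float], str],  # next-token sampler (assumed)
    is_hallucination: Callable[[List[str]], bool],     # self-verification check (assumed)
    prompt_tokens: List[str],
    max_new_tokens: int = 64,
    max_retries: int = 3,
    backtrack_span: int = 5,
) -> List[str]:
    """Generate tokens; when the verifier flags the current output as a likely
    hallucination, rewind the last few tokens and resample that span."""
    output: List[str] = []
    retries = 0
    while len(output) < max_new_tokens:
        temperature = 1.0 + 0.5 * retries  # sample more diversely after each flag
        token = generate_step(prompt_tokens + output, temperature)
        output.append(token)
        if token == "<eos>":
            break
        if is_hallucination(output) and retries < max_retries:
            # Retrospective step: discard the suspect span and try again.
            del output[-backtrack_span:]
            retries += 1
        else:
            retries = 0  # current span passed verification; reset the retry budget
    return output
```

Raising the sampling temperature on consecutive retries is one plausible way to push the model toward a different continuation rather than repeating the flagged span; the paper's actual revision strategy may differ.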
Authors (6)
Tsung-Han Wu
Heekyung Lee
Jiaxin Ge
Joseph E. Gonzalez
Trevor Darrell
David M. Chan
Submitted
April 17, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Introduces REVERSE, a unified framework for reducing hallucinations in VLMs that integrates hallucination-aware training with on-the-fly self-verification. It leverages a hallucination-verification dataset of over 1.3M semi-synthetic samples and an inference-time retrospective resampling technique, offering a more effective and less complex alternative to existing generation adjustment and post-hoc verification methods.

Business Value

Increases the trustworthiness and reliability of AI-generated content, making VLMs safer for use in sensitive applications like medical reporting, legal documentation, or autonomous systems.