arxiv_cl 95% Match Research Paper AI researchers in generative models,AI ethicists,Developers of video generation systems,Content creators,Policy makers 2 weeks ago

From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models

computer-vision › diffusion-models

📄 Abstract

Abstract: Recent advances in video diffusion models have significantly enhanced text-to-video generation, particularly through alignment tuning using reward models trained on human preferences. While these methods improve visual quality, they can unintentionally encode and amplify social biases. To systematically trace how such biases evolve throughout the alignment pipeline, we introduce VideoBiasEval, a comprehensive diagnostic framework for evaluating social representation in video generation. Grounded in established social bias taxonomies, VideoBiasEval employs an event-based prompting strategy to disentangle semantic content (actions and contexts) from actor attributes (gender and ethnicity). It further introduces multi-granular metrics to evaluate (1) overall ethnicity bias, (2) gender bias conditioned on ethnicity, (3) distributional shifts in social attributes across model variants, and (4) the temporal persistence of bias within videos. Using this framework, we conduct the first end-to-end analysis connecting biases in human preference datasets, their amplification in reward models, and their propagation through alignment-tuned video diffusion models. Our results reveal that alignment tuning not only strengthens representational biases but also makes them temporally stable, producing smoother yet more stereotyped portrayals. These findings highlight the need for bias-aware evaluation and mitigation throughout the alignment process to ensure fair and socially responsible video generation.

Authors (9)

Zefan Cai

Haoyi Qiu

Haozhe Zhao

Ke Wan

Jiachen Li

Jiuxiang Gu

+3 more

Submitted

October 20, 2025

arXiv Category

cs.CL

arXiv PDF

Key Contributions

This paper introduces VideoBiasEval, a diagnostic framework for evaluating social bias in video diffusion models, particularly after alignment tuning. It uses event-based prompting and multi-granular metrics to systematically analyze how biases related to gender and ethnicity are encoded and amplified, highlighting the unintended consequences of optimizing for human preferences.

Business Value

Helps developers create more ethical and inclusive AI-generated content, reducing the risk of perpetuating harmful stereotypes. This is crucial for brand reputation and responsible AI deployment in media and entertainment.

Paper Metadata

Innovation Type

Evaluation Framework and Diagnostic Tool

Deployment Feasibility

High for evaluation and research, moderate for direct model improvement without further algorithmic development.

Limitations Addressed

Unintentional encoding and amplification of social biases in video generation models,Lack of systematic methods to evaluate bias in text-to-video models,Difficulty in disentangling semantic content from actor attributes

Technical Tags

Video Diffusion ModelsAlignment TuningSocial BiasHuman PreferencesRepresentation LearningBias EvaluationText-to-Video GenerationVideoBiasEvalEvent-based PromptingMulti-granular Metrics

Research Topics

AI EthicsFairness in AIGenerative ModelsVideo GenerationModel Alignment

Methods & Architectures

Alignment tuningReward modelsVideoBiasEval frameworkEvent-based promptingMulti-granular bias metrics Video Diffusion Models

Applications & Tasks

Content creation Media generation Virtual reality Gaming AI ethics research Encoding and amplification of social biases in video generationUnintended consequences of alignment tuningLack of systematic evaluation for social bias in video models Evaluating social representation in video generationTracing bias evolution through the alignment pipelineDeveloping fairer video diffusion models

Datasets & Benchmarks

Benchmarks

VideoBiasEval

Overall ethnicity biasGender bias conditioned on ethnicityDistributional shifts in social attributesTemporal persistence of bias

Related Fields

AI EthicsFairness in AIGenerative ModelsComputer VisionNatural Language Processing

Keywords

Video Diffusion ModelsSocial BiasAlignment TuningText-to-VideoGenerative AIFairnessEthicsEvaluationRepresentationHuman PreferencesGender BiasEthnicity BiasVideo Generation

Academic Context

#AI Ethics#Fairness in AI#Generative Models#Video Generation#Model Alignment

Technology Stack

Frameworks & Libraries

VideoBiasEval

Commercial Potential

Potential Products

Bias detection tools for generative modelsFairness-aware alignment tuning methodsEthical AI content generation platforms

Target Industries

Media and EntertainmentAdvertisingTechnologyGaming

Use Case Examples

Ensuring AI-generated characters in movies reflect diverse demographicsPreventing AI from creating stereotypical representations in advertisementsDeveloping ethical guidelines for AI content generation

Competitive Edge

Provides a novel, systematic framework (VideoBiasEval) specifically designed to diagnose social biases in video diffusion models, addressing a critical gap in current evaluation methodologies.

Market Opportunity

Growing market for ethical AI solutions and bias auditing tools.

Revenue Models

Licensing of bias detection toolsconsulting services for AI fairness.

Resource Requirements

Compute Needs

High for training and evaluating large video diffusion models.

Data Requirements

Requires diverse datasets for training video diffusion models and specific prompting strategies for bias evaluation.

Deployment Constraints

Potential for bias to be deeply ingrained in models, requiring significant effort to mitigate.

Scalability

Scalability of diffusion models is a general challenge; bias evaluation framework aims to be efficient.

Regulatory Considerations

Increasing regulatory focus on AI fairness and bias mitigation.

Production Readiness

Maturity Level

Research

Time to Market

2-3 years for bias mitigation techniques to be widely adopted.

Patent Potential

Low for the evaluation framework itself, but potential for patents on bias mitigation techniques or fairer model architectures.

View Full Paper Back to Papers