Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: While recent years have seen remarkable progress in music generation models,
research on their biases across countries, languages, cultures, and musical
genres remains underexplored. This gap is compounded by the lack of datasets
and benchmarks that capture the global diversity of music. To address these
challenges, we introduce GlobalDISCO, a large-scale dataset consisting of 73k
music tracks generated by state-of-the-art commercial generative music models,
along with paired links to 93k reference tracks in LAION-DISCO-12M. The dataset
spans 147 languages and includes musical style prompts extracted from
MusicBrainz and Wikipedia. The dataset is globally balanced, representing
musical styles from artists across 79 countries and five continents. Our
evaluation reveals large disparities in music quality and alignment with
reference music between high-resource and low-resource regions. Furthermore, we
find marked differences in model performance between mainstream and
geographically niche genres, including cases where models generate music for
regional genres that more closely align with the distribution of mainstream
styles.
Key Contributions
This paper introduces GlobalDISCO, a large-scale, globally balanced dataset of AI-generated music spanning 147 languages and 79 countries. It reveals significant disparities in music quality and alignment between high-resource and low-resource regions, highlighting biases in current generative music models.
Business Value
Promotes the development of more equitable and culturally sensitive AI music generation tools, potentially opening new markets and creative possibilities globally. Helps companies avoid reputational damage from biased AI.