Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Speech Emotion Recognition (SER) is a key affective computing technology that
enables emotionally intelligent artificial intelligence. While SER is
challenging in general, it is particularly difficult for low-resource languages
such as Urdu. This study investigates Urdu SER in a cross-corpus setting, an
area that has remained largely unexplored. We employ a cross-corpus evaluation
framework across three different Urdu emotional speech datasets to test model
generalization. Two standard domain-knowledge based acoustic feature sets,
eGeMAPS and ComParE, are used to represent speech signals as feature vectors
which are then passed to Logistic Regression and Multilayer Perceptron
classifiers. Classification performance is assessed using unweighted average
recall (UAR) whilst considering class-label imbalance. Results show that
Self-corpus validation often overestimates performance, with UAR exceeding
cross-corpus evaluation by up to 13%, underscoring that cross-corpus evaluation
offers a more realistic measure of model robustness. Overall, this work
emphasizes the importance of cross-corpus validation for Urdu SER and its
implications contribute to advancing affective computing research for
underrepresented language communities.
Authors (4)
Unzela Talpur
Zafi Sherhan Syed
Muhammad Shehram Shah Syed
Abbas Shah Syed
Submitted
October 28, 2025
Key Contributions
This paper investigates the challenging area of cross-corpus validation for Speech Emotion Recognition (SER) in Urdu, a low-resource language. It highlights that self-corpus validation can significantly overestimate performance, with UAR exceeding cross-corpus evaluation by up to 13%, underscoring the importance of robust cross-corpus evaluation for reliable SER models.
Business Value
Enables the development of more reliable emotionally intelligent AI systems for Urdu-speaking populations, improving applications in areas like mental health support and personalized user experiences.