Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
This paper proposes a novel stress-aware speech-to-speech translation (S2ST) system that preserves word-level emphasis by leveraging LLMs for cross-lingual emphasis conversion. It introduces a data generation pipeline and 'LLM-as-Judge' evaluation to overcome data scarcity, achieving superior emphasis preservation.
Enhances the expressiveness and emotional nuance of translated speech, improving cross-cultural communication and user experience in applications like virtual assistants and international calls.