Prosodic Analysis And Modelling For Malay Emotional Speech Synthesis

Authors

  • Mumtaz B. Mustafa, Faculty of Computer Science and Information Technology, University of Malaya
  • Raja N. Ainon, Faculty of Computer Science and Information Technology, University of Malaya
  • Roziati Zainuddin, Faculty of Computer Science and Information Technology, University of Malaya
  • Zuraidah M. Don, Faculty of Language and Linguistics, University of Malaya
  • Gerry Knowles, Lingenium Sdn Bhd, Kuala Lumpur
  • Salimah Mokhtar, Faculty of Computer Science and Information Technology, University of Malaya

Keywords:

Emotional speech re-synthesis, Prosody conversion, Rule-based approach, MBROLA

Abstract

This paper describes an emotional prosody generator for a Malay speech synthesis system. The generator re-synthesizes a selected vocal emotion from neutral synthesized speech output and improves naturalness by applying rule-based prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency (F0) and duration, has been widely investigated. This project attempts to improve the naturalness of synthesized emotional Malay speech by establishing an effective mechanism for the re-synthesis of emotion. The mechanism is derived by analyzing the variation in the F0 contour of continuous emotional Malay speech against a fixed time period. The emotional prosody generator for Malay developed in the course of this research uses the principles of parametric prosody manipulation to synthesize four basic emotions: happiness, anger, sadness and fear. Listening tests were conducted as a subjective evaluation of the generator's ability to produce the prosody needed to synthesize emotional expression. The evaluation results show an overall recognition rate of between 61% and 85%.

Published

2010-09-01

How to Cite

Mustafa, M. B., Ainon, R. N., Zainuddin, R., Don, Z. M., Knowles, G., & Mokhtar, S. (2010). Prosodic Analysis And Modelling For Malay Emotional Speech Synthesis. Malaysian Journal of Computer Science, 23(2), 102–110. Retrieved from https://jml.um.edu.my/index.php/MJCS/article/view/6399