We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
$$\hbox {F}_{0}$$ contour generation and synthesis using Bengali Hmm-based speech synthesis system.
- Authors
Mukherjee, Sankar; Mandal, Shyamal
- Abstract
HMM based Bengali speech synthesis system (Bengali-HTS) generates highly intelligible synthesized speech but its naturalness is not adequate even though it is trained with a very good amount of speech corpus. In case of interrogative, imperative and exclamatory sentences, naturalness of the synthesized speech falls drastically. This paper proposes a method to overcome this problem by modifying the $$\hbox {F}_{0}$$ contour of synthetic speech based on Fujisaki model. The Fujisaki model features for different types of Bengali sentences are analyzed for the generation of $$\hbox {F}_{0}$$ contour. These features depend on prosodic word/phrase boundary of the sentence. So a two layer supervised classification and regression tree is trained to predict the prosodic word/phrase boundary. Fujisaki model then generates $$\hbox {F}_{0}$$ contour from input text using the prosodic word/phrase boundary and segmental duration information from HMM-based speech synthesis system. Moreover, for HMM training purpose, prosodic structure of sentence has been employed rather than lexical structure. From MOS and preference test it is found that proposed method significantly improved the overall quality of synthesized speech than that of Bengali-HTS.
- Subjects
SPEECH synthesis; EXCLAMATIONS (Grammar); BENGALI language; REGRESSION trees; LEXICAL access; GRAMMAR
- Publication
International Journal of Speech Technology, 2015, Vol 18, Issue 1, p25
- ISSN
1381-2416
- Publication type
Article
- DOI
10.1007/s10772-014-9247-3