- Title
Investigation of Input Alphabets of End-to-End Lithuanian Text-to-Speech Synthesizer.
- Authors
KASPARAITIS, Pijus; ANTANAVIČIUS, Danielius
- Abstract
This paper addresses the choice of input alphabet for an end-to-end speech synthesizer for the Lithuanian language. Tacotron 2 is a state-of-the-art end-to-end speech synthesis model. Characters, phonemes, or their combinations can be used as input to the model. The model was trained on Lithuanian speech recordings using the following five input alphabets: letters, lowercase letters, accented letters, a reduced set of accented letters, and letters with separate accent marks. The acceptability of the synthesized speech was evaluated on the basis of human listeners' subjective judgment. Experimental testing showed that accent marks significantly improved the quality of the synthesized speech. Reducing the size of the input alphabet also had a slight positive impact. Placing separate accent marks in the text produced the best results, outperforming the use of accented letters.
- Subjects
SPEECH synthesis; AUTOMATIC speech recognition; LITHUANIAN language; NATURAL language processing; SPEECH; LITHUANIANS; JUDGMENT (Psychology)
- Publication
Baltic Journal of Modern Computing, 2023, Vol 11, Issue 2, p285
- ISSN
2255-8942
- Publication type
Article
- DOI
10.22364/bjmc.2023.11.2.05