We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Estonian Speech Recognition and Transcription Editing Service.
- Authors
OLEV, Aivo; ALUMAE, Tanel
- Abstract
This paper describes the latest iteration of our Estonian speech recognition system and the publicly available transcription editing service. The system is now based on an end-to-end wav2vec2.0 model. It achieves a word error rate of 6.9% on a test set of broadcast conversations. Besides recognition it performs speaker diarization, speaker identification, Estonian language detection, and punctuation restoration. The service consists of a speech processing pipeline, web server and a web-based user interface for end-users, offering transcript editing and speaker annotation functionality. The core components of the service have been made open-source and deployed internally by multiple public and private institutions.
- Subjects
AUTOMATIC speech recognition; SPEECH perception; TRANSCRIPTION; WEB-based user interfaces; ESTONIAN language; EDITING
- Publication
Baltic Journal of Modern Computing, 2022, Vol 10, Issue 3, p409
- ISSN
2255-8942
- Publication type
Article
- DOI
10.22364/bjmc.2022.10.3.14