ext to Speech synthesizer for Dzongkha

Achyut Nepal; Cheni Zangmo; Nidup Wangmo; Sangay Choden; Yeshi Wangchuk; Kamal Kr. Chapagai

doi:10.17102/zmV803

Authors

Achyut Nepal Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
Cheni Zangmo Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
Nidup Wangmo Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
Sangay Choden Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
Yeshi Wangchuk Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
Kamal Kr. Chapagai Information Technology Department, College of Science and Technology, Royal University of Bhutan Author

DOI:

https://doi.org/10.17102/zmV803

Keywords:

NLP, TTS, Synthesizer, HMM, Phoneme

Abstract

A high-quality speech synthesizer should be intelligent and produce natural speech. The quality of speech generated by a text-to-speech synthesizer also depends on the amount of data used for training. This paper presents the development of Dzongkha TTS (Text-to-Speech) system using open-source toolkit Hidden Markov Model Toolkit (HTK) and proposes a method to increase the speech database for quality output. Every word in a language can be broken down into several phonemes, which are combinations of phonemes that generate words. Therefore, we suggest developing a corpus through phoneme concatenation, which can increase the database for training TTS systems.

ext to Speech synthesizer for Dzongkha

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

How to Cite

Similar Articles

Most read articles by the same author(s)

Make a Submission

Visiting Counter Flag

ISSN Number

Latest publications

Information