ext to Speech synthesizer for Dzongkha

Authors

  • Achyut Nepal Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
  • Cheni Zangmo Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
  • Nidup Wangmo Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
  • Sangay Choden Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
  • Yeshi Wangchuk Information Technology Department, College of Science and Technology, Royal University of Bhutan Author
  • Kamal Kr. Chapagai Information Technology Department, College of Science and Technology, Royal University of Bhutan Author

DOI:

https://doi.org/10.17102/zmV803

Keywords:

NLP, TTS, Synthesizer, HMM, Phoneme

Abstract

A high-quality speech synthesizer should be intelligent and produce natural speech. The quality of speech generated by a text-to-speech synthesizer also depends on the amount of data used for training. This paper presents the development of Dzongkha TTS (Text-to-Speech) system using open-source toolkit Hidden Markov Model Toolkit (HTK) and proposes a method to increase the speech database for quality output. Every word in a language can be broken down into several phonemes, which are combinations of phonemes that generate words. Therefore, we suggest developing a corpus through phoneme concatenation, which can increase the database for training TTS systems.

Downloads

Published

15-06-2015

How to Cite

Nepal, A., Zangmo, C., Wangmo, N., Choden, S., Wangchuk, Y., & Chapagai, K. K. (2015). ext to Speech synthesizer for Dzongkha. Zorig Melong | A Technical Journal of Science, Engineering and Technology, 2(1), 122-125. https://doi.org/10.17102/zmV803