IIIT Hyderabad Publications
Building Sleek Synthesizer for Multi-lingual Screen Reader

Authors: E. Veera Raghavendra, B Yegnanarayana, Alan W Black, Kishore Prahallad
Conference: in Proceedings of Interspeech, Brisbane, Australia
Date: 2008-09-29
Report no: IIIT/TR/2008/156

Abstract

In this paper, we investigate which unit size (syllable, half-phone, or quarter-phone) should be used for speech synthesis in a multi-lingual screen reader, for phonetic languages such as Telugu and for a non-phonetic language, English. Perceptual studies show that syllable-level units perform better for Telugu and half-phone units perform better for English. While syllable-based synthesizers produce better-sounding speech, coverage of all syllables is a non-trivial issue. We address the issue of syllable coverage through approximate matching of syllables and show that such approximation produces intelligible speech of better quality than diphone units. We also propose a hybrid synthesizer within the framework of unit selection and show that a hybrid synthesizer built from a pruned database performs as well as one built from the unpruned database.

Full paper: pdf
Centre for Language Technologies Research Centre
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved.