dc.contributor.author | Schlunz, Georg I. | |
dc.contributor.author | Barnard, Etienne | |
dc.contributor.author | van Huyssteen, Gerhard B. | |
dc.date.accessioned | 2018-03-07T10:27:13Z | |
dc.date.available | 2018-03-07T10:27:13Z | |
dc.date.issued | 2010 | |
dc.identifier.citation | Georg Schlünz, Etienne Barnard and Gerhard van Huyssteen, “Part-of-speech effects on text-to-speech synthesis”, in Proc. Annual Symp. Pattern Recognition Association of South Africa (PRASA), pp 257-262, Stellenbosch, South Africa, 2010. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications] | en_US |
dc.identifier.uri | https://researchspace.csir.co.za/dspace/bitstream/handle/10204/4674/Schlunz_2010.pdf?sequence=1&isAllowed=y | |
dc.identifier.uri | http://hdl.handle.net/10394/26555 | |
dc.description.abstract | One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesized speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden Markov model (HMM) based TTS voice when additional resources are not available to aid in the modeling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the addition of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags. | en_US |
dc.description.sponsorship | Human Language Technology Competency Area, CSIR, Meraka Institute, Pretoria, South Africa
Multilingual Speech Technologies, North-West University, Vanderbijlpark, South Africa
Centre for Text Technology, North-West University, Potchefstroom, South Africa | en_US |
dc.language.iso | en | en_US |
dc.publisher | Pattern Recognition Association of South Africa and Mechatronics International Conference | en_US |
dc.subject | Speech effects | en_US |
dc.subject | Text-to-speech | en_US |
dc.subject | Natural Language Processing— Speech recognition and synthesis | en_US |
dc.title | Part-of-speech effects on text-to-speech synthesis | en_US |
dc.type | Presentation | en_US |