Part-of-speech effects on text-to-speech synthesis

Schlunz, Georg I.; Barnard, Etienne; van Huyssteen, Gerhard B.

dc.contributor.author	Schlunz, Georg I.
dc.contributor.author	Barnard, Etienne
dc.contributor.author	van Huyssteen, Gerhard B.
dc.date.accessioned	2018-03-07T10:27:13Z
dc.date.available	2018-03-07T10:27:13Z
dc.date.issued	2010
dc.identifier.citation	Georg Schlünz, Etienne Barnard and Gerhard van Huyssteen, “Part-of-speech effects on text-to-speech synthesis”, in Proc. Annual Symp. Pattern Recognition Association of South Africa (PRASA), pp 257-262, Stellenbosch, South Africa, 2010. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications]	en_US
dc.identifier.uri	https://researchspace.csir.co.za/dspace/bitstream/handle/10204/4674/Schlunz_2010.pdf?sequence=1&isAllowed=y
dc.identifier.uri	http://hdl.handle.net/10394/26555
dc.description.abstract	One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesized speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden Markov model (HMM) based TTS voice when additional resources are not available to aid in the modeling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the addition of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags.	en_US
dc.description.sponsorship	Human Language Technology Competency Area, CSIR, Meraka Institute, Pretoria, South Africa Multilingual Speech Technologies, North-West University, Vanderbijlpark, South Africa Centre for Text Technology, North-West University, Potchefstroom, South Africa	en_US
dc.language.iso	en	en_US
dc.publisher	Pattern Recognition Association of South Africa and Mechatronics International Conference	en_US
dc.subject	Speech effects	en_US
dc.subject	Text-to-speech	en_US
dc.subject	Natural Language Processing— Speech recognition and synthesis	en_US
dc.title	Part-of-speech effects on text-to-speech synthesis	en_US
dc.type	Presentation	en_US