Effects of application type on the choice of interaction modality in IVR systems
(Unisa Press, 2012)
This paper addresses the feasibility of using the telephone as a tool for information access in the technology-challenged and illiterate communities of Southern Africa. We did two case studies of disparate Interactive Voice ...
Improved transition models for cepstral trajectories
(Pattern recognition association of South Africa (PRASA), 2012)
We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
Medium-vocabulary speech recognition for under-resourced languages
(SLTU, 2012)
We report on the development of speech-recognition systems that are able to perform accurate recognition on medium-vocabulary tasks (i.e. tasks that require distinctions between approximately 200 different terms). We are ...
Validating smartphone-collected speech corpora
(SLTU, 2012)
We investigate the effectiveness with which the accuracy of a prompted speech corpus can be validated when minimal additional speech resources are available, and specifically when a language model in the target language ...
Spoken language identification system adaptation in under-resourced environments
(Pattern recognition association of South Africa (PRASA), 2013)
Speech technologies have matured over the past few decades and have made significant impacts in a variety of fields, from assistive technologies to personal assistants. However, speech system development is a resource ...
Kernel bandwidth estimation for non-parametric density estimation: a comparative study
(Pattern recognition association of South Africa (PRASA), 2013)
We investigate the performance of conventional bandwidth estimators for non-parametric kernel density estimation on a number of representative pattern-recognition tasks, to gain a better understanding of the behaviour of ...
Cross-bandwidth adaptation for ASR systems
(Pattern recognition association of South Africa (PRASA), 2013)
Mismatches between application and training data greatly reduce the performance of automatic speech recognition (ASR) systems. However, collecting suitable amounts of in-domain and application-specific data for training ...
Tone realisation in a Yorùbá speech recognition corpus
(SLTU, 2012)
We investigate the acoustic realisation of tone in short continuous utterances in Yorùbá. Fundamental frequency (F0) contours are extracted for automatically aligned syllables from a speech corpus of 33 speakers collected ...