Search
Now showing items 1-7 of 7
Towards lecture transcription in resource-scarce environments
(Pattern recognition association of South Africa (PRASA), 2012)
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Our development has focused on the transcription of lectures in Afrikaans from two faculties at North-West University. A ...
A voice service for user feedback on school meals
(ACM, 2012)
Research using voice-based services as a technology platform for providing information access and services within developing world regions has shown much promise. The results for design and deployment of such voice-based ...
Effects of application type on the choice of interaction modality in IVR systems
(Unisa Press, 2012)
This paper addresses the feasibility of using the telephone as a tool for information access in the technology challenged and illiterate communities of Southern Africa. We did two case studies of disparate Interactive Voice ...
Improved transition models for cepstral trajectories
(Pattern recognition association of South Africa (PRASA), 2012)
We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
Medium-vocabulary speech recognition for under-resourced languages
(SLTU, 2012)
We report on the development of speech-recognition systems that are able to perform accurate recognition on mediumvocabulary tasks (i.e. tasks that require distinctions between approximately 200 different terms). We are ...
Validating smartphone-collected speech corpora
(SLTU, 2012)
We investigate the effectiveness with which the accuracy of a prompted speech corpus can be validated when minimal additional speech resources are available, and specifically when a language model in the target language ...
Tone realisation in a Yorùbá speech recognition corpus
(SLTU, 2012)
We investigate the acoustic realisation of tone in short continuous utterances in Yorùbá. Fundamental frequency (F0) contours are extracted for automatically aligned syllables from a speech corpus of 33 speakers collected ...