Search
Now showing items 1-10 of 61
The Spoken Web Search task at Mediaeval 2012
(Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, 2013)
In this paper, we describe the “Spoken Web Search” Task, which
was held as part of the 2012 MediaEval benchmark evaluation campaign.
The purpose of this task was to perform audio search with audio
input in four languages, ...
Towards lecture transcription in resource-scarce environments
(Pattern recognition association of South Africa (PRASA), 2012)
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Our development has focused on the transcription of lectures in Afrikaans from two faculties at North-West University. A ...
Correlation between rapid learnability and user preference in IVR systems for developing regions
(iIST-Africa, 2013)
Access to information and communication is one of the most important needs in any population group. It is generally challenging for people in the developing world to access information because the tools and the technologies ...
Generating fundamental frequency contours for speech synthesis in Yorùbá
(International Speech Communication Association ( ISCA ), 2013)
We present methods for modelling and synthesising fundamental frequency (F0) contours suitable for application in text-to-speech (TTS) synthesis of Yorùbá (an African tone language). These methods are discussed and compared ...
Speech data collection in an under-resourced language within a multilingual context
(International Research Institute MICA, 2014)
In this paper, we present an end-to-end solution to the development of an automatic speech recognition (ASR) system in
typical under-resourced languages, where the target language is likely to be influenced by one more ...
Adapting mobile medical information search to low-resourced areas
(IST-Africa, 2013)
Providing good medical care in low-resourced areas is a challenge faced by many low and middle income countries. Continuously improving mobile communication infrastructure in these areas is however providing the opportunity ...
A voice service for user feedback on school meals
(ACM, 2012)
Research using voice-based services as a technology platform for providing information access and services within developing world regions has shown much promise. The results for design and deployment of such voice-based ...
Determination and the no-free-lunch paradox
(MIT Press, 2011)
We discuss the no-free-lunch NFL theorem for supervised learning as a logical paradox—that is, as a counterintuitive result that is correctly proven from apparently incontestable assumptions. We show that the uniform prior ...
The NCHLT Speech Corpus of the South African languages
(Workshop Spoken Language Technologies for Under-resourced Languages (SLTU), 2014)
The NCHLT speech corpus contains wide-band speech from approximately
200 speakers per language, in each of the eleven
official languages of South Africa. We describe the design and
development processes that were ...
Efficient data selection for ASR
(Language Resources and Evaluation, 2015)
Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development ...