Search
Now showing items 1-10 of 12
Determination and the no-free-lunch paradox
(MIT Press, 2011)
We discuss the no-free-lunch NFL theorem for supervised learning as a logical paradox—that is, as a counterintuitive result that is correctly proven from apparently incontestable assumptions. We show that the uniform prior ...
Processing spoken lectures in resource-scarce environments
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2011)
Initial work towards processing Afrikaans spoken
lectures in a resource-scarce environment is presented. Two
approaches to acoustic modeling for eventual alignment are
compared: (a) using a well-trained target-language ...
Comparing two developmental applications of speech technology
(Conf. on Human Language Technology for Development (HLTD2011), 2011)
Over the past decade applications of speech
technologies for development (ST4D) have
shown much potential for enabling information
access and service delivery. In this paper
we review two deployed ST4D services and
posit ...
Trajectory behaviour at different phonemic context sizes
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2011)
We propose a piecewise-linear model for the temporal trajectories
of Mel Frequency Cepstral Coefficients during phone transitions.
As with conventional Hidden Markov Models, the parameters of the
model can be estimated ...
Collecting and evaluating speech recognition corpora for 11 South African languages
(Springer, 2011)
We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of ...
Phone recognition for spoken web search
(MediaEval Workshop, Pisa, Italy, 2011)
Aiming at both speaker independence and robustness with
respect to recognition errors in the spoken queries, we have
implemented a two-pass system for spoken web search. In
the first pass, unconstrained phone recognition ...
Efficient harvesting of Internet audio for resource-scarce ASR
(Interspeech 2011, 2011)
Spoken recordings that have been transcribed for human reading
(e.g. as captions for audiovisual material, or to provide alternative
modes of access to recordings) are widely available in many
languages. Such recordings ...
Speech systems for autonomous unmanned aircraft: Enabling autonomous unmanned aircraft to communicate in civil airspace
(Aerospace Symp. of South Africa (IASSA), 2011)
Airspace control is currently based largely on the
exchange of speech between aircraft and Air Traffic Service
Units, or between aircraft themselves. ICAO regulatory
guidelines make no distinction between unmanned ...
Efficiency measurements in IVR systems for oral users: Consequences of differences in educational levels
(South African Inst. for Computer Scientists and Information Technologists Conf. (SAICSIT), 2011)
In this paper we present the development of an Interactive Voice Response (IVR) system that enables its users to obtain soccer results. The objective of the study is to evaluate the usability of the system through experiments ...
The Lwazi Community Communication Service: design and piloting of a voice-based information service
(World Wide Web Conf. (WWW 11), 2011)
We present the design, development and pilot process of the Lwazi
Community Communication Service (LCCS), a multilingual
automated telephone-based information service. The service acts as
a communication and dissemination ...