Search

Now showing items 1-10 of 12

Determination and the no-free-lunch paradox

Barnard, Etienne (MIT Press, 2011)

We discuss the no-free-lunch NFL theorem for supervised learning as a logical paradox—that is, as a counterintuitive result that is correctly proven from apparently incontestable assumptions. We show that the uniform prior ...

Processing spoken lectures in resource-scarce environments

van Heerden, Charl; De Villiers, Pieter; Barnard, Etienne; Davel, Marelie H. (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2011)

Initial work towards processing Afrikaans spoken lectures in a resource-scarce environment is presented. Two approaches to acoustic modeling for eventual alignment are compared: (a) using a well-trained target-language ...

Comparing two developmental applications of speech technology

Grover, Aditi S.; Barnard, Etienne (Conf. on Human Language Technology for Development (HLTD2011), 2011)

Over the past decade applications of speech technologies for development (ST4D) have shown much potential for enabling information access and service delivery. In this paper we review two deployed ST4D services and posit ...

Trajectory behaviour at different phonemic context sizes

Badenhorst, Jaco; Davel, Marelie H.; Barnard, Etienne (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2011)

We propose a piecewise-linear model for the temporal trajectories of Mel Frequency Cepstral Coefficients during phone transitions. As with conventional Hidden Markov Models, the parameters of the model can be estimated ...

Collecting and evaluating speech recognition corpora for 11 South African languages

Badenhorst, Jaco; Van Heerden, Charl; Barnard, Etienne; Davel, Marelie H. (Springer, 2011)

We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of ...

Phone recognition for spoken web search

Barnard, Etienne; van Heerden, Charl; Kleynhans, Neil; Bali, Kalika; Davel, Marelie H. (MediaEval Workshop, Pisa, Italy, 2011)

Aiming at both speaker independence and robustness with respect to recognition errors in the spoken queries, we have implemented a two-pass system for spoken web search. In the first pass, unconstrained phone recognition ...

Efficient harvesting of Internet audio for resource-scarce ASR

Davel, Marelie H.; van Heerden, Charl; Kleynhans, Neil; Barnard, Etienne (Interspeech 2011, 2011)

Spoken recordings that have been transcribed for human reading (e.g. as captions for audiovisual material, or to provide alternative modes of access to recordings) are widely available in many languages. Such recordings ...

Speech systems for autonomous unmanned aircraft: Enabling autonomous unmanned aircraft to communicate in civil airspace

Burger, Chris R.; Barnard, Etienne; Jones, Thomas (Aerospace Symp. of South Africa (IASSA), 2011)

Airspace control is currently based largely on the exchange of speech between aircraft and Air Traffic Service Units, or between aircraft themselves. ICAO regulatory guidelines make no distinction between unmanned ...

Efficiency measurements in IVR systems for oral users: Consequences of differences in educational levels

Ndwe, Jama T.; Barnard, Etienne; Koen, Renee; McAlister, Bryan (South African Inst. for Computer Scientists and Information Technologists Conf. (SAICSIT), 2011)

In this paper we present the development of an Interactive Voice Response (IVR) system that enables its users to obtain soccer results. The objective of the study is to evaluate the usability of the system through experiments ...

The Lwazi Community Communication Service: design and piloting of a voice-based information service

Grover, Aditi S.; Barnard, Etienne (World Wide Web Conf. (WWW 11), 2011)

We present the design, development and pilot process of the Lwazi Community Communication Service (LCCS), a multilingual automated telephone-based information service. The service acts as a communication and dissemination ...