The Spoken Web Search task at MediaEval 2012

Metze, Florian; Anguera, Xavier; Barnard, Etienne; Gravier, Guillaume; Davel, Marelie H. (Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, 2013)

In this paper, we describe the “Spoken Web Search” Task, which was held as part of the 2012 MediaEval benchmark evaluation campaign. The purpose of this task was to perform audio search with audio input in four languages, ...

Correlation between rapid learnability and user preference in IVR systems for developing regions

Ndwe, T.J.; Barnard, Etienne; Foko, Thato (IST-Africa, 2013)

Access to information and communication is one of the most important needs in any population group. It is generally challenging for people in the developing world to access information because the tools and the technologies ...

Generating fundamental frequency contours for speech synthesis in Yorùbá

Van Niekerk, Daniel R.; Barnard, Etienne (International Speech Communication Association (ISCA), 2013)

We present methods for modelling and synthesising fundamental frequency (F0) contours suitable for application in text-to-speech (TTS) synthesis of Yorùbá (an African tone language). These methods are discussed and compared ...

Adapting mobile medical information search to low-resourced areas

Hanbury, Allan; Van Zyl, Hendra; Boyer, Célia; Barnard, Etienne (IST-Africa, 2013)

Providing good medical care in low-resourced areas is a challenge faced by many low and middle income countries. Continuously improving mobile communication infrastructure in these areas is however providing the opportunity ...

A Discourse Model of Affect for Text-to-Speech Synthesis

Schlunz, Georg I.; Barnard, Etienne (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2013)

This paper introduces a model of affect to improve prosody in text-to-speech synthesis. It operates on the discourse level of text to predict the underlying linguistic factors that contribute towards emotional appraisal, ...

G2P variant prediction techniques for ASR and STD

Davel, Marelie H.; van Heerden, Charl; Barnard, Etienne (Interspeech 2013, 2013)

Introducing pronunciation variants into a lexicon is a balancing act: incorporating necessary variants can improve automatic speech recognition (ASR) and spoken term detection (STD) performance by capturing some of the ...

The semi-automated creation of stratified speech corpora

Van Heerden, Carel; Barnard, Etienne; Davel, Marelie H. (Pattern Recognition Association of South Africa (PRASA), 2013)

Smartphones provide an efficient means for the collection of speech data; however, the quality of the corpora created in this fashion is not predictable. We describe an approach that allows us to post-process and rank ...

Consequences of Deploying culturally inclined earcons in speech technology design for oral users in South Africa

Ndwe, Tembalethu J.; Barnard, Etienne (3rd ACM Symp. on Computing for Development, 2013)

We discuss the qualitative outcomes of utilizing an earcon in the design of an Interactive Voice Response (IVR) system. Earcons are short non-speech audio messages that are used in the computer/user interface (UI) to provide ...

Spoken language identification system adaptation in under-resourced environments

Kleynhans, Neil; Barnard, Etienne (Pattern Recognition Association of South Africa (PRASA), 2013)

Speech technologies have matured over the past few decades and have made significant impacts in a variety of fields, from assistive technologies to personal assistants. However, speech system development is a resource ...

Kernel bandwidth estimation for non-parametric density estimation: a comparative study

Van der Walt, Christiaan M.; Barnard, Etienne (Pattern Recognition Association of South Africa (PRASA), 2013)

We investigate the performance of conventional bandwidth estimators for non-parametric kernel density estimation on a number of representative pattern-recognition tasks, to gain a better understanding of the behaviour of ...