Search

Now showing items 1-7 of 7

The NCHLT Speech Corpus of the South African languages

Barnard, Etienne; Davel, Marelie H.; van Heerden, Charl; De Wet, Febe; Badenhorst, Jaco (Workshop Spoken Language Technologies for Under-resourced Languages (SLTU), 2014)

The NCHLT speech corpus contains wide-band speech from approximately 200 speakers per language, in each of the eleven official languages of South Africa. We describe the design and development processes that were ...

Analysing co-articulation using frame-based feature trajectories

Badenhorst, Jaco; Davel, Marelie H.; Barnard, Etienne (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2010)

We investigate several approaches aimed at a more detailed understanding of co-articulation in spoken utterances. We find that the Euclidean difference between instantaneous frame-based feature values and the mean values ...

Trajectory behaviour at different phonemic context sizes

Badenhorst, Jaco; Davel, Marelie H.; Barnard, Etienne (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2011)

We propose a piecewise-linear model for the temporal trajectories of Mel Frequency Cepstral Coefficients during phone transitions. As with conventional Hidden Markov Models, the parameters of the model can be estimated ...

A smartphone-based ASR data collection tool for under-resourced languages

De Vries, Nic J.; Badenhorst, Jaco; Basson, Willem D.; De Wet, Febe; Barnard, Etienne; De Waal, Alta; Davel, Marelie H. (Elsevier, 2014)

Acoustic data collection for automatic speech recognition (ASR) purposes is a particularly challenging task when working with under-resourced languages, many of which are found in the developing world. We provide a brief ...

Collecting and evaluating speech recognition corpora for 11 South African languages

Badenhorst, Jaco; Van Heerden, Charl; Barnard, Etienne; Davel, Marelie H. (Springer, 2011)

We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of ...

Improved transition models for cepstral trajectories

Badenhorst, Jaco; Barnard, Etienne; Davel, Marelie H. (Pattern recognition association of South Africa (PRASA), 2012)

We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...

Woefzela - an open-source platform for ASR data collection in the developing world

de Vries, Nic J.; Badenhorst, Jaco; Davel, Marelie H.; Barnard, Etienne; de Waal, Alta (Interspeech 2011, 2011)

Building transcribed speech corpora for under-resourced languages plays a pivotal role in developing speech technologies for such languages. We have developed an open-source tool for devices running the Android operating ...