Search
Now showing items 1-7 of 7
The NCHLT Speech Corpus of the South African languages
(Workshop Spoken Language Technologies for Under-resourced Languages (SLTU), 2014)
The NCHLT speech corpus contains wide-band speech from approximately
200 speakers per language, in each of the eleven
official languages of South Africa. We describe the design and
development processes that were ...
Analysing co-articulation using frame-based feature trajectories
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2010)
We investigate several approaches aimed at a more
detailed understanding of co-articulation in spoken utterances.
We find that the Euclidean difference between instantaneous
frame-based feature values and the mean values ...
Trajectory behaviour at different phonemic context sizes
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2011)
We propose a piecewise-linear model for the temporal trajectories
of Mel Frequency Cepstral Coefficients during phone transitions.
As with conventional Hidden Markov Models, the parameters of the
model can be estimated ...
A smartphone-based ASR data collection tool for under-resourced languages
(Elsevier, 2014)
Acoustic data collection for automatic speech recognition (ASR) purposes is a particularly challenging task when working with under-resourced languages, many of which are found in the developing world. We provide a brief ...
Collecting and evaluating speech recognition corpora for 11 South African languages
(Springer, 2011)
We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of ...
Improved transition models for cepstral trajectories
(Pattern recognition association of South Africa (PRASA), 2012)
We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
Woefzela - an open-source platform for ASR data collection in the developing world
(Interspeech 2011, 2011)
Building transcribed speech corpora for under-resourced
languages plays a pivotal role in developing speech technologies
for such languages. We have developed an open-source
tool for devices running the Android operating ...