Search
Now showing items 1-9 of 9
Category-based phoneme-to-grapheme transliteration
(International Speech Communication Association ( ISCA ), 2013)
Grapheme-based speech recognition systems are faster to develop but typically do not reach the same level of performance as phoneme-based systems. In this paper we introduce a technique for improving the performance of ...
Comparing grapheme-based and phoneme-based speech recognition for Afrikaans
(Pattern recognition association of South Africa (PRASA), 2012)
This paper compares the recognition accuracy of a phoneme-based automatic speech recognition system with that of a grapheme-based system, using Afrikaans as case study. The first system is developed using a conventional ...
Validating smartphone-collected speech corpora
(SLTU, 2012)
We investigate the effectiveness with which the accuracy of a prompted speech corpus can be validated when minimal additional speech resources are available, and specifically when a language model in the target language ...
Improved transition models for cepstral trajectories
(Pattern recognition association of South Africa (PRASA), 2012)
We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
Medium-vocabulary speech recognition for under-resourced languages
(SLTU, 2012)
We report on the development of speech-recognition systems that are able to perform accurate recognition on mediumvocabulary tasks (i.e. tasks that require distinctions between approximately 200 different terms). We are ...
Performance Analysis of a Multilingual Directory Enquiries Application
(PRASA, 2014)
In a multilingual society such as South Africa, a practical directory enquiries (DE) application should be able to
serve users from various language backgrounds with information relating to names in various languages: a ...
Implications of Sepedi/English code switching for ASR systems
(Pattern recognition association of South Africa (PRASA), 2013)
Code switching (the process of switching from one language to another during a conversation) is a common phenomenon in multilingual environments. Where a minority and dominant language coincide, code switching from the ...
The semi-automated creation of stratified speech corpora
(Pattern recognition association of South Africa (PRASA), 2013)
Smartphones provide an efficient means for the collection of speech data; however, the quality of the corpora created in this fashion is not predictable. We describe an approach that allows us to post-process and rank ...
Automatic alignment of audiobooks in Afrikaans
(Pattern recognition association of South Africa (PRASA), 2012)
This paper reports on the automatic alignment of audiobooks in Afrikaans. An existing Afrikaans pronunciation dictionary and corpus of Afrikaans speech data are used to generate baseline acoustic models. The baseline system ...