Browsing by Subject "Lemmatisation"
Automatic lemmatisation for Afrikaans
(North-West University, 2006) A lemmatiser is an important component of various human language technology applications for any language. At present, a rule-based lemmatiser for Afrikaans already exists, but this lemmatiser produces disappointingly low ...
Die deelwoord in Afrikaans: perspektiewe vanuit 'n kognitiewe gebruiksgebaseerde beskrywingsraamwerk
(2014) During an annotation project of 60 000 Afrikaans tokens by CTexT (North-West University), the developers had to answer difficult questions with regard to the annotation of the participle specifically. One of the main reasons ...
Efficient development of human language technology resources for resource-scarce languages
(2014) The development of linguistic data, especially annotated corpora, is imperative for the human language technology enablement of any language. The annotation process is, however, often time-consuming and expensive. As such, ...
Evaluation of the performance of a machine learning lemmatiser for isiXhosa
(North-West University (South Africa), Potchefstroom Campus, 2015) Human language resources (HLR) and applications currently available in South Africa are of a very basic nature, with lemmatisation being one of the basic. South African languages, except for English, are considered ...
Introducing XGL: a lexicalised probabilistic graphical lemmatiser for isiXhosa
(IEEE, 2015) In this paper, a lexicalized probabilistic graphical lemmatiser for isiXhosa, XGL, is presented. An overview of isiXhosa lemmatisation issues is given, followed by a discussion on previous work in automated lemmatisation ...
Outomatiese Setswana lemma-identifisering
(North-West University, 2006) Within the context of natural language processing, a lemmatiser is one of the most important core technology modules that has to be developed for a particular language. A lemmatiser reduces words in a corpus to the ...