Cilubà

  • warning: Creating default object from empty value in /home/webserver/html/aflat/modules/taxonomy/taxonomy.pages.inc on line 33.
  • warning: fread(): Length parameter must be greater than 0 in /home/webserver/html/aflat/includes/common.inc(1743) : eval()'d code on line 13.

Cilubà Part-of-Speech Tagger - Demo

This demo showcases a part-of-speech tagger for Cilubà. It retrieves the morpho-syntactic categories for words in a sentence.

Type in the text you want to tag
Example: Kiipàcìlà kàà kumpàlà kàà mukàndà eu ǹkwambulwisha eu yônso udi ulonga cilubà nè ukààdi mudbìdìje nè nsùùlakajilu wacì.

[Tagging the text might take a while]

Automatic Diacritic Restoration for African Languages

The orthography of many African languages includes diacritically marked characters. Falling outside the scope of the standard Latin encoding, these characters are often represented in digital language resources as their unmarked equivalents. This renders corpus compilation more difficult, as these languages typically do not have the benefit of large electronic dictionaries to perform diacritic restoration.

This is a demonstration system for a diacritic restoration method that is able to automatically restore diacritics on the basis of local graphemic context. It is based on the machine learning method of Memory-Based learning. We have applied the method to the African languages of Cilubà, Gĩkũyũ, Kĩkamba, Maa, Sesotho sa Leboa, Tshivenḓa and Yoruba.

You can find more information on this system in this paper

Select a language and enter the word or sentence you want to restore diacritics for.
Cilubà (e.g. mutekete)
Gĩkũyũ (e.g. nituronire)
Kĩkamba (e.g. ningulilikana)
Maasai (e.g. oltunani)
Sesotho sa Leboa (Northern Sotho) (e.g. swanetse)
Tshivenḓa (e.g. tshiswitulo)
Yoruba (e.g. isinku)
 

[Processing the text might take a while]

Authors:
Guy De Pauw: CNTS - Language Technology Group, University of Antwerp, Antwerp, Belgium, guy [dot] depauw [at] ua [dot] ac [dot] be
Gilles-Maurice de Schryver: African Languages and Cultures, Ghent University, Ghent, Belgium, gillesmaurice [dot] deschryver [at] ugent [dot] be
Peter Waiganjo Wagacha: School of Computing and Informatics, University of Nairobi, Nairobi, Kenya, waiganjo [at] uonbi [dot] ac [dot] ke

Cilubà - French Dictionary

Description: 

An on-line Cilubà-French French-Cilubà translation dictionary.

Syndicate content