warning: Creating default object from empty value in /home/webserver/html/aflat/modules/taxonomy/ on line 33.

The Ukwabelana corpus - An annotated isiZulu corpus


  • contains 10,000 morphologically labeled words and 3,000 POS-tagged sentences.
  • The corpus comprises around 100,000 common Zulu word types and 30,000 Zulu sentences compiled from fictional works and the Zulu Bible, from which the labeled words and sentences have been sampled.
  • All software and additional data used during the annotation process is provided: the partial grammar in DCG format, the abductive algorithm for parsing with incomplete information and a prototype for a POS tagger which assigns word categories to morphologically analyzed words."

Information access in indigenous languages: a case study in Zulu

Information access in indigenous languages: a case study in Zulu, Cosijn, Erica, Pirkola Ari, Bothma Theo, and Järvelin Kalervo , Emerging Frameworks and Methods. Proceedings of the Fourth International Conference on Conceptions of Library and Information Science (CoLIS4), Seattle, WA, USA, p.221-238, (2002)

Zulu - English dictionary


An on-line Zulu-English English-Zulu translation dictionary.

Syndicate content