word list

List of Tshivenda words containing diacritics

Description: 

A list of Tshivenda words containing diacritics, distributed under the Creative Commons Attribution 2.5 South Africa License (http://creativecommons.org/licenses/by/2.5/za/).

Corpora for African languages - An Crúbadán

Description: 

The Crúbadán Project is devoted to creating basic language technology for minority and under-resourced languages using web crawling and statistical techniques. As of early 2008, we have collected text corpora for 419 languages, including more than 125 African languages, and have used these to create open-source spell checkers for more than 20 languages. Please contact Kevin Scannell (http://borel.slu.edu/) if you are interested in developing open-source resources for other African languages using these data.
