corpora

warning: Creating default object from empty value in /home/webserver/html/aflat.bak/modules/taxonomy/taxonomy.pages.inc on line 33.

Corpora for African languages - An Crúbadán

Description: 

The Crúbadán Project is devoted to creating basic language technology for minority languages and under-resourced languages using web-crawling and statistical techniques. As of early 2008 we have collected text corpora for 419 languages, including more than 125 African languages, and have used these to create open source spell checkers for more than 20 languages. Please contact Kevin Scannell (http://borel.slu.edu/) if you are interested in developing open source resources for other African languages using these data.

Syndicate content