- What is AfLaT?
- AfLaT 2014
- AfLaT 2013
- AfLaT 2012
- AfLaT 2011
- AfLaT 2010
- AfLaT 2009
- Bajuni Database (Derek Nurse)
- Diacritic Restoration
- Luo MT [offline]
- Northern Sotho
Submitted by Guy on Wed, 2012-07-11 07:59
Tagging and Verifying an Amharic News Corpus, , Proceedings of the workshop on Language technology for normalisation of less-resourced languages (SALTMIL8/AfLaT2012), Istanbul, Turkey, p.79-84, (2012)
Submitted by Guy on Wed, 2012-07-11 07:56
A Corpus of Santome, , Proceedings of the workshop on Language technology for normalisation of less-resourced languages (SALTMIL8/AfLaT2012), Istanbul, Turkey, p.61-66, (2012)
Submitted by Guy on Wed, 2012-03-14 10:03
- contains 10,000 morphologically labeled words and 3,000 POS-tagged sentences.
- The corpus comprises around 100,000 common Zulu word types and 30,000 Zulu sentences compiled from fictional works and the Zulu Bible, from which the labeled words and sentences have been sampled.
- All software and additional data used during the annotation process is provided: the partial grammar in DCG format, the abductive algorithm for parsing with incomplete information and a prototype for a POS tagger which assigns word categories to morphologically analyzed words."
Submitted by sobusola on Fri, 2010-01-29 12:34
Current research is focussed on resources for corpus development
Submitted by nimaan on Thu, 2007-02-01 10:00
Speech mining to make African oral patrimony accessible, , Proceedings of LREC workshop - Networking the development of language resources for African languages, Genova - Italia, (2006)
Submitted by nimaan on Thu, 2007-02-01 09:42
Boites à outils TAL pour les langues peu informatisées: le cas du Somali, , Journées d'Analyses des Données Textuelles, Besançon-France, p.697-705, (2006)