AfLaT.org - corpus https://aflat.org/taxonomy/term/116/0 en Tagging and Verifying an Amharic News Corpus https://aflat.org/content/tagging-and-verifying-amharic-news-corpus <span class="biblio-title"><a href="/content/tagging-and-verifying-amharic-news-corpus">Tagging and Verifying an Amharic News Corpus</a></span>, <span class="biblio-authors"><a href="/biblio/author/161">Gambäck, Björn</a></span> , Proceedings of the workshop on Language technology for normalisation of less-resourced languages (SALTMIL8/AfLaT2012), Istanbul, Turkey, p.79-84, (2012) <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rft.title=Tagging+and+Verifying+an+Amharic+News+Corpus&amp;rft.isbn=978-2-9517408-7-7&amp;rft.date=2012&amp;rft.spage=79&amp;rft.epage=84&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.pub=European+Language+Resources+Association+%28ELRA%29&amp;rft.place=Istanbul%2C+Turkey"></span> https://aflat.org/content/tagging-and-verifying-amharic-news-corpus#comments aflat2012 Amharic corpus part-of-speech tagging Wed, 11 Jul 2012 06:59:41 +0000 Guy 564 at https://aflat.org A Corpus of Santome https://aflat.org/content/corpus-santome <span class="biblio-title"><a href="/content/corpus-santome">A Corpus of Santome</a></span>, <span class="biblio-authors"><a href="/biblio/author/419">Hagemeijer, Tjerk</a>, <a href="/biblio/author/420">Hendrickx Iris</a>, <a href="/biblio/author/421">Amaro Haldane</a>, and <a href="/biblio/author/422">Tiny Abigail</a></span> , Proceedings of the workshop on Language technology for normalisation of less-resourced languages (SALTMIL8/AfLaT2012), Istanbul, Turkey, p.61-66, (2012) <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rft.title=A+Corpus+of+Santome&amp;rft.isbn=978-2-9517408-7-7&amp;rft.date=2012&amp;rft.spage=61&amp;rft.epage=66&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.pub=European+Language+Resources+Association+%28ELRA%29&amp;rft.place=Istanbul%2C+Turkey"></span> https://aflat.org/content/corpus-santome#comments aflat2012 corpus santome Wed, 11 Jul 2012 06:56:03 +0000 Guy 562 at https://aflat.org The Ukwabelana corpus - An annotated isiZulu corpus https://aflat.org/ukwabelana <!--paging_filter--><div class="field field-type-link field-field-url"> <div class="field-label">URL:&nbsp;</div> <div class="field-items"> <div class="field-item odd"> <a href="https://www.cs.bris.ac.uk/Research/MachineLearning/Morphology/resources.jsp#corpus" target="_blank">https://www.cs.bris.ac.uk/Research/MachineLearning/Morphology/resources.jsp#corpus</a> </div> </div> </div> <div class="field field-type-text field-field-description"> <div class="field-label">Description:&nbsp;</div> <div class="field-items"> <div class="field-item odd"> <!--paging_filter--><p><UL><br /> <LI>contains 10,000 morphologically labeled words and 3,000 POS-tagged sentences.<br /> <LI>The corpus comprises around 100,000 common Zulu word types and 30,000 Zulu sentences compiled from fictional works and the Zulu Bible, from which the labeled words and sentences have been sampled.<br /> <LI>All software and additional data used during the annotation process is provided: the partial grammar in DCG format, the abductive algorithm for parsing with incomplete information and a prototype for a POS tagger which assigns word categories to morphologically analyzed words."<br /> </UL></p> </div> </div> </div> https://aflat.org/ukwabelana#comments corpus isiZulu Zulu Wed, 14 Mar 2012 09:03:09 +0000 Guy 551 at https://aflat.org Natural Language Processing https://aflat.org/node/362 <!--paging_filter--><div class="field field-type-text field-field-description"> <div class="field-label">Description:&nbsp;</div> <div class="field-items"> <div class="field-item odd"> <!--paging_filter--><p>Current research is focussed on resources for corpus development</p> </div> </div> </div> https://aflat.org/node/362#comments Western Africa corpus Diacritic Restoration machine translation morphology Fri, 29 Jan 2010 11:34:15 +0000 sobusola 362 at https://aflat.org Speech mining to make African oral patrimony accessible https://aflat.org/node/128 <span class="biblio-title"><a href="/biblio/view/128">Speech mining to make African oral patrimony accessible</a></span>, <span class="biblio-authors"><a href="/biblio/author/28" class="biblio-local-author">Abdillahi, Nimaan</a>, <a href="/biblio/author/29">Nocera Pascal</a>, <a href="/biblio/author/31">Bonastre Jean-François</a>, and <a href="/biblio/author/40">Bêchet Frédéric</a></span> , Proceedings of LREC workshop - Networking the development of language resources for African languages, Genova - Italia, (2006) <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Fwww.aflat.org&amp;rft.title=Speech+mining+to+make+African+oral+patrimony+accessible&amp;rft.date=2006&amp;rft.aulast=Nimaan+Abdillahi++Fr%C3%A9deric+Bechet&amp;rft.aufirst=Pascal+Nocera+Jean-Fran%C3%A7ois+Bonastre&amp;rft.place=Genova+-+Italia"></span> https://aflat.org/node/128#comments Eastern Africa Tool / Application African languages corpus information access search engine Somali language speech recognition Thu, 01 Feb 2007 09:00:58 +0000 nimaan 128 at https://aflat.org Boites à outils TAL pour les langues peu informatisées: le cas du Somali https://aflat.org/node/124 <span class="biblio-title"><a href="/biblio/view/124">Boites à outils TAL pour les langues peu informatisées: le cas du Somali</a></span>, <span class="biblio-authors"><a href="/biblio/author/28" class="biblio-local-author">Abdillahi, Nimaan</a>, <a href="/biblio/author/29">Nocera Pascal</a>, and <a href="/biblio/author/30">Torres-Moreno Juan-Manuel</a></span> , Journées d&#039;Analyses des Données Textuelles, Besançon-France, p.697-705, (2006) <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Fwww.aflat.org&amp;rft.title=Boites+%C3%A0+outils+TAL+pour+les+langues+peu+informatis%C3%A9es%3A+le+cas+du+Somali&amp;rft.date=2006&amp;rft.spage=697&amp;rft.epage=705&amp;rft.aulast=Nimaan+Abdillahi++Juan-Manuel+Torres-Moreno&amp;rft.aufirst=Pascal+Nocera&amp;rft.pub=JADT+06&amp;rft.place=Besan%C3%A7on-France"></span> https://aflat.org/node/124#comments Eastern Africa Tool / Application corpus langue somalienne Langues rares outils TAL Thu, 01 Feb 2007 08:42:49 +0000 nimaan 124 at https://aflat.org