Information Structure in African Languages: Corpora and Tools

TitleInformation Structure in African Languages: Corpora and Tools
Publication TypeConference Paper
Year of Publication2009
AuthorsChiarcos, C., Fiedler I., Grubic M., Haida A., Hartmann K., Ritz J., Schwarz A., Zeldes A., and Zimmermann M.
BooktitleProceedings of the First Workshop on Language Technologies for African Languages (AfLaT 2009)
PublisherAssociation for Computational Linguistics
LocationAthens, Greece
EditorDe Pauw, Guy, de Schryver Gilles-Maurice, and Levin Lori

In this paper, we describe tools and resources for the study of African languages developed at the Collaborative Research Centre "Information Structure". These include deeply annotated data collections of 25 subsaharan languages that are described together with their annotation scheme, and further, the corpus tool ANNIS that provides a unified access to a broad variety of annotations created with a range of different tools. With the application of ANNIS to several African data collections, we illustrate its suitability for the purpose of language documentation, distributed access and the creation of data archives.

W09-0703.pdf284.59 KB