Information structure in African languages: corpora and tools

TitleInformation structure in African languages: corpora and tools
Publication TypeJournal Article
Year of Publication2011
AuthorsC. Chiarcos, Fiedler I., M. Grubic, K. Hartmann, J. Ritz, Schwarz A., Zeldes A., and M. Zimmermann
Journal TitleLanguage Resources and Evaluation
Journal Date09/2011

In this paper, we describe tools and resources for the study of African languages developed at the Collaborative Research Centre 632 “Information Structure”. These include deeply annotated data collections of 25 sub-Saharan languages that are described together with their annotation scheme, as well as the corpus tool ANNIS, which provides unified access to a broad variety of annotations created with a range of different tools. With the application of ANNIS to several African data collections, we illustrate its suitability for the purpose of language documentation, distributed access, and the creation of data archives.