Containing overgeneration in Zulu computational morphology
Title | Containing overgeneration in Zulu computational morphology |
Publication Type | Conference Paper |
Year of Publication | 2007 |
Authors | Pretorius, Laurette, and Bosch, Sonja E. |
Booktitle | Human Language Technologies as a Challenge for Computer Science and Linguistics, Proceedings of 3rd Language and Technology Conference |
Date | October 2007 |
Publisher | Wydawnictwo Poznańskie Sp. z o.o. |
Location | Poznań |
Editor | Vetulani, Z. |
ISBN Number | 978-83-7177-407-2 |
Abstract | The development of a large-coverage computational morphological analyser for Zulu requires modelling not only the regular phenomena often associated with word formation, but also the idiosyncratic behaviour that may occur in Zulu morphology. This paper discusses the application of an existing rule-based finite-state morphological analyser prototype, ZulMorph, in semi-automating the mining of available Zulu language corpora for idiosyncratic behaviour. The semi-automated procedure makes provision for bootstrapping the morphological analyser to include newly extracted information from corpora. Of particular interest is also the central role that the machine-readable lexicon plays. The procedure is applied to a Zulu development corpus of 30 000 types and the results are given and discussed. |