Gĩkũyũ Diacritic Placement - Demo
The orthography of Gĩkũyũ includes a number of accented characters to represent the entire vowel system (namely ĩ and ũ). Not available on standard computer keyboards, these characters are usually typed as the nearest available characters (i and u). Our approach, based on machine learning techniques, automatically places diacritics on unmarked Gĩkũyũ text (estimated accuracy of 90%) without the need for an extensive digital lexicon.
Type in the text you want to add diacritics to
Example: gitoi kimenyaga kierwo (he who does not know, knows after being told)
Authors:
Peter Waiganjo Wagacha: School of Computing and Informatics, University of Nairobi, Nairobi, Kenya, waiganjo [at] uonbi [dot] ac [dot] ke
Guy De Pauw: CNTS - Language Technology Group, University of Antwerp, Antwerp, Belgium, guy [dot] depauw [at] ua [dot] ac [dot] be
Pauline W. Githinji: School of Computing and Informatics, University of Nairobi, Nairobi, Kenya, pnishus [at] yahoo [dot] com
Paper
- Login to post comments
comment
Interesting!