Collecting and evaluating speech recognition corpora for 11 South African languages

Submitted by Guy on Tue, 2011-09-20 10:28

Title	Collecting and evaluating speech recognition corpora for 11 South African languages
Publication Type	Journal Article
Year of Publication	2011
Authors	Badenhorst, J., Van Heerden C., Davel M. H., and Barnard Etienne
Journal Title	Language Resources and Evaluation
Journal Date	09/2011
Volume	45
Issue	3
Pagination	289-309
ISSN	1574-020X
Abstract	We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of speech per language is relatively small compared to major corpora in world languages, and we report on our investigation of the stability of the ASR models derived from the corpus. We also report on phoneme distance measures across languages, and describe initial phone recognisers that were developed using this data. We find that a surprisingly small number of speakers (fewer than 50) and around 10 to 20 h of speech per language are sufficient for the purposes of acceptable phone-based recognition.
URL	https://www.springerlink.com/content/m772051343jg875k/
DOI	10.1007/s10579-011-9152-1

»

Login to post comments
Google Scholar

Also...

User login

Also hosted on AfLaT.org

Register @ aflat.org

Registered members of AfLaT.org can upload publications, add links and information on their research projects. If you would like to become a member of AfLaT.org, please contact guy♻aflat.org.