会议专题

Effect of Gaussian Densities and Amount of Training Data on Grapheme-Based Acoustic Modeling for Arabic

Grapheme-based acoustic modeling for Arabic is a demanding research area since high phonetic transcription accuracy is not yet solved completely. In this paper, we are studying the use of a pure grapheme-based approach using Gaussian mixture model to implicitly model missing diacritics and investigating the effect of Gaussian densities and amount of training data on speech recognition accuracy. Two transcription systems were built: a phoneme-based system and a grapheme-based system. Several acoustic models were created with each system by changing the number of Gaussian densities and the amount of training data. Results show that by increasing the number of Gaussian densities or the amount of training data, the improvement rate in the grapheme-based approach was found to be faster than in the phoneme-based approach. Hence the accuracy gap between the two approaches can be compensated by increasing either the number of Gaussian densities or the amount of training data.

Acoustic modeling Arabic language Graphemic modeling Speech recognition

Mohamed ELMAHDY Rainer GRUHN Wolfgang MINKER Slim ABDENNADHER

Faculty of Engineering & Computer Science, University of Ulm, Ulm, Germany Faculty of Media Engineer SVOX AG, Ulm, Germany Faculty of Engineering & Computer Science, University of Ulm, Ulm, Germany Faculty of Media Engineering & Technology, German University in Cairo, Cairo, Egypt

国际会议

International Conference on Natural Language Processing and Knowledge Engineering(IEEE自然语言处理与知识工程国际会议 IEEE NLP-KE 2009)

大连

英文

1-5

2009-09-24(万方平台首次上网日期,不代表论文的发表时间)