Comparative Experiments to Evaluate the Use of Syllables for Large-Vocabulary Automatic Speech Recognition

Abstract:

This paper motivates the use of syllables to enhance the performance of automatic speech recognition (ASR) systems on large-vocabulary speech. Arabic and English are used to test the proposed approach. The Arabic database consists of sentences selected from Arabic broadcast news, whereas for English the TIMIT database was used. Comparative experiments indicate that using syllables as acoustic units for both languages improves the recognition performance of HMM-based ASR systems. The Hidden Markov Model Toolkit (HTK) was used throughout the experiments. A series of speaker-independent continuous-speech recognition experiments was carried out on both databases. For the Arabic database, the recognition rate using syllables exceeds the rates obtained using monophones and triphones by 15.75% and 2.64%, respectively. For the TIMIT database, the recognition rate using syllables exceeds the rates obtained using monophones and triphones by 40.08% and 19.74%, respectively.

Authors: Hesham Tolba, Mohamed Azmi

Affiliations: Electrical Engineering Department, Faculty of Engineering, Taibah University, Al Madinah, KSA; Alexandria Higher Institute of Engineering, Department of Communication, Alexandria, Egypt

Conference type: International conference

Conference name: 2009 2nd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2009)

Conference location: Beijing

Conference language: English

Pages: 250-253

Online publication date: 2009-08-08 (date first posted on the Wanfang platform; not the paper's publication date)
