APPROACHES TO LANGUAGE IDENTIFICATION USING FEATURE-LEVEL AND DECISION-LEVEL FUSION
The PRLM (phone recognizer followed by language model) system has the best performance on NIST language recognition evaluation tasks. However, its orthographical or phonetic transcription and heavy computational demands may preclude their use in low cost, realistic applications. An alternative approach to LID uses Gaussian mixture models (GMMs) to classify languages using the acoustic content of the speech signal. In this paper, we propose the feature-level and decision-level multiple features fusion to the GMM-LM (GMM recognizer followed by language model) system for a higher level of performance. The experiments show that decision-level fusion produces the best accuracy 83.6% which is 16.3% and 52% relative improvement over feature-level fusion and energy result.
Feature-level fusion Decision-level fusion GMM-LM
XIUHUA ZENG JIAN YANG DAN XU
School of Information Science and Engineering, Yunnan University, Kunming, China
国际会议
The Second International Conference on Information & Systems Sciences(ICISS2008)(第二届信息与系统科学国际会议)
大连
英文
795-801
2008-12-18(万方平台首次上网日期,不代表论文的发表时间)