会议专题

APPROACHES TO LANGUAGE IDENTIFICATION USING FEATURE-LEVEL AND DECISION-LEVEL FUSION

The PRLM (phone recognizer followed by language model) system has the best performance on NIST language recognition evaluation tasks. However, its orthographical or phonetic transcription and heavy computational demands may preclude their use in low cost, realistic applications. An alternative approach to LID uses Gaussian mixture models (GMMs) to classify languages using the acoustic content of the speech signal. In this paper, we propose the feature-level and decision-level multiple features fusion to the GMM-LM (GMM recognizer followed by language model) system for a higher level of performance. The experiments show that decision-level fusion produces the best accuracy 83.6% which is 16.3% and 52% relative improvement over feature-level fusion and energy result.

Feature-level fusion Decision-level fusion GMM-LM

XIUHUA ZENG JIAN YANG DAN XU

School of Information Science and Engineering, Yunnan University, Kunming, China

国际会议

The Second International Conference on Information & Systems Sciences(ICISS2008)(第二届信息与系统科学国际会议)

大连

英文

795-801

2008-12-18(万方平台首次上网日期,不代表论文的发表时间)