Conference paper

Auditory Features with Vocal Tract Length Normalization for Language Identification

This paper reports on a novel feature for language identification (LID): the auditory cepstrum coefficient (ACC) with vocal tract length normalization (VTLN). The ACC feature is based on the auditory characteristics of the human ear, and VTLN compensates for speaker variability. A detailed frequency-domain implementation of the ACC feature with VTLN is given. Experimental results show that the proposed auditory feature outperforms its widely used counterpart, the Mel-frequency cepstrum coefficient (MFCC), and is more effective when combined with VTLN.

Weiqiang Zhang, Jia Liu, Liang He

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

International conference

2008 International Conference on Audio, Language and Image Processing

Zhenjiang

English

66-70

2008-07-07 (date first made available online on the Wanfang platform; not necessarily the paper's publication date)