Subspace Analysis of Spectral Features for Speaker Recognition
A new front-end feature extraction scheme creating so called LDA-projected magnitude spectrum(L-PMS)features is proposed for speaker recognition systems.Mainstream feature extraction schemes usually use filter-bank or linear predictive coding(LPC)in the process of converting high-dimensional speech data into low-dimensional feature vectors,which may lose important discriminative information for speaker recognition tasks.In this work,the new feature extraction scheme takes log of magnitude spectrum of windowed utterance frames.After variance normalization on the spectral features,linear discriminant analysis(LDA)is applied to create discriminatively more powerful features comparing to the conventional mel-frequency cepstral coefficient(MFCC)features.The new feature was evaluated on the TIMIT and NTIMIT corpora,using vector quantization(VQ)speaker model.The Experiments on all 630 subjects in TIMIT and NTIMIT corpora show that the proposed L-PMS features substantially outperform the conventional MFCC features in the sense of identification rate.
Ling Chen Hong Man Huading Jia Zhiyi Wang Lei Wang Zili Li
CS Department Southwestern University of Finance and Economics Chengdu,China;Aidingge Info Tech Co., ECE Department Stevens Institute of Technology Hoboken,New Jersey,USA CS Department Southwestern University of Finance and Economics Chengdu,China
国际会议
厦门
英文
98-102
2014-08-19(万方平台首次上网日期,不代表论文的发表时间)