Compensating Function of Formant Instantaneous Characteristics in Speaker Identification
The idea of extracting typical Mel-frequency cepstral coefficients (MFCC) conducts efficient improvement in speech signal processing, though these coefficients only apply the information of magnitude without phase. This articles purpose is to form a sort of formant instantaneous characteristics (FIC) using phase information and to see these phase parametersfunction for speaker identification (SI). The procedure requests applications of Hilbert Transform (HT), bandpass filters, and mel-frequency perceptual warping. FIC together with MFCC were tested in SI experiments based on a Gaussian mixture model (GMM). And results show that FIC play a compensating role to MFCC in SI, with one of improved relative rate up to 10.13%. Experimental utterances are Chinese mandarin under clean recording circumstances.
speaker identification formant instantaneous frequency MFCC
Limin Hou Juanmin Xie
School of Communication and Information Engineering Shanghai University Shanghai,China
国际会议
The Fifth International Conference on Information Assurance and Security(第五届信息保障与安全国际会议)
西安
英文
744-747
2009-08-18(万方平台首次上网日期,不代表论文的发表时间)