Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound
In conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs), phase information is ignored. Our recent studies have shown that phase information contains speaker-dependent characteristics. We propose a new method that extracts pitch-synchronous phase information from voiced sections only. Speaker identification experiments were performed using the NTT clean database and the JNAS database. Using the new phase extraction method, we obtained relative reductions in the speaker identification error rate of approximately 27% and 46%, respectively, for the two databases. We also obtained relative error reductions of approximately 52% and 42%, respectively, when combining phase information with the MFCC-based method.
Kohta Shimada, Kazumasa Yamamoto, Seiichi Nakagawa
Department of Computer Science and Engineering, Toyohashi University of Technology, Japan
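International conference
2011 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011)
Xi'an
English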
1-6
2011-10-18 (date first posted online on the Wanfang platform; not the paper's publication date)