Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound

摘要：

In conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs), phase information is ignored. Our recent studies have shown that phase information contains speaker dependent characteristics. We propose a new extraction method to extract pitch synchronous phase information from the voiced section only. Speaker identifi- cation experiments were performed using the NTT clean database and JNAS database. Using the new phase extraction method, we obtained a relative reduction in the speaker error rate of approximately 27％ and 46％, respectively, for the two databases. We also obtained a relative error reduction of approximately 52％ and 42％, respectively, when combining phase information with the MFCC-based method.

作者: Kohta Shimada Kazumasa Yamamoto Seiichi Nakagawa

作者单位: Department of Computer Science and Engineering, Toyohashi University of Technology, Japan

会议类型: 国际会议

会议名称: 2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)

会议地点: 西安

会议语种:英文

页码: 1-6

在线出版日期: 2011-10-18（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound