会议专题

Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound

In conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs), phase information is ignored. Our recent studies have shown that phase information contains speaker dependent characteristics. We propose a new extraction method to extract pitch synchronous phase information from the voiced section only. Speaker identifi- cation experiments were performed using the NTT clean database and JNAS database. Using the new phase extraction method, we obtained a relative reduction in the speaker error rate of approximately 27% and 46%, respectively, for the two databases. We also obtained a relative error reduction of approximately 52% and 42%, respectively, when combining phase information with the MFCC-based method.

Kohta Shimada Kazumasa Yamamoto Seiichi Nakagawa

Department of Computer Science and Engineering, Toyohashi University of Technology, Japan

国际会议

2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)

西安

英文

1-6

2011-10-18(万方平台首次上网日期,不代表论文的发表时间)