I-vector Based Speaker Gender Recognition
Automatic gender recognition has been becoming very important in potential applications.Many state-of-the-art gender recognition approaches based on a variety of biometrics,such as face,body shape,voice,are proposed recently.Among them,relying on voice is suboptimal due to significant variations in pitch,emotion,and noise in real-world speech.Inspired from the speaker recognition approaches relying on i-vector presentation in NIST SRE,its believed that i-vector contains information about gender as a part of speakers characters,and works for speaker recognition as well as for gender recognition in complex environments.So,we apply the total variability space analysis to gender classification and propose i-vector based discrimination for speaker gender recognition.The results of experiments on TIMIT corpus and NUST603_2014 database show that the proposed i-vector based speaker gender recognition improves the performance up to 99.9%,and surpasses the pitch method and UBM-SVM baseline subsystems in term of accuracy comparatively.
speech processing gender recognition i-vector mel frequency cepstrum coefficient
Minghe Wang Ying Chen Zhenmin Tang Erhua Zhang
School of Computer Science and Engineering Nanjing University of Science and Technology, NUST Nanjing, China
国际会议
重庆
英文
729-732
2015-12-19(万方平台首次上网日期,不代表论文的发表时间)