会议专题

Speaker Normalization Using Dynamic Frequency Warping

In an effort to reduce the degradation in a gender-independence isolated word recognition performance caused by variation character among different speaker, a dynamic frequency warping approach to speaker normalization is investigated. There are a lot of discrepancy in frequency domain which caused by vocal tract length difference among different speakers. Dynamic Frequency Warping (DFW) is an exact analog of Dynamic Time Warping (DTW) which is used to reduce the discrepancy frequency scale of speech and normalize the frequency accurately. In this paper, the DFW method is to be introduced to normalize the frequency scale of speech and then applied it to a gender-independence isolated word recognition system. The results of experiments show a large improvement in average word error rate.

Zhenhua Huang Limin Hou

School of Communication and Information Engineering, Shanghai Univ., Shanghai, China

国际会议

2008 International Conference on Audio,Language and Image Processing(2008国际声音、语言、图像过程大会)

镇江

英文

1091-1095

2008-07-07(万方平台首次上网日期,不代表论文的发表时间)