会议专题

Ranknet Based English Stressed Syllable Detection

A lot of time or frequency domain speech features had been applied to address the problem of English stressed syllable detection. Researchers had proved that the combination of multiple features is necessary to get better performance. But up to now, the tasks of seeking new feasible speech features and innovative feature fusion approaches are still open. This paper proposes a detection-by-ranking approach to address the stressed syllable detection problem based on the RankNet technique. The approach is able to find out the stressed syllable through one by one comparison of feature vectors corresponding to vowels of syllables in a multi-syllable word. This paper also introduces the fractal dimensions of each vowel as one type of the stress features. Experiments conducted on the corpus TIMIT show that the proposed feature fusion method reaches high performance and the introducing of fractal dimension is helpful for improving the detection correct rate.

Xiaohong Yang Qingcai Chen Ling Wan Xiaolong Wang

Department of Computer Science and Technology Key Laboratory of Network Oriented Intelligent Computation Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China, 518055

国际会议

第十届中国虚拟现实年会

上海

英文

1084-1089

2010-10-20(万方平台首次上网日期,不代表论文的发表时间)