会议专题

An Iterative Approach to Model Merging for Speech Pattern Discovery

This paper introduces a novel approach to automatically discover recurrent speech patterns from multi-speaker corpus without a priori knowledge. The proposed approach is based on the sub-word acoustic units and it iteratively concatenates the most-likely joint sub-word units to produce a longer acoustic unit till our proposed stop criterion is satisfied. Among the resulting acoustic units, the units with the most stable number of occurrences are selected as the lexicon. The proposed approach has been applied to automatically discover English words from TIDIGIT corpus. The experimental results measured by F1 score showed the proposed approach can effectively detect and extract the recurrent patterns. This technique can be used for lexicon generation from an unknown speech corpus or in audio content summarization.

Lei Wang Eng Siong Chng Haizhou Li

Nanyang Technological University, Singapore Nanyang Technological University, Singapore Institute for Infocomm Research, Singapore

国际会议

2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)

西安

英文

1-5

2011-10-18(万方平台首次上网日期,不代表论文的发表时间)