会议专题

A Improved Speech Synthesis System Utilizing BPSO-based Lip Feature Selection

To get a higher lipreading recognition result in speech synthesis system driven by visual speech, Binary Particle Swarm Optimization (BPSO) algorithms is used to select the optimal lip feature subset Experiments are carried out based on HMM with 4 states and 16 Gaussian mixture components in a small database for speaker-dependent case. Experiment results show that the integrated discriminate vector after feature selection obtained the information from the geometrical features and the pixel based features. Comparing with feature fusion based on concatenating, the recognition rates with feature selection based on BPSO are improved by as much as 2.42%.

feature Selection Binary Particle Swarm Optimization normalized geometrical feature normalized DCT coefficients Hidden Markov Model

Mengjun Wang Xiangling Wang Gang Li

School of Information Engineering HeBei University of Technology Tianjin, China School of Precision Instrument and Opto-Electronics Engineering, Tianjin University Tianjin, China

国际会议

2011 4th International Conference on Biomedical Engineering and Informatics(第四届生物医学工程与信息学国际会议 BMEI 2011)

上海

英文

1298-1301

2011-10-15(万方平台首次上网日期,不代表论文的发表时间)