A Lip Reading Method Based on 3-D DCT and 3-D HMM

摘要：

Lip reading aims at recognizing what human says by analyzing visual speech information, such as lip movement. This technique is used to improve the recognition rate under the noise environment. Now, more lip feature extracting methods are developed, most of which are sensitive to lip positioning. To extract the features using Hidden Markov Model can be avoided such defect. Lip movement videos have 3-D structure. Therefore, we extend Hidden Markov Models to 3-dimensional space, construct a lip movement model using it. In order to consider lip dynamic features, we extract the feature vectors using 3-D Discrete Cosine Transform. 3-D DCT and 3-D HMM based lip reading method has following advantages: It can consider dynamic features of the lip movement. It can be robust on rotation, parallel shift and variant scaling. We tested its performance on VidTIMIT database compared with Pseudo 3-D HMM based method. Our method can increase the recognition rate about 2~ 3％ against P3-D HMM based method.

关键词： 3-D HMM 3-D DCT Lip reading

作者: Kim Yong Min Li Hong Zuo

作者单位: Mathematical faculty Kim II Sung University Pyongyang, D.P.R.K School of Electronics and Information Engineering Changchun University of Science and Technology Cha

会议类型: 国际会议

会议名称: 2011 International Conference on Electronics and Optoelectronics(2011电子学与光电子学国际会议 ICEOE 2011)

会议地点: 大连

会议语种:英文

页码: 115-119

在线出版日期: 2011-07-29（万方平台首次上网日期，不代表论文的发表时间）

会议专题

A Lip Reading Method Based on 3-D DCT and 3-D HMM