Conference paper

Multi-modal Gesture Recognition using Integrated Model of Motion, Audio and Video

  Gesture recognition is needed in many practical applications such as human-robot interaction and sign language recognition. In this paper, we propose a novel model that integrates the multi-modal features of motion, audio and video captured from a Kinect sensor. The proposed framework is able to recognize complex motions by using these modal features. We use Hidden Markov Models to construct the motion and audio classifiers and Random Forests to construct the video classifiers. For the motion and audio classifiers, we choose feature representations suitable for gesture recognition by comparing multiple features and trained models. To evaluate the proposed framework, we also compare the performance of the unimodal models against the integrated multi-modal model. In the experiments, we use the dataset provided by MMGRC, a workshop for the Multi-Modal Gesture Recognition Challenge, and the results show that the proposed framework achieves the best correct recognition rate. This indicates that the modalities complement each other and that their combination improves gesture recognition.
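The pipeline described in the abstract can be illustrated with a minimal late-fusion sketch in Python. This is not the authors' implementation: per-class Gaussian HMMs (hmmlearn) score motion and audio sequences, a Random Forest (scikit-learn) classifies per-clip video descriptors, and the per-class scores are combined with a weighted sum. The gesture labels, feature shapes, fusion weights and softmax normalization are illustrative assumptions; the paper's actual integration rule may differ.

import numpy as np
from hmmlearn.hmm import GaussianHMM            # pip install hmmlearn
from sklearn.ensemble import RandomForestClassifier

GESTURE_CLASSES = ["wave", "point", "clap"]     # placeholder gesture labels

def train_hmms(sequences_by_class, n_states=5):
    # One Gaussian HMM per gesture class, trained on variable-length sequences
    # (each sequence is a (T, D) array of motion or audio features).
    models = {}
    for label, seqs in sequences_by_class.items():
        X = np.concatenate(seqs)                # stack all frames
        lengths = [len(s) for s in seqs]        # per-sequence lengths for hmmlearn
        models[label] = GaussianHMM(n_components=n_states).fit(X, lengths)
    return models

def hmm_log_likelihoods(models, seq):
    # Log-likelihood of one observation sequence under each class HMM.
    return np.array([models[c].score(seq) for c in GESTURE_CLASSES])

def train_video_rf(video_features, labels, n_trees=100):
    # Random Forest on fixed-length per-clip video descriptors.
    return RandomForestClassifier(n_estimators=n_trees).fit(video_features, labels)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def fuse(motion_ll, audio_ll, video_proba, weights=(0.4, 0.3, 0.3)):
    # Late fusion: normalize each modality's class scores, then weight and sum.
    # video_proba must be ordered like GESTURE_CLASSES (see rf.classes_).
    combined = (weights[0] * softmax(motion_ll)
                + weights[1] * softmax(audio_ll)
                + weights[2] * np.asarray(video_proba))
    return GESTURE_CLASSES[int(np.argmax(combined))]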

Gesture Recognition; Multi-modal Integration; Hidden Markov Model; Random Forests

Yusuke Goutsu, Takaki Kobayashi, Junya Obara, Ikuo Kusajima, Kazunari Takeichi, Wataru Takano, Yoshihiko Nakamura

Mechano-Informatics, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan

International conference

The 3rd IFToMM Asian Conference on Mechanism and Machine Science & 2014 International Conference on Mechanism and Machine Science (the 3rd IFToMM Asian Conference on Mechanism and Machine Science and the 2014 Cross-Strait Conference on Mechanism Science)

Tianjin

English

1-7

2014-07-06 (date the paper first appeared on the Wanfang platform, not necessarily its publication date)