Design and Implementation of Selectable Sound Separation on the Texai Telepresence System using HARK
This paper presents the design and implementation of selectable sound separation functions on the telepresence system “Texai using the robot audition software “HARK. An operator of Texai can “walk around a faraway office to attend a meeting or talk with people through video-conference instead of meeting in person. With a normal microphone, the operator has difficulty recognizing the auditory scene of the Texai, e.g., he/she cannot know the number and the locations of sounds. To solve this problem, we design selectable sound separation functions with 8 microphones in two modes, overview and filter modes, and implement them using HARK’s sound source localization and separation. The overview mode visualizes the direction-of-arrival of surrounding sounds, while the filter mode provides sounds that originate from the range of directions he/she specifies. The functions enable the operator to be aware of a sound even if it comes from behind the Texai, and to concentrate on a particular sound. The design and implementation was completed in five days due to the portability of HARK. Experimental evaluations with actual and simulated data show that the resulting system localizes sound sources with a tolerance of 5 degrees.
Takeshi Mizumoto Kazuhiro Nakadai Takami Yoshida Ryu Takeda Takuma Otsuka Toru Takahashi Hiroshi G. Okuno
School of Informatics,Kyoto University,Sakyo,Kyoto 606-8501,Japan Tokyo Institute of Technology,2-12-1,O-okayama,Meguro-ku,Tokyo,152-8552,Japan
国际会议
2011 IEEE International Conference on Robotics and Automation(2011年IEEE世界机器人与自动化大会 ICRA 2011)
上海
英文
2130-2137
2011-05-09(万方平台首次上网日期,不代表论文的发表时间)