Improvement of sector based Multiple speaker localization in a smart room
Recent advances in computer technology and speech processing and the interest on human-machine communication have made possible development of hands-free speech application with microphone array in smart room environments. One of the most important tasks in a smart room is localization of multispeaker that permits a wide spectrum of application. Combined of hyperbolae produced by time delay estimation (TDE) between several microphones pair utilizes for source localization. In this paper, by using the TDE combination based on multiplication of spatial likelihood function (SLFs) generated from each microphone pair and the head orientation information, a new acoustic multi-speaker localization function has been proposed that we call it OPROD-PHAT. For the search space reduction divided the space of meeting room into a few sections, and for each time frame, we estimate the average OPROD-PHAT function output power within a volume of section, and by using a new two step adaptive threshold, we determined much better which sections contain active speaker. Finally we also implemented a closed-form TDOA based localization approaches for each active section. Has been shown it is a way to apply single speaker TDOA method to a multispeaker problem. The result of simulation show superior performance of proposed system.
multiperson localization time delay of arrival(TDOA) head oriantation microphone array
M.Hesam H.Marvi
Department of electronic and robotic engineering, Shahrood University of technology, Shahrood, Iran
国际会议
2010 IEEE 10th International Conference on Signal Processing(第十届信号处理国际会议 ICSP 2010)
北京
英文
470-473
2010-08-24(万方平台首次上网日期,不代表论文的发表时间)