A Block-Based Blind Source Separation Approach With Equilateral Triangular Microphone Array

Abstract:

In this paper we describe a method for separating multiple speech sources using an equilateral triangular microphone array. First, the azimuth range of the horizontal plane is divided into many units, and the spatial features observed by the microphone array for a set of directions are modeled precisely. Second, the input mixture signals are segmented into blocks, and the number of active speakers and their directions are estimated in each block. Third, for each speaker, the pre-trained model with the nearest azimuth is adapted to obtain a precise model, which is then used to estimate a time-frequency binary mask. Finally, we separate every source that appears in each block and concatenate the sounds from the same unit to reproduce the whole stream. The experiments were conducted in a real meeting room. The results show that our method can separate multiple speech sources correctly with low distortion and is competitive with fully non-blind separation.

Index Terms: blind source separation, direction of arrival estimation, time-frequency mask, equilateral triangular microphone array

Authors: Jian Zhang, Zhonghua Fu, Lei Xie

Affiliation: Shaanxi Provincial Key Laboratory of Speech and Image Information Processing, School of Computer Science, Northwestern Polytechnical University, Xi’an, 710129

Conference type: International conference

Conference name: 2011 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011)

Conference location: Xi'an

Conference language: English

Pages: 1-5

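Online publication date: 2011-10-18 (the date the record first went online on the Wanfang platform; it does not indicate when the paper was published)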
