Raking the Cocktail Party

Cited by: 36
Authors
Dokmanic, Ivan [1]
Scheibler, Robin [1]
Vetterli, Martin [1]
Affiliation
[1] LCAV EPFL, CH-1015 Lausanne, Switzerland
Keywords
Acoustic rake receiver; beamforming; echo sorting; interference cancellation; noise suppression; perceptual evaluation of speech quality (PESQ); room impulse response; MICROPHONE-ARRAY; ROOM; REFLECTIONS
DOI
10.1109/JSTSP.2015.2415761
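Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809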
Abstract
We present the concept of an acoustic rake receiver: a microphone beamformer that uses echoes to improve noise and interference suppression. The rake idea is well known in wireless communications; it involves constructively combining different multipath components that arrive at the receiver antennas. Unlike the spread-spectrum signals used in wireless communications, speech signals are not orthogonal to their shifts. Therefore, we focus on the spatial structure rather than the temporal one. Instead of explicitly estimating the channel, we create correspondences between early echoes in time and image sources in space. These multiple sources of the desired and the interfering signal offer additional spatial diversity that we can exploit in the beamformer design. We present several "intuitive" and optimal formulations of acoustic rake receivers, and show theoretically and numerically that the rake formulation of the maximum signal-to-interference-and-noise ratio beamformer offers significant performance boosts in terms of noise and interference suppression. Beyond signal-to-noise ratio, we observe gains in the perceptual evaluation of speech quality (PESQ) metric. We accompany the paper with the complete simulation and processing chain written in Python. The code and the sound samples are available online at http://lcav.github.io/AcousticRake-Receiver/.
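The idea in the abstract (treat early echoes as image sources and use them in a max-SINR beamformer) can be illustrated with a minimal narrowband sketch in NumPy. This is not the authors' released code. The 2-D rectangular room, the first-order images only, the free-field steering model, the coherent sum over images, the white noise covariance, and every function and variable name below are assumptions made for illustration.

```python
import numpy as np

def first_order_images(src, room):
    """Source plus its four first-order images in a room [0, Lx] x [0, Ly] (assumed geometry)."""
    x, y = src
    Lx, Ly = room
    return np.array([[x, y], [-x, y], [2 * Lx - x, y], [x, -y], [x, 2 * Ly - y]])

def steering(points, mics, freq, c=343.0):
    """Free-field steering vectors at `freq` Hz, one column per point (assumed model)."""
    dist = np.linalg.norm(mics[:, None, :] - points[None, :, :], axis=2)
    k = 2 * np.pi * freq / c
    return np.exp(-1j * k * dist) / (4 * np.pi * dist)

def max_sinr_weights(a_target, a_interf, noise_var):
    """w = K^{-1} a_target, with K the noise-plus-interference covariance."""
    n_mics = a_target.shape[0]
    K = noise_var * np.eye(n_mics) + np.outer(a_interf, a_interf.conj())
    return np.linalg.solve(K, a_target), K

def sinr(w, a_target, K):
    """Output SINR |w^H a|^2 / (w^H K w) for unit-power source signals."""
    return abs(np.vdot(w, a_target)) ** 2 / np.real(np.vdot(w, K @ w))

# Hypothetical scene: 4 m x 6 m room, 6-microphone circular array, 1 kHz.
room = (4.0, 6.0)
angles = 2 * np.pi * np.arange(6) / 6
mics = np.array([2.0, 1.5]) + 0.1 * np.column_stack([np.cos(angles), np.sin(angles)])
freq, noise_var = 1000.0, 1e-3

A_s = steering(first_order_images((1.0, 4.5), room), mics, freq)  # desired source
A_q = steering(first_order_images((2.8, 4.3), room), mics, freq)  # interferer

# Rake: direct path and echoes of each source are summed coherently.
a_s, a_q = A_s.sum(axis=1), A_q.sum(axis=1)
w_rake, K = max_sinr_weights(a_s, a_q, noise_var)

# Baseline: aim only at the direct path of the desired source (column 0).
w_direct, _ = max_sinr_weights(A_s[:, 0], a_q, noise_var)

sinr_rake = sinr(w_rake, a_s, K)
sinr_direct = sinr(w_direct, a_s, K)
print(f"rake: {10 * np.log10(sinr_rake):.1f} dB, direct-only: {10 * np.log10(sinr_direct):.1f} dB")
```

Both beamformers are scored against the same signal model, in which the echoes carry desired signal, so the rake weights cannot do worse than the direct-path weights under this metric. The size of the gap depends on the assumed geometry and frequency.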
Pages: 825-836
Number of pages: 12