ENERGY-BASED MULTI-SPEAKER VOICE ACTIVITY DETECTION WITH AN AD HOC MICROPHONE ARRAY

被引:24
|
作者
Bertrand, Alexander
Moonen, Marc
机构
关键词
Signal detection; Random arrays; Voice activity detection;
D O I
10.1109/ICASSP.2010.5496183
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose an energy-based technique to track the power of multiple simultaneous speakers using an ad hoc microphone array with unknown microphone positions. By considering the short-term power of the microphone signals, the problem can be converted into a non-negative blind source separation (NBSS) problem. By exploiting the prior knowledge that the source signals are non-negative and well-grounded, very efficient algorithms can be used to solve this NBSS problem, based only on second order statistics. We provide simulation results that demonstrate the effectiveness of the presented algorithm.
引用
收藏
页码:85 / 88
页数:4
相关论文
共 50 条
  • [1] Multi-Speaker Voice Activity Detection Using a Camera-assisted Microphone Array
    Bergh, Trond E.
    Hafizovicz, Ines
    Holm, Sverre
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, (IWSSIP 2016), 2016, : 327 - 330
  • [2] Multi-speaker Beamforming for Voice Activity Classification
    Tran, Thuy N.
    Cowley, William
    Pollok, Andre
    [J]. 2013 AUSTRALIAN COMMUNICATIONS THEORY WORKSHOP (AUSCTW), 2013, : 116 - 121
  • [3] MULTI-SPEAKER SEPARATION EMPLOYING MICROPHONE ARRAY AND VERTEX FINDING ALGORITHM
    Hai Quang Hong Dam
    Nordholm, Sven
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 306 - 310
  • [4] Energy-based position estimation of microphones and speakers for ad hoc microphone arrays
    Chen, Minghua
    Liu, Zicheng
    He, Li-Wei
    Chou, Phil
    Zhang, Zhengyou
    [J]. 2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 249 - 252
  • [5] Range based multi microphone array fusion for speaker activity detection in small meetings
    Even, Jani
    Heracleous, Panikos
    Ishi, Carlos
    Hagita, Norihiro
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2748 - +
  • [6] Energy-based sound source localization and gain normalization for ad hoc microphone arrays
    Liu, Zicheng
    Zhang, Zhengyou
    He, Li-Wei
    Chou, Phil
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 761 - +
  • [7] Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
    Medennikov, Ivan
    Korenevsky, Maxim
    Prisyach, Tatiana
    Khokhlov, Yuri
    Korenevskaya, Mariya
    Sorokin, Ivan
    Timofeeva, Tatiana
    Mitrofanov, Anton
    Andrusenko, Andrei
    Podluzhny, Ivan
    Laptev, Aleksandr
    Romanenko, Aleksei
    [J]. INTERSPEECH 2020, 2020, : 274 - 278
  • [8] Two-microphone multi-speaker localization based on a Laplacian Mixture Model
    Cobos, Maximo
    Lopez, Jose J.
    Martinez, David
    [J]. DIGITAL SIGNAL PROCESSING, 2011, 21 (01) : 66 - 76
  • [9] Attention-based multi-channel speaker verification with ad-hoc microphone arrays
    Liang, Chengdong
    Chen, Junqi
    Guan, Shanzheng
    Zhang, Xiao-Lei
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1111 - 1115
  • [10] Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features
    Barrington, Sarah
    Barua, Romit
    Koorma, Gautham
    Farid, Hany
    [J]. 2023 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY, WIFS, 2023,