Range based multi microphone array fusion for speaker activity detection in small meetings

被引:0
|
作者
Even, Jani [1 ]
Heracleous, Panikos [1 ]
Ishi, Carlos [1 ]
Hagita, Norihiro [1 ]
机构
[1] ATR Intelligent Robot & Commun Labs, Kyoto, Japan
关键词
speaker identification; microphone array; fusion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method for speaker activity detection in small meetings. The activity of the participants is deduced from audio streams obtained by multiple microphone arrays. One of the novelty of the proposed approach is that it uses a human tracker that relies on scanning laser range finders to localize the participants. First, this additional information is exploited by the beamforming algorithm creating the audio streams for each of the microphone arrays. Then, at each array, the speaker activity detection is performed using Gaussian mixture models that were trained before hand. Finally, a fusion procedure, that also uses the location information, combines the detection results of the different microphone arrays. An experiment reproducing a meeting configuration demonstrates the effectiveness of the system.
引用
收藏
页码:2748 / +
页数:3
相关论文
共 50 条
  • [1] ENERGY-BASED MULTI-SPEAKER VOICE ACTIVITY DETECTION WITH AN AD HOC MICROPHONE ARRAY
    Bertrand, Alexander
    Moonen, Marc
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 85 - 88
  • [2] Multi-Speaker Voice Activity Detection Using a Camera-assisted Microphone Array
    Bergh, Trond E.
    Hafizovicz, Ines
    Holm, Sverre
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, (IWSSIP 2016), 2016, : 327 - 330
  • [3] MULTI-MODAL FRONT-END FOR SPEAKER ACTIVITY DETECTION IN SMALL MEETINGS
    Even, Jani
    Heracleous, Panikos
    Ishi, Carlos
    Hagita, Norihiro
    [J]. 2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 536 - 541
  • [4] MULTICHANNEL SPEAKER ACTIVITY DETECTION FOR MEETINGS
    Meyer, Patrick
    Jongebloed, Rolf
    Fingscheidt, Tim
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5539 - 5543
  • [5] Visually Supervised Speaker Detection and Localization via Microphone Array
    Berghi, Davide
    Hilton, Adrian
    Jackson, Philip J. B.
    [J]. IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [6] AUDIO INPUTS FOR ACTIVE SPEAKER DETECTION AND LOCALIZATION VIA MICROPHONE ARRAY
    Berghi, Davide
    Jackson, Philip J. B.
    [J]. 2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
  • [7] Online Speaker Change Detection by Combining BIC with Microphone Array Beamforming
    Schmalenstroeer, Joerg
    Haeb-Umbach, Reinhold
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1658 - 1661
  • [8] Speaker diarization for multi-party meetings using acoustic fusion
    Anguera, X
    Wooters, C
    Hernando, J
    [J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 426 - 431
  • [9] Speech activity detection of moving speaker using microphone arrays
    Potamitis, I
    Fishler, E
    [J]. ELECTRONICS LETTERS, 2003, 39 (16) : 1223 - 1225
  • [10] Speaker diarization for multi-microphone meetings using only between-channel differences
    Pardo, Jose M.
    Anguera, Xavier
    Wooters, Chuck
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 257 - +