ON THE EFFECT OF SNR AND SUPERDIRECTIVE BEAMFORMING IN SPEAKER DIARISATION IN MEETINGS

Cited by: 0

Authors:

Zwyssig, Erich [1]

Renals, Steve [1]

Lincoln, Mike [1]

Affiliation:

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9AB, Midlothian, Scotland

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Speaker diarisation in meetings; digital MEMS microphone array; time difference of arrival (TDOA); superdirective beamforming;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper examines the effect of sensor performance on speaker diarisation in meetings and investigates the use of more advanced beamforming techniques, beyond the typically employed delay-sum beamformer, for mitigating the effects of poorer sensor performance. We present superdirective beamforming and investigate how different time difference of arrival (TDOA) smoothing and beamforming techniques influence the performance of state-of-the-art diarisation systems. We produced and transcribed a new corpus of meetings recorded in the instrumented meeting room using a high SNR analogue and a newly developed low SNR digital MEMS microphone array (DMMA.2). This research demonstrates that TDOA smoothing has a significant effect on the diarisation error rate and that simple noise reduction and beamforming schemes suffice to overcome audio signal degradation due to the lower SNR of modern MEMS microphones.

Pages: 4177 - 4180

Number of pages: 4

50 items in total

[1] Acoustic beamforming for speaker diarization of meetings
Anguera, Xavier
Wooters, Chuck
Hernando, Javier
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) : 2011 - 2022
[2] Who said that?: Audio-visual speaker diarisation of real-world meetings
Chung, Joon Son
Lee, Bong-Jin
Han, Icksang
[J]. INTERSPEECH 2019, 2019, : 371 - 375
[3] Adapting Speaker Embeddings for Speaker Diarisation
Kwon, Youngki
Jung, Jee-weon
Heo, Hee-Soo
Kim, You Jin
Lee, Bong-Jin
Chung, Joon Son
[J]. INTERSPEECH 2021, 2021, : 3101 - 3105
[4] CONTENT-AWARE SPEAKER EMBEDDINGS FOR SPEAKER DIARISATION
Sun, G.
Liu, D.
Zhang, C.
Woodland, P. C.
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7168 - 7172
[5] Combination of deep speaker embeddings for diarisation
Sun, Guangzhi
Zhang, Chao
Woodland, Philip C.
[J]. NEURAL NETWORKS, 2021, 141 : 372 - 384
[6] DNN APPROACH TO SPEAKER DIARISATION USING SPEAKER CHANNELS
Milner, Rosanna
Hain, Thomas
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4925 - 4929
[7] Speaker overlap detection with prosodic features for speaker diarisation
Zelenak, M.
Hernando, J.
[J]. IET SIGNAL PROCESSING, 2012, 6 (08) : 798 - 804
[8] DNN-based speaker clustering for speaker diarisation
Milner, Rosanna
Hain, Thomas
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2185 - 2189
[9] DISCRIMINATIVE NEURAL CLUSTERING FOR SPEAKER DIARISATION
Li, Qiujia
Kreyssig, Florian L.
Zhang, Chao
Woodland, Philip C.
[J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 574 - 581
[10] Strategies to Improve a Speaker Diarisation Tool
Tavarez, David
Navas, Eva
Erro, Daniel
Saratxaga, Ibon
[J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 4117 - 4121