SPEAKER DIARIZATION OF MEETINGS BASED ON LARGE TDOA FEATURE VECTORS

被引：0

作者：

Vijayasenan, Deepu ^{[1
]}

Valente, Fabio ^{[2
]}

机构：

[1] Univ Saarland, Saarbrucken, Germany

[2] Idiap Res Inst, Martigny, Switzerland

来源：

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年

关键词：

Speaker diarization; Time Delay Of Arrival features; Meetings Recordings; Model combination;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper investigates the use of large TDOA feature vectors together with acoustic information in speaker diarization of meetings. TDOAs are obtained by considering all possible microphones pairs and this approach is compared with conventional TDOA features extracted w.r.t. a reference channel. The study is carried using two systems, the first based on Gaussian Mixture Modeling and the second based on the Information Bottleneck approach. Results on NIST RT06/RT07/RT09 evaluation datasets show a large speaker error reduction of 30% relative going from 14.3% to 10.8% for the first and from 12.3% to 8.2% for the second whenever the feature weighting is properly handled. Furthermore results reveal that the IB system is more robust to different number of microphones even when all pairs large TDOA vectors are used thus outperforming the HMM/GMM by 25% relative (8.2% error compared to 10.8%).

引用

页码：4173 / 4176

页数：4

共 50 条

[1] Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features
Vijayasenan, Deepu
Valente, Fabio
Bourlard, Herve
[J]. SPEECH COMMUNICATION, 2012, 54 (01) : 55 - 67
[2] Automatic weighting for the combination of TDOA and acoustic features in speaker diarization for meetings
Anguera, Xavier
Wooters, Chuck
Pardo, Jose M.
Hernando, Javier
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 241 - +
[3] A DOA based speaker diarization system for real meetings
Araki, Shoko
Fujimoto, Masakiyo
Ishizuka, Kentaro
Sawada, Hiroshi
Makino, Shoji
[J]. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 30 - 33
[4] Selection of TDOA Parameters for MDM Speaker Diarization
Martinez-Gonzalez, Beatriz
Pardo, Jose M.
Echeverry-Correa, Julian D.
Vallejo-Pinto, Jose A.
Barra-Chicote, Roberto
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2155 - 2158
[5] IMPROVED SPEAKER DIARIZATION SYSTEM FOR MEETINGS
El-Khoury, Elie
Senac, Christine
Pinquier, Julien
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4097 - 4100
[6] Acoustic beamforming for speaker diarization of meetings
Anguera, Xavier
Wooters, Chuck
Hernando, Javier
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2011 - 2022
[7] Automatic Speaker Positioning in Meetings Based on YOLO and TDOA
Hsieh, Chen-Chiung
Lu, Men-Ru
Tseng, Hsiao-Ting
[J]. SENSORS, 2023, 23 (14)
[8] SPEAKER DIARIZATION OF MEETINGS BASED ON SPEAKER ROLE N-GRAM MODELS
Valente, Fabio
Vijayasenan, Deepu
Motlicek, Petr
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4416 - 4419
[9] KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS
Madikeri, Srikanth
Bourlard, Herve
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4435 - 4439
[10] Clustering Initialization Based on Spatial Information for Speaker Diarization of Meetings
Luque, J.
Segura, C.
Hernando, J.
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 383 - 386

← 1 2 3 4 5 →