INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION OF MEETINGS USING NON-SPEECH AS SIDE INFORMATION

Cited by: 0

Authors:

Yella, Sree Harsha [1]

Bourlard, Herve [1]

Affiliations:

[1] Idiap Res Inst, CH-1920 Martigny, Switzerland

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

speaker diarization; spontaneous meeting recordings; information bottleneck; clustering; side information;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Background noise and errors in speech/non-speech detection significantly degrade the output of a speaker diarization system. In a typical speaker diarization system, non-speech segments are excluded prior to unsupervised clustering. In the current study, we exploit the information present in the non-speech segments of a recording to improve the output of a speaker diarization system based on the information bottleneck framework. This is achieved by providing information from non-speech segments as side (irrelevant) information to information bottleneck based clustering. Experiments on meeting recordings from the RT 06, 07, and 09 evaluation sets show that the proposed method decreases the diarization error rate by around 18% relative to the baseline speaker diarization system based on the information bottleneck framework. Comparison with a state-of-the-art system based on the HMM/GMM framework shows that the proposed method significantly narrows the gap in performance between the information bottleneck system and the HMM/GMM system.
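
As a rough illustration of what "side (irrelevant) information" means in an information bottleneck setting, the objective below is a sketch of the usual formulation of information bottleneck clustering with side information. The notation is an assumption and does not come from this record: X stands for the speech segments to be clustered, C for the cluster (speaker) assignments, Y for relevance variables estimated from speech, Y^- for irrelevance variables estimated from non-speech, and beta and gamma for trade-off weights. The exact objective used in the paper may differ.

```latex
% Sketch under assumed notation (not taken from the record):
%   X    : speech segments to cluster
%   C    : cluster (speaker) assignments
%   Y    : relevance variables, estimated from speech
%   Y^-  : irrelevance (side) variables, estimated from non-speech
%   \beta, \gamma : hypothetical trade-off weights
\max_{p(c \mid x)} \; \mathcal{F}
  \;=\; I(Y; C) \;-\; \gamma \, I(Y^{-}; C) \;-\; \frac{1}{\beta} \, I(X; C)
```

Read this way, the clustering is rewarded for preserving information about the speech-derived variables, penalized for preserving information about the non-speech-derived variables, and penalized for retaining too much detail about the input segments.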


Pages: 5
