Extending the Task of Diarization to Speaker Attribution

Cited by: 0

Authors:

Ghaemmaghami, Houman [1]

Dean, David [1]

Vogt, Robbie [1]

Sridharan, Sridha [1]

Affiliations:

[1] Queensland Univ Technol, Speech & Audio Res Lab, Brisbane, Qld 4001, Australia

Source:

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011

Keywords:

speaker attribution; diarization; clustering; cross likelihood ratio; joint factor analysis

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper we extend the concept of speaker annotation within a single recording, or speaker diarization, to a collection-wide approach we call speaker attribution. Speaker attribution is the task of clustering the expectedly homogeneous inter-session clusters obtained from diarization according to common cross-recording identities. The result of attribution is a collection of spoken audio across multiple recordings attributed to speaker identities. An attribution system is proposed that uses mean-only MAP adaptation of a combined-gender UBM to model clusters from a perfect diarization system, alongside a JFA-based system with session variability compensation. The normalized cross-likelihood ratio is calculated for each pair of clusters to construct an attribution matrix, and the complete-linkage algorithm is employed to cluster the inter-session clusters. A matched cluster purity and coverage of 87.1% was obtained on the NIST 2008 SRE corpus.
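
The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the pairwise scores are already computed and that a higher score means two clusters are more likely to share a speaker. The function name `complete_linkage`, the stopping `threshold`, and the toy score values are all hypothetical and chosen only for illustration.

```python
def complete_linkage(scores, threshold):
    """Agglomerative clustering with complete linkage on a similarity matrix.

    scores[i][j] is an assumed pairwise similarity between diarization
    clusters i and j (higher = more likely the same speaker).
    Merging stops once the best available linkage score drops below
    the (hypothetical) threshold.
    """
    clusters = [[i] for i in range(len(scores))]
    while len(clusters) > 1:
        best = None  # (linkage score, index a, index b)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Complete linkage: the least similar member pair
                # determines the score between two groups.
                link = min(scores[i][j]
                           for i in clusters[a] for j in clusters[b])
                if best is None or link > best[0]:
                    best = (link, a, b)
        if best[0] < threshold:
            break
        _, a, b = best
        clusters[a] = sorted(clusters[a] + clusters[b])
        del clusters[b]
    return clusters


# Toy symmetric "attribution matrix" for four diarization clusters.
# The values are invented for illustration, not taken from the paper.
toy_scores = [
    [0.0, 2.1, -1.5, -0.8],
    [2.1, 0.0, -1.2, -0.9],
    [-1.5, -1.2, 0.0, 1.7],
    [-0.8, -0.9, 1.7, 0.0],
]

speakers = complete_linkage(toy_scores, threshold=0.0)
print(speakers)
```

With these toy values, clusters 0 and 1 are grouped under one identity and clusters 2 and 3 under another. Complete linkage is conservative: two groups merge only if every cross-pair scores well, which helps keep a single poor match from chaining unrelated speakers together.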


Pages: 1056-1059

Number of pages: 4
