Diarization-based Speaker Retrieval for Broadcast Television Archives

被引：0

作者：

Huijbregts, Marijn ^{[1
]}

van Leeuwen, David ^{[1
]}

机构：

[1] Radboud Univ Nijmegen, Dept Language & Speech Technol, Nijmegen, Netherlands

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

Speaker diarization; large scale speaker diarization; speaker retrieval; data mining;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this study we extend a query-by-example diarization-based speaker retrieval system to a full speaker retrieval system for broadcast television. The envisioned system is capable of finding all speakers in an archive using their names instead of example speech fragments. Information extracted from a television guide is used to label speaker clusters that most likely correspond to the found:names. As part of the labeling process, all speaker clusters are first classified automatically based on their role in the programs they appear in. The role classification accuracy is 64% on our evaluation set. Speaker names can automatically be attributed to a fraction of the speaker clusters with an accuracy of 70%.

引用

页码：1044 / 1047

页数：4

共 50 条

[31] A CLUSTER-VOTING APPROACH FOR SPEAKER DIARIZATION AND LINKING OF AUSTRALIAN BROADCAST NEWS RECORDINGS
Ghaemmaghami, Houman
Dean, David
Sridharan, Sridha
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4829 - 4833
[32] Speaker Diarization of Broadcast Audio Using Automatic Transcription, iVectors and Cosine Distance Scoring
Prazak, Jan
Bohac, Marek
[J]. PROCEEDINGS ELMAR-2012, 2012, : 211 - 214
[33] Active Learning Based Constrained Clustering For Speaker Diarization
Yu, Chengzhu
Hansen, John H. L.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2188 - 2198
[34] A DOA based speaker diarization system for real meetings
Araki, Shoko
Fujimoto, Masakiyo
Ishizuka, Kentaro
Sawada, Hiroshi
Makino, Shoji
[J]. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 30 - 33
[35] SPEAKER DIARIZATION OF MEETINGS BASED ON SPEAKER ROLE N-GRAM MODELS
Valente, Fabio
Vijayasenan, Deepu
Motlicek, Petr
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4416 - 4419
[36] Local Distribution Based Density Clustering for Speaker Diarization
Rho, Jinsang
Shon, Suwon
Kim, Sung Soo
Lee, Jae-Won
Ko, Hanseok
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (04): : 303 - 309
[37] Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
Dabbabi, Karim
Hajji, Salah
Cherif, Adnen
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
[38] Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
Karim Dabbabi
Salah Hajji
Adnen Cherif
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2017
[39] SPEAKER DIARIZATION OF BROADCAST STREAMS USING TWO-STAGE CLUSTERING BASED ON I-VECTORS AND COSINE DISTANCE SCORING
Silovsky, Jan
Prazak, Jan
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4193 - 4196
[40] Optimized speaker change detection approach for speaker segmentation towards speaker diarization based on deep learning
VijayKumar, K.
Rao, R. Rajeswara
[J]. DATA & KNOWLEDGE ENGINEERING, 2023, 144

← 1 2 3 4 5 →