Diarization-based Speaker Retrieval for Broadcast Television Archives

Cited by: 0
Authors
Huijbregts, Marijn [1]
van Leeuwen, David [1]
Affiliations
[1] Radboud Univ Nijmegen, Dept Language & Speech Technol, Nijmegen, Netherlands
Keywords
Speaker diarization; large scale speaker diarization; speaker retrieval; data mining
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this study, we extend a query-by-example, diarization-based speaker retrieval system into a full speaker retrieval system for broadcast television. The envisioned system can find all speakers in an archive by name rather than by example speech fragments. Information extracted from a television guide is used to label the speaker clusters that most likely correspond to the names found. As part of the labeling process, all speaker clusters are first classified automatically according to their role in the programs in which they appear. The role classification accuracy is 64% on our evaluation set. Speaker names can be attributed automatically to a fraction of the speaker clusters with an accuracy of 70%.
Pages: 1044-1047
Number of pages: 4