Diarization-based Speaker Retrieval for Broadcast Television Archives

被引:0
|
作者
Huijbregts, Marijn [1 ]
van Leeuwen, David [1 ]
机构
[1] Radboud Univ Nijmegen, Dept Language & Speech Technol, Nijmegen, Netherlands
关键词
Speaker diarization; large scale speaker diarization; speaker retrieval; data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study we extend a query-by-example diarization-based speaker retrieval system to a full speaker retrieval system for broadcast television. The envisioned system is capable of finding all speakers in an archive using their names instead of example speech fragments. Information extracted from a television guide is used to label speaker clusters that most likely correspond to the found:names. As part of the labeling process, all speaker clusters are first classified automatically based on their role in the programs they appear in. The role classification accuracy is 64% on our evaluation set. Speaker names can automatically be attributed to a fraction of the speaker clusters with an accuracy of 70%.
引用
收藏
页码:1044 / 1047
页数:4
相关论文
共 50 条
  • [1] Large-Scale Speaker Diarization of Radio Broadcast Archives
    Yilmaz, Emre
    Derinel, Adem
    Zhou, Kun
    van den Heuvel, Henk
    Brummer, Niko
    Li, Haizhou
    van Leeuwen, David A.
    [J]. INTERSPEECH 2019, 2019, : 411 - 415
  • [2] Speaker diarization of French broadcast news
    Gupta, Vishwa
    Boulianne, Gilles
    Kenny, Patrick
    Ouellet, Pierre
    Dumouchel, Pierre
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4365 - 4368
  • [3] Multistage speaker diarization of broadcast news
    Barras, Claude
    Zhu, Xuan
    Meignier, Sylvain
    Gauvain, Jean-Luc
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1505 - 1512
  • [4] Robust Speaker Diarization for News Broadcast
    Karthik, M. L. N. S.
    Ganesh, Mirishkar Sai
    Patnaik, Bijayananda
    [J]. 2018 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2018,
  • [5] PLDA-based Clustering for Speaker Diarization of Broadcast Streams
    Silovsky, Jan
    Prazak, Jan
    Cerva, Petr
    Zdansky, Jindrich
    Nouza, Jan
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2920 - +
  • [6] Adaptive speaker diarization of broadcast news based on factor analysis
    Desplanques, Brecht
    Demuynck, Kris
    Martens, Jean-Pierre
    [J]. COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 72 - 93
  • [7] Speaker diarization: From broadcast news to lectures
    Zhu, X.
    Barras, C.
    Lamel, L.
    Gauvain, J-L.
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 396 - +
  • [8] Speaker Diarization in Broadcast News Using SubGlottal Resonances
    Kadijani, Homa Afaghi
    Razzazi, Farbod
    [J]. 2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [9] SPEAKER EMBEDDINGS FOR DIARIZATION OF BROADCAST DATA IN THE ALLIES CHALLENGE
    Larcher, Anthony
    Mehrish, Ambuj
    Tahon, Marie
    Meignier, Sylvain
    Carrive, Jean
    Doukhan, David
    Galibert, Olivier
    Evans, Nicholas
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5799 - 5803
  • [10] Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription
    Silovsky, Jan
    Cerva, Petr
    Zdansky, Jindrich
    Nouza, Jan
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 478 - 481