Speaker Diarization and Detection System using A Priori Speaker Information

Cited by: 0
Authors
Kenai, Ouassila [1]
Asbai, Nassim [1]
Ouamour, Siham [1]
Guerti, Mhania [2]
Djeghiour, Salim [2]
Affiliations
[1] USTHB, Fac Elect & Comp Sci, Speech Com & Signal Proc Lab, Bab Ezzouar 16111, Algeria
[2] Natl Polytech Sch Algiers, Lab Speech, El Harrach 16200, Algeria
Keywords
Diarization and Detection system; HAC clustering; SVM Models; A Priori Information;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper demonstrates the benefit of supplementary information in a speaker diarization and detection system. The supplementary information is a priori speaker information: the number of speakers involved in the audio stream, and training data available either for one speaker or for all the speakers involved in the conversation. Two speaker diarization systems are built using two clustering approaches: Hierarchical Ascending Classification (HAC) and Support Vector Machine (SVM) models. The impact of this a priori information is evaluated in terms of diarization error rate (DER) and speaker detection rate (SDR). Experiments on NIST2005 show that diarization and detection performance is better when both kinds of information are used (the number of speakers and training data available for one speaker) than when only the number of speakers is known. Consistent with this, the results show that speaker segmentation with SVM yields an absolute diarization error approximately 12.01% lower than that of the HAC method.
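As an illustration of how a known speaker count can serve as a priori information in agglomerative clustering, the following is a minimal sketch, not the authors' implementation. The abstract does not state the segment features or the distance measure, so the use of small feature tuples, centroid linkage with squared Euclidean distance, and the names `hac_known_speakers`, `centroid`, `sq_dist`, and `toy_segments` are all assumptions made for this example.

```python
from itertools import combinations


def centroid(vectors):
    """Mean vector of a non-empty list of equal-length feature tuples."""
    dim = len(vectors[0])
    return tuple(sum(v[d] for v in vectors) / len(vectors) for d in range(dim))


def sq_dist(a, b):
    """Squared Euclidean distance (an assumed measure, not from the paper)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hac_known_speakers(segments, n_speakers):
    """Bottom-up clustering that stops once n_speakers clusters remain.

    segments: list of feature tuples, one per speech segment (hypothetical input).
    n_speakers: the a priori number of speakers, used as the stopping criterion.
    Returns one cluster label per segment.
    """
    # Start with every segment in its own cluster (stored as index lists).
    clusters = [[i] for i in range(len(segments))]
    while len(clusters) > n_speakers:
        # Pick the pair of clusters whose centroids are closest.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: sq_dist(
                centroid([segments[i] for i in clusters[p[0]]]),
                centroid([segments[i] for i in clusters[p[1]]]),
            ),
        )
        # combinations() guarantees a < b, so deleting b leaves a valid.
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    labels = [0] * len(segments)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


# Toy data: three segments near the origin, two near (5, 5).
toy_segments = [(0.0, 0.1), (0.2, 0.0), (5.0, 5.1), (5.2, 4.9), (0.1, 0.2)]
labels = hac_known_speakers(toy_segments, n_speakers=2)
print(labels)
```

Without the known speaker count, a clustering of this kind would need a separate stopping rule, such as a distance threshold, so the count removes one source of error from the process.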
Pages: 73-78
Number of pages: 6