Speaker Diarization and Detection System using A Priori Speaker Information

Cited by: 0

Authors:

Kenai, Ouassila [1]

Asbai, Nassim [1]

Ouamour, Siham [1]

Guerti, Mhania [2]

Djeghiour, Salim [2]

Affiliations:

[1] USTHB, Fac Elect & Comp Sci, Speech Com & Signal Proc Lab, Bab Ezzouar 16111, Algeria

[2] Natl Polytech Sch Algiers, Lab Speech, El Harrach 16200, Algeria

Source:

2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING (ICNLSP) | 2018

Keywords:

Diarization and Detection system; HAC clustering; SVM Models; A Priori Information;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper shows the value of supplementary information in a speaker diarization and detection system. This information is a priori speaker information: the number of speakers involved in the audio streams, and training data available for one speaker or for all the speakers involved in the conversation. Two speaker diarization systems are built using two clustering approaches: Hierarchical Ascending Classification (HAC) and Support Vector Machine (SVM) models. The impact of this a priori information is evaluated in terms of diarization error rate (DER) and speaker detection rate (SDR). Experiments on NIST2005 show that diarization and detection performance is better when both kinds of information are used (the number of speakers and training data available for one speaker) than when only the number of speakers is known. Consistent with this, the results show that speaker segmentation with SVM yields approximately 12.01% less absolute diarization error than the HAC method.
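
To make the DER metric named in the abstract concrete, the following is a minimal sketch of a simplified, frame-level error computation. It is not the authors' scoring tool: the function name `frame_der` and the toy label sequences are hypothetical, and the sketch assumes every frame is speech with exactly one active speaker, so missed speech, false alarms, and overlap are ignored and only speaker confusion is counted.

```python
from itertools import permutations


def frame_der(reference, hypothesis):
    """Simplified frame-level DER (hypothetical helper, not the paper's scorer).

    Returns the fraction of frames whose hypothesis label disagrees with the
    reference under the best one-to-one mapping of hypothesis labels to
    reference labels. Assumes all frames are single-speaker speech.
    """
    if len(reference) != len(hypothesis) or not reference:
        raise ValueError("label sequences must be non-empty and of equal length")

    ref_ids = sorted(set(reference))
    hyp_ids = sorted(set(hypothesis))

    # Pad the reference side with None so that surplus hypothesis clusters
    # can be mapped to "no speaker" (their frames then always count as errors).
    size = max(len(ref_ids), len(hyp_ids))
    ref_slots = ref_ids + [None] * (size - len(ref_ids))

    best_correct = 0
    # Exhaustive search over mappings; fine for a handful of speakers.
    for perm in permutations(ref_slots, len(hyp_ids)):
        mapping = dict(zip(hyp_ids, perm))
        correct = sum(1 for r, h in zip(reference, hypothesis) if mapping[h] == r)
        best_correct = max(best_correct, correct)

    return 1.0 - best_correct / len(reference)


# Toy example: 8 frames, 2 speakers, one frame assigned to the wrong cluster.
reference = ["A", "A", "A", "B", "B", "B", "B", "A"]
hypothesis = [0, 0, 1, 1, 1, 1, 1, 0]
print(frame_der(reference, hypothesis))  # 0.125
```

Under this reading, an absolute difference such as the 12.01% quoted above is a subtraction of two DER percentages, not a ratio between them.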

Pages: 73-78

Page count: 6
