Phone Adaptive Training for Speaker Diarization

Cited by: 0

Authors:

Bozonnet, Simon [1]

Vipperla, Ravichander [1]

Evans, Nicholas [1]

Affiliation:

[1] EURECOM, F-06904 Sophia Antipolis, France

Source:

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012

Keywords:

Speaker Diarization; Phone Adaptive Training; Speaker Discrimination;

DOI:

Not available

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The linguistic content of a speech signal is a source of unwanted variation which can degrade speaker diarization performance. This paper presents our latest work to reduce its impact. The new approach, referred to as Phone Adaptive Training (PAT), is analogous to speaker adaptive training used in automatic speech recognition. We report an oracle experiment which shows that PAT has the potential to deliver a 33% relative improvement in the diarization error rate over our baseline system. Practical experiments show significant improvements across two standard, independent evaluation datasets.


Pages: 494-497

Page count: 4