Unsupervised help-trained LS-SVR-based segmentation in speaker diarization system

被引：0

作者：

Farshad Teimoori

Farbod Razzazi

机构：

[1] Science and Research Branch,Department of Electrical and Computer Engineering

[2] Islamic Azad University,undefined

来源：

Multimedia Tools and Applications | 2019年 / 78卷

关键词：

Online speech segmentation; Help-training; LS-SVR; Unsupervised segmentation; Speaker diarization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we propose a new segmentation method for diarization applications. In the proposed method, segmentation is performed using a discriminatively trained support vector regression, while a generative classifier helps it to estimate the probable change points. Since, there is no pre-labeled training samples in segmentation task, the proposed model-based segmentation method tries to suggest a proper solution to bridge this gap. It is assumed that initial applied samples are labeled with the first speaker in an unsupervised manner, while the subsequent training samples are chosen by applying the help-training approach. These samples are estimated to be conducive when both regression and classifier blocks, label positive/negative samples to be advantageous. These samples would be purified in next steps and speakers’ models would be updated iteratively. In addition, a new procedure is introduced to estimate deleted and inserted change points that is executed when segmentation is completed. In comparison to similar approaches, experiments have shown performance improvement about 29% in diarization error rate.

引用

页码：11743 / 11777

页数：34

共 50 条

[31] Speaker Diarization System based on DPCA Algorithm For Fearless Steps Challenge Phase-2
Zhang, Xueshuai
Wang, Wenchao
Zhang, Pengyuan
INTERSPEECH 2020, 2020, : 2602 - 2606
[32] Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system
Madikeri, Srikanth
Himawan, Ivan
Motlicek, Petr
Ferras, Marc
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3105 - 3109
[33] OVERLAP-AWARE LOW-LATENCY ONLINE SPEAKER DIARIZATION BASED ON END-TO-END LOCAL SEGMENTATION
Coria, Juan M.
Bredin, Herve
Ghannay, Sahar
Rosset, Sophie
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1139 - 1146
[34] Speaker Segmentation System Using Eigenvoice-based Speaker Weight Distance Method
Choi, Mu Yeol
Kim, Hyung Soon
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2012, 31 (04): : 266 - 272
[35] INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS
Dawalatabad, Nauman
Madikeri, Srikanth
Sekhar, C. Chandra
Murthy, Hema A.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6291 - 6295
[36] A hybrid HXPLS-TMFCC parameterization and DCNN-SFO clustering based speaker diarization system
Sailaja, C.
Maloji, Suman
Mannepalli, Kasiprasad
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (15):
[37] Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features
Dawalatabad, Nauman
Madikeri, Srikanth
Sekhar, C. Chandra
Murthy, Hema A.
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2199 - 2203
[38] Adaptive online prediction method based on LS-SVR and its application in an electronic system
Yang-ming GUO
Frontiers of Information Technology & Electronic Engineering, 2012, (12) : 881 - 890
[39] Adaptive online prediction method based on LS-SVR and its application in an electronic system
Yang-ming Guo
Cong-bao Ran
Xiao-lei Li
Jie-zhong Ma
Journal of Zhejiang University SCIENCE C, 2012, 13 : 881 - 890
[40] Adaptive online prediction method based on LS-SVR and its application in an electronic system
Guo, Yang-ming
Ran, Cong-bao
Li, Xiao-lei
Ma, Jie-zhong
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2012, 13 (12): : 881 - 890

← 1 2 3 4 5 →