Unsupervised help-trained LS-SVR-based segmentation in speaker diarization system

被引:0
|
作者
Farshad Teimoori
Farbod Razzazi
机构
[1] Science and Research Branch,Department of Electrical and Computer Engineering
[2] Islamic Azad University,undefined
来源
关键词
Online speech segmentation; Help-training; LS-SVR; Unsupervised segmentation; Speaker diarization;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a new segmentation method for diarization applications. In the proposed method, segmentation is performed using a discriminatively trained support vector regression, while a generative classifier helps it to estimate the probable change points. Since, there is no pre-labeled training samples in segmentation task, the proposed model-based segmentation method tries to suggest a proper solution to bridge this gap. It is assumed that initial applied samples are labeled with the first speaker in an unsupervised manner, while the subsequent training samples are chosen by applying the help-training approach. These samples are estimated to be conducive when both regression and classifier blocks, label positive/negative samples to be advantageous. These samples would be purified in next steps and speakers’ models would be updated iteratively. In addition, a new procedure is introduced to estimate deleted and inserted change points that is executed when segmentation is completed. In comparison to similar approaches, experiments have shown performance improvement about 29% in diarization error rate.
引用
收藏
页码:11743 / 11777
页数:34
相关论文
共 50 条
  • [31] Speaker Diarization System based on DPCA Algorithm For Fearless Steps Challenge Phase-2
    Zhang, Xueshuai
    Wang, Wenchao
    Zhang, Pengyuan
    INTERSPEECH 2020, 2020, : 2602 - 2606
  • [32] Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system
    Madikeri, Srikanth
    Himawan, Ivan
    Motlicek, Petr
    Ferras, Marc
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3105 - 3109
  • [33] OVERLAP-AWARE LOW-LATENCY ONLINE SPEAKER DIARIZATION BASED ON END-TO-END LOCAL SEGMENTATION
    Coria, Juan M.
    Bredin, Herve
    Ghannay, Sahar
    Rosset, Sophie
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1139 - 1146
  • [34] Speaker Segmentation System Using Eigenvoice-based Speaker Weight Distance Method
    Choi, Mu Yeol
    Kim, Hyung Soon
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2012, 31 (04): : 266 - 272
  • [35] INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS
    Dawalatabad, Nauman
    Madikeri, Srikanth
    Sekhar, C. Chandra
    Murthy, Hema A.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6291 - 6295
  • [36] A hybrid HXPLS-TMFCC parameterization and DCNN-SFO clustering based speaker diarization system
    Sailaja, C.
    Maloji, Suman
    Mannepalli, Kasiprasad
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (15):
  • [37] Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features
    Dawalatabad, Nauman
    Madikeri, Srikanth
    Sekhar, C. Chandra
    Murthy, Hema A.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2199 - 2203
  • [39] Adaptive online prediction method based on LS-SVR and its application in an electronic system
    Yang-ming Guo
    Cong-bao Ran
    Xiao-lei Li
    Jie-zhong Ma
    Journal of Zhejiang University SCIENCE C, 2012, 13 : 881 - 890
  • [40] Adaptive online prediction method based on LS-SVR and its application in an electronic system
    Guo, Yang-ming
    Ran, Cong-bao
    Li, Xiao-lei
    Ma, Jie-zhong
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2012, 13 (12): : 881 - 890