Unsupervised help-trained LS-SVR-based segmentation in speaker diarization system

被引:0
|
作者
Farshad Teimoori
Farbod Razzazi
机构
[1] Science and Research Branch,Department of Electrical and Computer Engineering
[2] Islamic Azad University,undefined
来源
关键词
Online speech segmentation; Help-training; LS-SVR; Unsupervised segmentation; Speaker diarization;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a new segmentation method for diarization applications. In the proposed method, segmentation is performed using a discriminatively trained support vector regression, while a generative classifier helps it to estimate the probable change points. Since, there is no pre-labeled training samples in segmentation task, the proposed model-based segmentation method tries to suggest a proper solution to bridge this gap. It is assumed that initial applied samples are labeled with the first speaker in an unsupervised manner, while the subsequent training samples are chosen by applying the help-training approach. These samples are estimated to be conducive when both regression and classifier blocks, label positive/negative samples to be advantageous. These samples would be purified in next steps and speakers’ models would be updated iteratively. In addition, a new procedure is introduced to estimate deleted and inserted change points that is executed when segmentation is completed. In comparison to similar approaches, experiments have shown performance improvement about 29% in diarization error rate.
引用
收藏
页码:11743 / 11777
页数:34
相关论文
共 50 条
  • [1] Unsupervised help-trained LS-SVR-based segmentation in speaker diarization system
    Teimoori, Farshad
    Razzazi, Farbod
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (09) : 11743 - 11777
  • [2] LS-SVR-based solving Volterra integral equations
    Guo, X. C.
    Wu, C. G.
    Marchese, M.
    Liang, Y. C.
    APPLIED MATHEMATICS AND COMPUTATION, 2012, 218 (23) : 11404 - 11409
  • [3] Bayes Factor Based Speaker Segmentation for Speaker Diarization
    Wang, D.
    Vogt, R.
    Sridharan, S.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1405 - 1408
  • [4] Bayes Factor Based Speaker Segmentation for Speaker Diarization
    Speech and Audio Research Laboratory, Queensland University of Technology, Brisbane, Australia
    Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH, (1405-1408):
  • [5] Experiments with Segmentation in an Online Speaker Diarization System
    Kunesova, Marie
    Zajic, Zbynek
    Radova, Vlasta
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 429 - 437
  • [6] LS-SVR-based uncalibrated 4DOF visual positioning of robot
    Xin, Jing
    Liu, Ding
    Xu, Qing-Kun
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2010, 27 (01): : 77 - 85
  • [7] I-vector similarity based speech segmentation for interested speaker to speaker diarization system
    Bae, Ara
    Yoon, Ki-mu
    Jung, Jaehee
    Chung, Bokyung
    Kim, Wooil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (05): : 461 - 467
  • [8] Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings
    Dawalatabad, Nauman
    Madikeri, Srikanth
    Sekhar, C. Chandra
    Murthy, Hema A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 14 - 27
  • [9] MIMO LS-SVR-Based Multi-Point Vibration Response Prediction in the Frequency Domain
    Wang, Cheng
    Chen, Delei
    Huang, Haiyang
    Zhan, Wei
    Lai, Xiongming
    Chen, Jianwei
    APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 17
  • [10] Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation
    Lyu, Ke-Ming
    Lyu, Ren-yuan
    Chang, Hsien-Tsung
    PEERJ COMPUTER SCIENCE, 2024, 10