Unsupervised help-trained LS-SVR-based segmentation in speaker diarization system

被引:0
|
作者
Farshad Teimoori
Farbod Razzazi
机构
[1] Science and Research Branch,Department of Electrical and Computer Engineering
[2] Islamic Azad University,undefined
来源
关键词
Online speech segmentation; Help-training; LS-SVR; Unsupervised segmentation; Speaker diarization;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a new segmentation method for diarization applications. In the proposed method, segmentation is performed using a discriminatively trained support vector regression, while a generative classifier helps it to estimate the probable change points. Since, there is no pre-labeled training samples in segmentation task, the proposed model-based segmentation method tries to suggest a proper solution to bridge this gap. It is assumed that initial applied samples are labeled with the first speaker in an unsupervised manner, while the subsequent training samples are chosen by applying the help-training approach. These samples are estimated to be conducive when both regression and classifier blocks, label positive/negative samples to be advantageous. These samples would be purified in next steps and speakers’ models would be updated iteratively. In addition, a new procedure is introduced to estimate deleted and inserted change points that is executed when segmentation is completed. In comparison to similar approaches, experiments have shown performance improvement about 29% in diarization error rate.
引用
收藏
页码:11743 / 11777
页数:34
相关论文
共 50 条
  • [41] Mongolian word segmentation system based on unsupervised statistical model
    Wang, Siriguleng
    Bao, Meirong
    Arong
    INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 707 - 714
  • [42] Adaptive online prediction method based on LS-SVR and its application in an electronic system
    Yangming GUO Congbao RANXiaolei LIJiezhong MASchool of Computer Science and TechnologyNorthwestern Polytechnical UniversityXian China
    JournalofZhejiangUniversity-ScienceC(Computers&Electronics), 2012, 13 (12) : 881 - 890
  • [43] Semi-supervised speaker verification system based on pre-trained models
    Li, Yishuang
    Chen, Zhicong
    Miao, Shiyu
    Su, Qi
    Li, Lin
    Hong, Qingyang
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2024, 64 (11): : 1936 - 1943
  • [44] Online fault prediction for nonlinear system based on sliding ARMA combined with online LS-SVR
    Su Shengchao
    Zhang Wei
    Zhao Shuguang
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 3293 - 3297
  • [45] Hybrid speaker-based segmentation system using model-level clustering
    Kim, HG
    Ertelt, D
    Sikora, T
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 745 - 748
  • [46] Unsupervised Bayesian image segmentation using adaptive EM algorithm based on pearson system
    Rekik, Ahmed
    Zribi, Mourad
    Ben Hamida, Ahmed
    Benjelloun, Mohammed
    WMSCI 2006: 10TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS, 2006, : 165 - +
  • [47] Parameter probabilistic prediction for satellite power system based on unsupervised multi-dimensional sequence segmentation
    Kang, Shouqiang
    Gao, Yanjiao
    Song, Yuchen
    Zhou, Ruzhi
    Pang, Jingyue
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 146
  • [48] Unsupervised shape-and-texture-based generative adversarial tuning of pre-trained networks for carotid segmentation from 3D ultrasound images
    Chen, Zhaozheng
    Jiang, Mingjie
    Chiu, Bernard
    MEDICAL PHYSICS, 2024, 51 (10) : 7240 - 7256
  • [49] Skin-color segmentation by histogram based TS-type Fuzzy System trained by Support Vector Machine
    Juang, Chia-Feng
    Chiu, Shih-Hsuan
    Chung, I-Fang
    Liou, Yuan-Chang
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 947 - +
  • [50] Data-driven predictive control of molten iron quality in blast furnace ironmaking using multi-output LS-SVR based inverse system identification
    Zhou, Ping
    Guo, Dongwei
    Chai, Tianyou
    NEUROCOMPUTING, 2018, 308 : 101 - 110