Unsupervised help-trained LS-SVR-based segmentation in speaker diarization system

Authors
Farshad Teimoori
Farbod Razzazi
Affiliations
[1] Science and Research Branch, Department of Electrical and Computer Engineering
[2] Islamic Azad University
Keywords
Online speech segmentation; Help-training; LS-SVR; Unsupervised segmentation; Speaker diarization;
Abstract
In this paper, we propose a new segmentation method for diarization applications. In the proposed method, segmentation is performed by a discriminatively trained support vector regression, while a generative classifier helps it estimate the probable change points. Since there are no pre-labeled training samples in the segmentation task, the proposed model-based segmentation method is designed to bridge this gap. The initial samples are assumed to belong to the first speaker and are labeled accordingly in an unsupervised manner, while subsequent training samples are chosen by applying the help-training approach. A sample is considered useful for training when both the regression block and the classifier block judge the positive/negative sample to be advantageous. These samples are purified in subsequent steps, and the speakers' models are updated iteratively. In addition, a new procedure, executed once segmentation is complete, is introduced to estimate deleted and inserted change points. Compared with similar approaches, experiments show an improvement of about 29% in diarization error rate.
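The help-training selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes an RBF kernel, the standard LS-SVM regression linear system, a diagonal Gaussian per class as the generative classifier, labels of +1/-1, a confidence margin of 0.5, and a few labeled frames already available for both classes (the abstract does not say how negative samples are first obtained). All function and parameter names are hypothetical.

```python
import numpy as np

def rbf_kernel(a, b, sigma=1.0):
    # Pairwise RBF kernel between rows of a and rows of b.
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def fit_ls_svr(x, y, gamma=10.0, sigma=1.0):
    # Standard LS-SVM regression: solve
    # [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y].
    n = len(x)
    a = np.zeros((n + 1, n + 1))
    a[0, 1:] = 1.0
    a[1:, 0] = 1.0
    a[1:, 1:] = rbf_kernel(x, x, sigma) + np.eye(n) / gamma
    sol = np.linalg.solve(a, np.concatenate(([0.0], y)))
    return sol[0], sol[1:]  # bias, dual coefficients

def predict_ls_svr(x_train, bias, alpha, x, sigma=1.0):
    return rbf_kernel(x, x_train, sigma) @ alpha + bias

def gaussian_loglik(x, mean, var):
    # Log-likelihood under a diagonal Gaussian (assumed generative model).
    return -0.5 * (np.log(2 * np.pi * var) + (x - mean) ** 2 / var).sum(axis=1)

def help_train_select(x_lab, y_lab, x_unl, margin=0.5):
    """Pick unlabeled frames on which both models agree and the regressor is confident."""
    bias, alpha = fit_ls_svr(x_lab, y_lab)
    score = predict_ls_svr(x_lab, bias, alpha, x_unl)
    svr_label = np.sign(score)

    pos, neg = x_lab[y_lab > 0], x_lab[y_lab < 0]
    ll_pos = gaussian_loglik(x_unl, pos.mean(0), pos.var(0) + 1e-3)
    ll_neg = gaussian_loglik(x_unl, neg.mean(0), neg.var(0) + 1e-3)
    gen_label = np.where(ll_pos > ll_neg, 1.0, -1.0)

    keep = (svr_label == gen_label) & (np.abs(score) > margin)
    return np.flatnonzero(keep), svr_label[keep]
```

In an iterative scheme, the selected frames would be appended to the labeled set and both models refit; the margin trades off how many frames are added per round against the risk of adding mislabeled ones.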
Pages: 11743 - 11777
Page count: 34