Randomization Effect on Iterative-Based Speaker Diarization System for Telephone Conversations

被引:0
|
作者
Furmanov, Tal [1 ]
Aminov, Lidiya [2 ]
Moyal, Ami [2 ]
Lapidot, Itshak [2 ]
机构
[1] Appl Mat Inc, Rehovot, Israel
[2] Afeka Tel Aviv Acad Coll Engn, ACLP Afeka Ctr Language Proc, Tel Aviv, Israel
关键词
hidden-distortion model (HDM); self-organizing maps (SOM); K-means; initialization; speaker diarization;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The primary objective of speaker diarization system is to designate speech segments to one of K speakers in the conversation. We use a hidden-distortion-model (HDM)-based system. HDM allows using different emission models as speaker models. We investigate the effect of randomization in two different levels. One level is stochastic training versus deterministic training and the other, random model initialization versus preserving initialization from the previous iteration. The emission models were codebooks (CBs) trained using K-means algorithm, both, batch and stochastic versions, as well as a self-organizing map (SOM) in its stochastic version. The evaluation performed on 108 telephone conversations from the LDC CallHome corpus. We will show that randomizing is always outperforming the deterministic training. Stochastic training demonstrated relative improvement of 3.5%. Random initialization achieved relative improvement of 7.28% comparing to preservation of initialization from the previous iteration.
引用
收藏
页数:5
相关论文
共 34 条
  • [31] Replay attack: Its effect on GMM-UBM based text-independent speaker verification system
    Singh, Madhusudan
    Mishra, Jagabandhu
    Pati, Debadatta
    2016 IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ELECTRONICS ENGINEERING (UPCON), 2016, : 619 - 623
  • [32] Using the conformal embedding analysis to compensate the channel effect in the i-vector based speaker verification system
    Boulkenafet, Z.
    Bengherabi, M.
    Nouali, O.
    Cheriet, M.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG 2013), 2013,
  • [33] Iterative Learning-Based Negative Effect Compensation Control of Disturbance to Improve the Disturbance Isolation of System
    Li, Xiantao
    Wang, Lu
    Xia, Xianqi
    Liu, Yuzhang
    Zhang, Bao
    SENSORS, 2022, 22 (09)
  • [34] Effect of a Simulated Analogue Telephone Channel on the Performance of a Remote Automatic System for the Detection of Pathologies in Voice: Impact of Linear Distortions on Cepstrum-Based Assessment - Band Limitation, Frequency Response and Additive Noise
    Fraile, Ruben
    Saenz-Lechon, Nicolas
    Ignacio Godino-Llorente, Juan
    Osma-Ruiz, Victor
    Fredouille, Corinne
    BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, 2010, 52 : 173 - +