I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition

被引:0
|
作者
Kang, Woo Hyun
Cho, Won Ik
Jang, Se Young
Lee, Hyeon Seung
Kim, Nam Soo [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, 1 Gwanak Ro, Seoul 08826, South Korea
来源
基金
新加坡国家研究基金会;
关键词
Speaker recognition; i-vector; DNN; NEURAL-NETWORKS;
D O I
10.1007/978-981-10-6451-7_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel scheme for considering the frame-level speaker relevancy during i-vector extraction for speaker recognition. In the proposed system, the frame-level point-wise mutual information is utilized to directly modify the Baum-Welch statistics in order to extract a robust i-vector. Furthermore, a method for computing the frame-level speaker relevancy using deep neural network (DNN) analogous to the DNN used in robust automatic speech recognition (ASR) is proposed. The results show that the modified i-vectors obtained using the proposed methods outperformed the conventional i-vectors.
引用
收藏
页码:79 / 87
页数:9
相关论文
共 50 条
  • [21] A Comparison of Covariance Matrix and i-vector Based Speaker Recognition
    Jakovljevic, Niksa
    Jokic, Ivan
    Josic, Slobodan
    Delic, Vlado
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 37 - 45
  • [22] ADDITIVE NOISE COMPENSATION IN THE I-VECTOR SPACE FOR SPEAKER RECOGNITION
    Ben Kheder, Waad
    Matrouf, Driss
    Bonastre, Jean-Francois
    Ajili, Moez
    Bousquet, Pierre-Michel
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4190 - 4194
  • [23] Analysis of I-vector Length Normalization in Speaker Recognition Systems
    Garcia-Romero, Daniel
    Espy-Wilson, Carol Y.
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 256 - 259
  • [24] DEEP BELIEF NETWORKS FOR I-VECTOR BASED SPEAKER RECOGNITION
    Ghahabi, Omid
    Hernando, Javier
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [25] Emotional Speaker Recognition Based on i-vector Space Model
    Mansour, Asma
    Chenchah, Farah
    Lachiri, Zied
    [J]. 2016 4TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2016,
  • [26] Evaluation of i-vector Speaker Recognition Systems for Forensic Application
    Mandasari, Miranti Indar
    McLaren, Mitchell
    van Leeuwen, David
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 28 - 31
  • [27] Discriminatively learned network for i-vector based speaker recognition
    Yao, Shengyu
    Zhou, Ruohua
    Zhang, Pengyuan
    Yan, Yonghong
    [J]. ELECTRONICS LETTERS, 2018, 54 (22) : 1302 - 1303
  • [28] Tied Variational Autoencoder Backends for i-Vector Speaker Recognition
    Villalba, Jesus
    Brummer, Niko
    Dehak, Najim
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1004 - 1008
  • [29] Clustering-Based I-Vector Formulation for Speaker Recognition
    Lee, Hung-Shin
    Tsao, Yu
    Wang, Hsin-Min
    Jeng, Shyh-Kang
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1101 - 1105
  • [30] Speaker Adaptation Using i-Vector Based Clustering
    Kim, Minsoo
    Jang, Gil-Jin
    Kim, Ji-Hwan
    Lee, Minho
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (07): : 2785 - 2799