Analysis of I-Vector framework for Speaker Identification in TV-shows

被引:0
|
作者
Fredouille, Corinne [1 ]
Charlet, Delphine [2 ]
机构
[1] Univ Avignon, CERI LIA, Avignon, France
[2] Orange Labs, Lannion, France
关键词
speaker identification; i-vector; REPERE challenge; TV shows;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inspired from the Joint Factor Analysis, the I-vector-based analysis has become the most popular and state-of-the-art framework for the speaker verification task. Mainly applied within the NIST/SRE evaluation campaigns, many studies have been proposed to improve more and more performance of speaker verification systems. Nevertheless, while the i-vector framework has been used in other speech processing fields like language recognition, a very few studies have been reported for the speaker identification task on TV shows. This work was done in the REPERE challenge context, focused on the people recognition task in multimodal conditions (audio, video, text) from TV show corpora. Moreover, the challenge participants are invited for providing systems for monomodal tasks, like speaker identification. The application of the i-vector framework is investigatedthrough different points of views: (1) some of the i-vector based approaches are compared, (2) a specific i-vector extraction protocol is proposed in order to deal with widely varying amounts of training data among speaker population, (3) the joint use of both speaker diarization and identification is finally analyzed. Based on a 533 speaker dictionary, this joint system wins the monomodal speaker identification task of the 2014 REPERE challenge.
引用
收藏
页码:71 / 75
页数:5
相关论文
共 50 条
  • [1] Simplification of I-Vector Extraction for Speaker Identification
    XU Longting
    YANG Zhen
    SUN Linhui
    [J]. Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
  • [2] Simplification of I-Vector Extraction for Speaker Identification
    Xu Longting
    Yang Zhen
    Sun Linhui
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (06) : 1121 - 1126
  • [3] Feature Switching in the i-vector Framework for Speaker Verification
    Asha, T.
    Saranya, M. S.
    Pandia, Karthik D. S.
    Madikeri, Srikanth
    Murthy, Hema A.
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1125 - 1129
  • [4] Speaker identification using fuzzy i-vector tree
    Galka, Jakub
    Jaciow, Pawel
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 4937 - 4949
  • [5] Improving Speaker Identification in TV-shows using person name detection in overlaid text and speech
    Charlet, Delphine
    Fredouille, Corinne
    Damnati, Geraldine
    Senay, Gregory
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2777 - 2781
  • [6] I-vector Based Text-Independent Speaker Identification
    Liu, Tingting
    Kang, Kai
    Guan, Shengxiao
    [J]. 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5420 - 5425
  • [7] Speaker Identification Using Vector Quantization and I-vector with Reference to Assamese Language
    Bharali, Sruti Sruba
    Kalita, Sanjib Kr.
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 164 - 168
  • [8] An I-Vector Backend for Speaker Verification
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Kockmann, Marcel
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
  • [9] Sparsity Analysis and Compensation for i-Vector Based Speaker Verification
    Li, Wei
    Fu, Tian Fan
    Zhu, Jie
    Chen, Ning
    [J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 381 - 388
  • [10] Feature sparsity analysis for i-vector based speaker verification
    Li, Wei
    Fu, Tianfan
    You, Hanxu
    Zhu, Jie
    Chen, Ning
    [J]. SPEECH COMMUNICATION, 2016, 80 : 60 - 70