Comparison of algorithms for speaker identification under adverse far-field recording conditions with extremely short utterances

被引:0
|
作者
Tang, Hao [1 ]
Chen, Zhixiong [2 ]
Huang, Thomas S. [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
[2] Mercy Coll, MCIS, Dobbs Ferry, NY 10522 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we compare the state-of-the-art algorithms for text-independent speaker identification under adverse far-field recording conditions with extremely short training and testing utterances. The algorithms include both the generative and discriminative methods. For the generative methods, three variants of the original Gaussian Mixture Model (GMM) and the Universal Background Model adapted Gaussian Mixture Model (UBM-GMM) are involved. For the discriminative methods, two kernel-based algorithms, namely, the Support Vector Machine (SVM) and the Relevance Vector Machine (RVM), are considered. The comparison mainly focuses on the speaker identification accuracy and the speed of the individual algorithms (for both training and testing) as well as the sparseness of the resulting model for the kernel-based methods. Finally, we demonstrate through experiments that multi-channel fusion of the far-field recordings yields improved performance across all the above algorithms.
引用
收藏
页码:796 / +
页数:2
相关论文
共 23 条
  • [1] Robust Far-Field Speaker Identification under Mismatched Conditions
    Jin, Qin
    Schultz, Tanja
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1893 - 1896
  • [2] Modulation Spectral Features for Robust Far-Field Speaker Identification
    Falk, Tiago H.
    Chan, Wai-Yip
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 90 - 100
  • [3] Spectro-Temporal Features for Robust Far-Field Speaker Identification
    Falk, Tiago H.
    Chan, Wai-Yip
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 634 - 637
  • [4] I-vector-based Speaker Identification with Extremely Short Utterances for Both Training and Testing
    Tsujikawa, Misaki
    Nishikawa, Tsuyoki
    Matsui, Tomoko
    2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
  • [5] Comparison and application of the far-field identification algorithms for multiple sound sources based on microphone array
    Wang, Yansong
    Yang, Chao
    Guo, Hui
    Yuan, Tao
    Wang, Yue
    INTERNATIONAL JOURNAL OF AEROACOUSTICS, 2024, : 634 - 656
  • [6] Why talk to people when you can talk to robots? Far-field speaker identification in the wild
    Humblot-Renaux, Galadrielle
    Li, Chen
    Chrysostomou, Dimitrios
    2021 30TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2021, : 272 - 278
  • [7] Analysis of the Radiated Susceptibility of a Transmission Line under Near and Far-field Conditions
    Zhao, Zhao
    Leone, Marco
    PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY - EMC EUROPE, 2016, : 889 - 893
  • [8] Unexpected measurement results of 94 GHz lens antenna in short far-field conditions
    Vaaja, M.
    Lanteri, J.
    Migliaccio, C.
    Ala-Laurinaho, J.
    Raisanen, A. V.
    ELECTRONICS LETTERS, 2009, 45 (14) : 725 - 726
  • [9] CRACK INITIATION UNDER FAR-FIELD CYCLIC COMPRESSION AND THE STUDY OF SHORT FATIGUE CRACKS
    CHRISTMAN, T
    SURESH, S
    ENGINEERING FRACTURE MECHANICS, 1986, 23 (06) : 953 - 964
  • [10] Residual Time-restricted Self-Attentive TDNN Speaker Embedding for Noisy and Far-field Conditions
    Benhafid, Zhor
    Selouani, Sid Ahmed
    Yakoub, Mohammed Sidi
    Amrouche, Abderrahmane
    2022 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2022, : 471 - 476