Enhancing the Performance of a GMM-based Speaker Identification System in a Multi-Microphone Setup

被引:0
|
作者
Stergiou, Andreas [1 ]
Pnevmatikakis, Aristodemos [1 ]
Polymenakos, Lazaros C. [1 ]
机构
[1] Athens Informat Technol, Auton & Grid Comp Grp, Athens, Greece
关键词
far-field speaker identification; gaussian mixture models; principal component analysis; microphone arrays;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper the speaker identification system developed at Athens Information Technology is presented. It is based on the Gaussian Mixture modeling of the Mel-Frequency Cepstral Coefficients of speech. Starting from this basic algorithm, we describe and discuss two significant modifications that have resulted in performance enhancements, in terms of both processing speed and identification accuracy. We present the performance of our system in the recent CLEAR 2006 evaluation workshop and also discuss approaches to further improve our system by fusing decisions derived from a multitude of sensors in a multi-microphone setup.
引用
收藏
页码:1463 / 1466
页数:4
相关论文
共 50 条
  • [1] A GMM-Based Speaker Identification System on FPGA
    Kan, Phak Len Eh
    Allen, Tim
    Quigley, Steven F.
    [J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
  • [2] FPGA Implementation for GMM-Based Speaker Identification
    EhKan, Phaklen
    Allen, Timothy
    Quigley, Steven F.
    [J]. INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2011, 2011
  • [3] Robust speaker identification using multi-microphone systems
    Barger, P
    Sridharan, S
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 261 - 264
  • [4] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
    Lin, Wenyong
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
  • [5] Multi-Microphone Speaker Separation based on Deep DOA Estimation
    Chazan, Shlomo E.
    Hammer, Hodaya
    Hazan, Gershon
    Goldberger, Jacob
    Gannot, Sharon
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [6] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [7] GMM-Based Maghreb Dialect Identification System
    Nour-Eddine, Lachachi
    Abdelkader, Adla
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2015, 11 (01): : 22 - 38
  • [8] Experimental Study on GMM-Based Speaker Recognition
    Ye, Wenxing
    Wu, Dapeng
    Nucci, Antonio
    [J]. MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
  • [9] Quantization for adapted GMM-based speaker verification
    Tseng, Ivy H.
    Verscheure, Olivier
    Turaga, Deepak S.
    Chaudhari, Upendra V.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 653 - 656
  • [10] A GMM-based handset selector for channel mismatch compensation with applications to speaker identification
    Yiu, KK
    Mak, MW
    Kung, SY
    [J]. ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 1132 - 1137