Optimization of Gabor features for text-independent speaker identification

Cited by: 3
Authors
Mildner, Volker [1]
Goetze, Stefan [1]
Kammeyer, Karl-Dirk [1]
Mertins, Alfred [2]
Affiliations
[1] Univ Bremen, Dept Commun Engn, D-28334 Bremen, Germany
[2] Carl von Ossietzky Univ Oldenburg, Signal Proc Grp, D-26111 Oldenburg, Germany
DOI
10.1109/ISCAS.2007.378660
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
For text-independent speaker identification, a prominent approach is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account, the time differences of the features of adjacent speech frames are appended to the initial features. In this paper, we investigate the applicability of spectro-temporal features obtained from Gabor filters and present an algorithm for optimizing the filter parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.
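The baseline mentioned in the abstract, appending time differences of adjacent frames to the initial features, can be illustrated with a minimal sketch. The record does not state how the differences are computed, so the central difference, the edge handling, and the function name `append_deltas` below are assumptions for illustration only, not the authors' method.

```python
def append_deltas(frames):
    """Append first-order time differences to each feature vector.

    frames: list of equal-length feature vectors, one per speech frame.
    Assumption (not from the paper): the difference for frame t is
    (frames[t+1] - frames[t-1]) / 2, with the first and last frames
    repeated at the edges.
    """
    n = len(frames)
    augmented = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]       # clamp at the first frame
        nxt = frames[min(t + 1, n - 1)]    # clamp at the last frame
        delta = [(b - a) / 2.0 for a, b in zip(prev, nxt)]
        augmented.append(list(frames[t]) + delta)
    return augmented


# Toy input: three frames with two made-up feature values each.
features = [[1.0, 10.0], [2.0, 8.0], [4.0, 6.0]]
augmented = append_deltas(features)
```

Each output vector is twice as long as its input: the original features followed by their temporal differences. Such features capture change along time only, which is the "purely temporal" baseline that the abstract contrasts with spectro-temporal Gabor features.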
Pages: 3932 / +
Page count: 2