Fusion of a complementary feature set with MFCC for improved closed set text-independent speaker identification

Authors
Chakroborty, Sandipan [1]
Roy, Anindya [1]
Saha, Goutam [2]
Affiliations
[1] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India
[2] Univ South Calif, Dept Biomed Engn, Los Angeles, CA 90089 USA
DOI
Not available
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
A state-of-the-art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme that gives a generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC), modeled on the human auditory system, have been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter bank, MFCC captures vocal tract characteristics more effectively in the lower frequency regions. This work proposes a new set of features using a complementary filter bank structure, which improves the distinguishability of speaker-specific cues present in the higher frequency zone. Unlike high-level features, which are difficult to extract, the proposed feature set involves little computational burden during the extraction process. When combined with MFCC via a parallel implementation of speaker models, the proposed feature set improves on the baseline performance of the MFCC-based system. The proposition is validated by experiments conducted on two different kinds of databases, namely YOHO (microphone speech) and POLYCOST (telephone speech), with a Gaussian Mixture Model (GMM) as the classifier for various model orders.
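The abstract does not spell out how the complementary filter bank is built or how the two model outputs are combined, so the following is only a minimal sketch under stated assumptions. It assumes the complementary bank can be obtained by flipping a triangular mel filter bank along the frequency axis, so that the narrow filters sit at high frequencies, and it assumes the parallel models are fused by a weighted sum of per-speaker scores. The function names, the filter count, the FFT size, the sample rate, and the fusion weight are all hypothetical choices for illustration and are not taken from the paper.

```python
import numpy as np


def hz_to_mel(f):
    """Convert frequency in Hz to the mel scale (common 2595*log10 form)."""
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def mel_filter_bank(n_filters=20, n_fft=512, sample_rate=8000):
    """Triangular mel filter bank, shape (n_filters, n_fft // 2 + 1).

    All default values are illustrative assumptions.
    """
    n_bins = n_fft // 2 + 1
    nyquist = sample_rate / 2.0
    # n_filters + 2 edge frequencies, equally spaced on the mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(nyquist), n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    bin_freqs = np.linspace(0.0, nyquist, n_bins)

    bank = np.zeros((n_filters, n_bins))
    for i in range(n_filters):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (bin_freqs - left) / (center - left)
        falling = (right - bin_freqs) / (right - center)
        # Triangle: minimum of the two slopes, clipped at zero.
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def complementary_filter_bank(bank):
    """Assumed construction: mirror the mel bank in frequency.

    Reversing both axes keeps filters ordered from low to high frequency
    while placing the narrow, dense filters at the high-frequency end.
    """
    return bank[::-1, ::-1].copy()


def fuse_scores(scores_a, scores_b, weight=0.5):
    """Assumed fusion rule: weighted sum of per-speaker scores."""
    a = np.asarray(scores_a, dtype=float)
    b = np.asarray(scores_b, dtype=float)
    return weight * a + (1.0 - weight) * b


mel_bank = mel_filter_bank()
comp_bank = complementary_filter_bank(mel_bank)

# Hypothetical per-speaker log-likelihoods from two parallel models.
fused = fuse_scores([-10.0, -12.0], [-11.0, -8.0], weight=0.5)
identified_speaker = int(np.argmax(fused))
```

In a full system each bank would feed its own cepstral feature stream, each stream would train its own set of speaker models, and the fusion step would run on the scores those models produce for a test utterance. The equal weight used here is a placeholder; in practice it would be tuned on held-out data.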
Pages: 2914 / +
Page count: 2