Fusion of a complementary feature set with MFCC for improved closed set text-independent speaker identification

被引：0

作者：

Chakroborty, Sandipan ^{[1
]}

Roy, Anindya ^{[1
]}

Saha, Goutam ^{[2
]}

机构：

[1] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India

[2] Univ South Calif, Dept Biomed Engn, Los Angeles, CA 90089 USA

来源：

2006 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-6 | 2006年

关键词：

D O I：

暂无

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system have been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter bank, it captures vocal tract characteristics more effectively in the lower frequency regions. This work proposes a new set of features using a complementary filter bank structure which improves distinguishability of speaker specific cues present in the higher frequency zone. Unlike high level features that are difficult to extract, the proposed feature set involves little computational burden during the extraction process. When combined with MFCC via a parallel implementation of speaker models, the proposed feature improves performance baseline of MFCC based system. The proposition is validated by experiments conducted on two different kinds of databases namely YOHO (microphone speech) and POLYCOST (telephone speech) with Gaussian Mixture Model (GMM) as a classifier for various model orders.

引用

页码：2914 / +

页数：2

共 50 条

[1] Multi-feature Fusion for Closed Set Text Independent Speaker Identification
Verma, Gyanendra K.
[J]. INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT, 2011, 141 : 170 - 179
[2] A New Set of Features for Text-Independent Speaker Identification
Espy-Wilson, Carol Y.
Manocha, Sandeep
Vishnubhotla, Srikanth
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1475 - +
[3] An Improved Approach to Open Set Text-Independent Speaker Identification (OSTI-SI)
Chakraborty, ShrutiSarika
Parekh, Ranjan
[J]. 2017 THIRD IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2017, : 51 - 56
[4] Text-independent speaker identification using backpropacration MLP network classifier for a closed set of speakers
Sharma, A
Singh, SP
Kumar, V
[J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), VOLS 1 AND 2, 2005, : 665 - 669
[5] Closed-Set Text-Independent Speaker Identification System Using Multiple ANN Classifiers
Dutta, Munmi
Patgiri, Chayashree
Sarma, Mousmita
Sarma, Kandarpa Kumar
[J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 377 - 385
[6] Combining Dynamic Features with MFCC for Text-independent Speaker Identification
Chaudhari, Amol
Rahulkar, Amol
Dhonde, S. B.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICIP), 2015, : 160 - 164
[7] Text-Independent Speaker Identification by Combining MFCC and MVA Features
Korba, Mohamed Cherif Amara
Bourouba, Houcine
Rafik, Djemili
[J]. 2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
[8] Capturing complementary information via reversed filter bank and parallel implementation with MFCC for improved text-independent speaker identification
Chakroborty, Sandipan
Roy, Anindya
Majumdar, Sourav
Saha, Goutam
[J]. ICCTA 2007: INTERNATIONAL CONFERENCE ON COMPUTING: THEORY AND APPLICATIONS, PROCEEDINGS, 2007, : 463 - +
[9] Higher order information set based features for text-independent speaker identification
Medikonda J.
Madasu H.
[J]. International Journal of Speech Technology, 2018, 21 (03) : 451 - 461
[10] Toward open-set text-independent speaker identification in tactical communications
Wolf, Matt B.
Park, WonKyung
Oh, Jae C.
Blowers, Misty K.
[J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN SECURITY AND DEFENSE APPLICATIONS, 2007, : 7 - +

← 1 2 3 4 5 →