Optimization of Gabor features for text-independent speaker identification

被引：3

作者：

Mildner, Volker ^{[1
]}

Goetze, Stefan ^{[1
]}

Kammeyer, Karl-Dirk ^{[1
]}

Mertins, Alfred ^{[2
]}

机构：

[1] Univ Bremen, Dept Commun Engn, D-28334 Bremen, Germany

[2] Carl von Ossietzky Univ Oldenburg, Signal Proc Grp, D-26111 Oldenburg, Germany

来源：

2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11 | 2007年

关键词：

D O I：

10.1109/ISCAS.2007.378660

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For text-independent speaker identification a prominent combination is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account the time difference of features of adjacent speech frames are appended to the initial features. In this paper we investigate the applicability of spectro-temporal features obtained from Gabor-Filters and present an algorithm for optimizing the possible parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.

引用

页码：3932 / +

页数：2

共 50 条

[1] A New Set of Features for Text-Independent Speaker Identification
Espy-Wilson, Carol Y.
Manocha, Sandeep
Vishnubhotla, Srikanth
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1475 - +
[2] Text-independent speaker identification
Gish, Herbert
Schmidt, Michael
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 1994, 11 (04) : 18 - 32
[3] Combining Dynamic Features with MFCC for Text-independent Speaker Identification
Chaudhari, Amol
Rahulkar, Amol
Dhonde, S. B.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICIP), 2015, : 160 - 164
[4] Text-Independent Speaker Identification by Combining MFCC and MVA Features
Korba, Mohamed Cherif Amara
Bourouba, Houcine
Rafik, Djemili
[J]. 2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
[5] Text-independent Speaker Identification in Birds
Fox, E. J. S.
Roberts, J. D.
Bennamoun, M.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2122 - 2125
[6] Text-independent speaker identification in noisy background
Zhou, Y
Xu, BL
[J]. PROGRESS IN NATURAL SCIENCE, 2001, 11 : S384 - S387
[7] Voice text-independent system for speaker identification
Babenko, LK
Makarevich, OB
Fedorov, VM
Yurkov, PY
[J]. IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOELEKTRONIKA, 2004, 47 (3-4): : 66 - 70
[8] On the use of Classifiers for Text-independent Speaker Identification
Jawarkar, Naresh P.
Holambe, Raghunath S.
Basu, Tapan Kumar
[J]. 2014 FIRST INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL, ENERGY & SYSTEMS (ACES-14), 2014, : 238 - 242
[9] Higher order information set based features for text-independent speaker identification
Medikonda J.
Madasu H.
[J]. International Journal of Speech Technology, 2018, 21 (03) : 451 - 461
[10] HISTOGRAM TRANSFORM MODEL USING MFCC FEATURES FOR TEXT-INDEPENDENT SPEAKER IDENTIFICATION
Yu, Hong
Ma, Zhanyu
Li, Minyue
Guo, Jun
[J]. CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 500 - 504

← 1 2 3 4 5 →