Dynamic time warping based approach to text-dependent speaker identification using spectrograms

被引：13

作者：

Dutta, Tridibesh ^{[1
]}

机构：

[1] Indian Stat Inst, Kolkata, India

来源：

CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 2, PROCEEDINGS | 2008年

关键词：

D O I：

10.1109/CISP.2008.560

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The goal of this paper is to study a new approach to text dependent speaker identification using the complex patterns of variation infrequency and amplitude with time while an individual utters a given word through spectrogram segmentation and template matching. The optimally segmented spectrograms are used as a database to successfully identify, the unknown individual from his/her voice. The methodology used for identifying, rely on classification of spectrograms (of speech signals), based on dynamic time warping (DTW) matching of conditionally quantized frequency-time domain features of the database samples and the unknown speech sample. Experimental results on a sample collected from 40 speakers show that this methodology can be effectively used to produce a desirable success rate.

引用

页码：354 / 360

页数：7

共 50 条

[1] Text-dependent speaker identification using spectrograms based on conditional quantization
Dutta, Tridibesh
[J]. PROCEEDINGS OF THE IMAGE MINING THEORY AND APPLICATIONS, 2008, : 133 - 142
[2] APPLICATION OF DYNAMIC TIME WARPING AND CEPSTROGRAMS TO TEXT-DEPENDENT SPEAKER VERIFICATION
Kaczmarek, Andrzej
Staworko, Michal
[J]. SPA 2009: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2009, : 169 - +
[3] Text-dependent speaker identification using fisher differentiation vector
Li, B
Liu, WJ
Zhong, QH
[J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 309 - 314
[4] A dynamic-threshold approach to text-dependent speaker recognition using principles of Immune System
Dey, Subhomoy
Kashyap, Kishore
[J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
[5] A modified HME architecture for text-dependent speaker identification
Chen, K
Xie, DH
Chi, HS
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (05): : 1309 - 1313
[6] Text-dependent speaker recognition using speaker specific compensation
Laxman, S
Sastry, PS
[J]. IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 384 - 387
[7] DNN BASED SPEAKER EMBEDDING USING CONTENT INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Dey, Subhadeep
Koshinaka, Takafumi
Motlicek, Petr
Madikeri, Srikanth
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5344 - 5348
[8] Noise-robust text-dependent speaker identification using cochlear models
Islam, Md. Atiqul
Xu, Ying
Monk, Travis
Afshar, Saeed
van Schaik, Andre
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (01): : 500 - 516
[9] Text-dependent speaker identification based on Input/Output HMMs: An empirical study
Chen, K
Xie, DH
Chi, HS
[J]. NEURAL PROCESSING LETTERS, 1996, 3 (02) : 81 - 89
[10] The Text-Dependent Chinese Speaker Recognition System Based on the Universal Individual Identification
Wang, Lili
Li, Zhihua
Chen, Kai
[J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021), 2021, : 58 - 64

← 1 2 3 4 5 →