Dynamic time warping based approach to text-dependent speaker identification using spectrograms

被引:13
|
作者
Dutta, Tridibesh [1 ]
机构
[1] Indian Stat Inst, Kolkata, India
关键词
D O I
10.1109/CISP.2008.560
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The goal of this paper is to study a new approach to text dependent speaker identification using the complex patterns of variation infrequency and amplitude with time while an individual utters a given word through spectrogram segmentation and template matching. The optimally segmented spectrograms are used as a database to successfully identify, the unknown individual from his/her voice. The methodology used for identifying, rely on classification of spectrograms (of speech signals), based on dynamic time warping (DTW) matching of conditionally quantized frequency-time domain features of the database samples and the unknown speech sample. Experimental results on a sample collected from 40 speakers show that this methodology can be effectively used to produce a desirable success rate.
引用
收藏
页码:354 / 360
页数:7
相关论文
共 50 条
  • [1] Text-dependent speaker identification using spectrograms based on conditional quantization
    Dutta, Tridibesh
    [J]. PROCEEDINGS OF THE IMAGE MINING THEORY AND APPLICATIONS, 2008, : 133 - 142
  • [2] APPLICATION OF DYNAMIC TIME WARPING AND CEPSTROGRAMS TO TEXT-DEPENDENT SPEAKER VERIFICATION
    Kaczmarek, Andrzej
    Staworko, Michal
    [J]. SPA 2009: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2009, : 169 - +
  • [3] Text-dependent speaker identification using fisher differentiation vector
    Li, B
    Liu, WJ
    Zhong, QH
    [J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 309 - 314
  • [4] A dynamic-threshold approach to text-dependent speaker recognition using principles of Immune System
    Dey, Subhomoy
    Kashyap, Kishore
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [5] A modified HME architecture for text-dependent speaker identification
    Chen, K
    Xie, DH
    Chi, HS
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (05): : 1309 - 1313
  • [6] Text-dependent speaker recognition using speaker specific compensation
    Laxman, S
    Sastry, PS
    [J]. IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 384 - 387
  • [7] DNN BASED SPEAKER EMBEDDING USING CONTENT INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Dey, Subhadeep
    Koshinaka, Takafumi
    Motlicek, Petr
    Madikeri, Srikanth
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5344 - 5348
  • [8] Noise-robust text-dependent speaker identification using cochlear models
    Islam, Md. Atiqul
    Xu, Ying
    Monk, Travis
    Afshar, Saeed
    van Schaik, Andre
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (01): : 500 - 516
  • [9] Text-dependent speaker identification based on Input/Output HMMs: An empirical study
    Chen, K
    Xie, DH
    Chi, HS
    [J]. NEURAL PROCESSING LETTERS, 1996, 3 (02) : 81 - 89
  • [10] The Text-Dependent Chinese Speaker Recognition System Based on the Universal Individual Identification
    Wang, Lili
    Li, Zhihua
    Chen, Kai
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021), 2021, : 58 - 64