Feature windowing-based for Thai text-dependent speaker identification using MLP with backpropagation algorithm

被引:0
|
作者
Sae-Tang, S [1 ]
Tanprasert, C [1 ]
机构
[1] Minist Sci Technol & Environm, Natl Sci & Technol Dev Agcy, Natl Elect & Comp Technol Ctr, Software & Language Engn Lab, Bangkok 10400, Thailand
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the development of Thai text-dependent speaker identification system by applying two feature-feeding approaches. A well-known multilayer perceptron (MLP) network with backpropagation learning algorithm is chosen due to its fast processing time and good performance for pattern recognition problems. But MLP has a limitation in that a network must have a fixed amount of input nodes. Therefore, the linear interpolation time normalization is chosen to adjust the input speech signal into a fixed size of input vector. Furthermore, the windowing technique is developed to avoid the distortion caused by a time normalization process. A fixed size window is sliced through the preprocessed features with fixed amount of overlapping frames. The high identification rate observed in experiments confirms that the developed windowing is suitable for the proposed Thai text-dependent speaker identification system.
引用
收藏
页码:579 / 582
页数:4
相关论文
共 50 条
  • [1] Text-dependent speaker identification using spectrograms based on conditional quantization
    Dutta, Tridibesh
    [J]. PROCEEDINGS OF THE IMAGE MINING THEORY AND APPLICATIONS, 2008, : 133 - 142
  • [2] Deep feature for text-dependent speaker verification
    Liu, Yuan
    Qian, Yanmin
    Chen, Nanxin
    Fu, Tianfan
    Zhang, Ya
    Yu, Kai
    [J]. SPEECH COMMUNICATION, 2015, 73 : 1 - 13
  • [3] Covariance Based Deep Feature for Text-Dependent Speaker Verification
    Wang, Shuai
    Dinkel, Heinrich
    Qian, Yanmin
    Yu, Kai
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 231 - 242
  • [4] Text-dependent speaker identification using fisher differentiation vector
    Li, B
    Liu, WJ
    Zhong, QH
    [J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 309 - 314
  • [5] Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition
    Li, Lantian
    Lin, Yiye
    Zhang, Zhiyong
    Wang, Dong
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 426 - 429
  • [6] Dynamic time warping based approach to text-dependent speaker identification using spectrograms
    Dutta, Tridibesh
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 2, PROCEEDINGS, 2008, : 354 - 360
  • [7] A modified HME architecture for text-dependent speaker identification
    Chen, K
    Xie, DH
    Chi, HS
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (05): : 1309 - 1313
  • [8] EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Chen, Liping
    Zhao, Yong
    Zhang, Shi-Xiong
    Li, Jie
    Ye, Guoli
    Soong, Frank
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5364 - 5368
  • [9] Text-dependent speaker recognition using speaker specific compensation
    Laxman, S
    Sastry, PS
    [J]. IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 384 - 387
  • [10] DNN BASED SPEAKER EMBEDDING USING CONTENT INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Dey, Subhadeep
    Koshinaka, Takafumi
    Motlicek, Petr
    Madikeri, Srikanth
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5344 - 5348