Research on Identity Recognition Algorithm Based on Speech Signal

被引:0
|
作者
Cai, Chengtao [1 ]
Liu, Fan [1 ]
机构
[1] Harbin Engn Univ, Coll Automat, Harbin 150001, Peoples R China
关键词
Speaker identification; Convolutional Recurrent Neural Network; Spectrogram; Text-independent;
D O I
10.1109/ccdc.2019.8833151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of speaker recognition technology is to identify the identity of the speaker through the speaker's speech. Speaker recognition technology has been studied for many years. An improved convolutional recurrent neural network is proposed to realize the speaker recognition technology in this article. CNN-LSTM consists with convolutional neural network and recurrent neural network. After the original speech is processed into a grayscale spectrogram, the features are extracted by the optimized CNN structure. The output of the LSTM will be input into the two fully connected layer. After the fully connected layer, the output is sorted. When speech is trained through the CNN-LSTM network, a model with high recognition accuracy will be obtained. Other speech is input into the trained model. If the output meets the Established accuracy, the identity of the speaker is identified. CNN-LSTM has better recognition accuracy than CNN-DNN structure. The recognition rate has increased by about 4%. Converting a speech signal into a spectrogram is easy to implement text-independent speaker recognition technology. We added L2 regularization to the final classification layer in CNN-LSTM. After each layer of the network, we added the nomalization layer. At the same time, the Adam optimizer and the GaussianNoise layer are added.. The accuracy of the original model increases by 80% to 92% with the combination of these four methods. This improved network makes it easy to implement text-independent speaker recognition techniques than traditional identification method. It's superior to unmodified CNN structure. A satisfactory recognition rate can be achieved without using an overly complex neural network model.
引用
收藏
页码:1085 / 1090
页数:6
相关论文
共 50 条
  • [1] Unvoiced Speech Recognition Algorithm Based on Myoelectric Signal
    He, Jianrong
    Wang, Xin'an
    Zhang, Xing
    Wang, Bo
    Li, Qiuping
    Qiu, Changpei
    [J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 450 - 456
  • [2] The research for speech signal enhancement based on MLMS algorithm
    Yan, Xu
    Xiaosuo, Wu
    Tianbing, He
    [J]. 2007 International Symposium on Computer Science & Technology, Proceedings, 2007, : 258 - 262
  • [3] Speech Signal Recognition Based on Genetic Algorithm and Fisher Projection
    Wang, Xu
    Han, Zhiyan
    Wang, Jian
    Li, Kaiyu
    [J]. 2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2546 - 2549
  • [4] Research on the Algorithm of Tibetan Speech Recognition based on DBN
    Pan, Xiuqin
    Xu, Xiaona
    Zhang, Hong
    Zhao, Yue
    Cao, Yongcun
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 412 - 415
  • [5] Research on speech recognition algorithm based on HTK toolbox
    Wang, Lei
    [J]. PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 184 - 186
  • [6] An algorithm for robust signal modelling in speech recognition
    Vergin, R
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 969 - 972
  • [7] Research on English language learning algorithm based on speech recognition
    [J]. Liu, Jinping, 1600, TeknoScienze, Viale Brianza,22, Milano, 20127, Italy (28):
  • [8] Research on English Language Learning Algorithm Based on Speech Recognition
    Liu, Jinping
    [J]. AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03): : 2653 - 2656
  • [9] Geometry Analysis and Recognition Research of Speech Signal
    Wan Xianbao
    Xu Chunyan
    Chen Yong
    Pan Xiaoxia
    Wang Shoujue
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1219 - 1222
  • [10] Sound signal analysis in Japanese speech recognition based on deep learning algorithm
    Yang, Xiaoxing
    [J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023,