Research on Identity Recognition Algorithm Based on Speech Signal

Cited by: 0

Authors:

Cai, Chengtao [1]

Liu, Fan [1]

Affiliations:

[1] Harbin Engn Univ, Coll Automat, Harbin 150001, Peoples R China

Source:

PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019) | 2019

Keywords:

Speaker identification; Convolutional Recurrent Neural Network; Spectrogram; Text-independent;

DOI:

10.1109/ccdc.2019.8833151

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

The purpose of speaker recognition is to determine a speaker's identity from his or her speech, and the technology has been studied for many years. This article proposes an improved convolutional recurrent neural network (CNN-LSTM), which consists of a convolutional neural network and a recurrent neural network, for speaker recognition. After the original speech is converted into a grayscale spectrogram, features are extracted by the optimized CNN structure. The output of the LSTM is fed into two fully connected layers, and the output of the fully connected layers is then classified. Training the CNN-LSTM network on speech yields a model with high recognition accuracy. Other speech is then input into the trained model; if the output meets the established accuracy, the speaker's identity is identified. CNN-LSTM achieves better recognition accuracy than the CNN-DNN structure, with the recognition rate increasing by about 4%. Converting a speech signal into a spectrogram makes text-independent speaker recognition easy to implement. We added L2 regularization to the final classification layer in CNN-LSTM and a normalization layer after each layer of the network. In addition, the Adam optimizer and a GaussianNoise layer were added. With the combination of these four methods, the accuracy of the original model increases from 80% to 92%. The improved network makes text-independent speaker recognition easier to implement than traditional identification methods, and it is superior to the unmodified CNN structure. A satisfactory recognition rate can be achieved without an overly complex neural network model.
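
The abstract does not say how the grayscale spectrogram is computed, so the following is only a minimal sketch of one common way to do that preprocessing step. The function name `grayscale_spectrogram`, the frame length, the hop size, the Hann window, the dB scale, and the min-max scaling to 0-255 are all assumptions made for illustration, not the authors' settings.

```python
import numpy as np


def grayscale_spectrogram(signal, frame_len=256, hop=128):
    """Turn a 1-D waveform into a uint8 image of shape (freq_bins, frames).

    All parameter values here are illustrative assumptions; the abstract
    does not state the ones used in the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")

    # Slice the waveform into overlapping frames and apply a Hann window.
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )

    # One-sided FFT magnitude per frame, on a logarithmic (dB) scale.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    log_mag = 20.0 * np.log10(magnitude + 1e-10)

    # Min-max scale to 0..255 so the result can be stored as a grey image.
    lo, hi = log_mag.min(), log_mag.max()
    scaled = (log_mag - lo) / (hi - lo) if hi > lo else np.zeros_like(log_mag)
    return np.round(255.0 * scaled).astype(np.uint8).T


if __name__ == "__main__":
    sample_rate = 8000  # hypothetical sampling rate for the demo signal
    t = np.arange(sample_rate) / sample_rate
    tone = np.sin(2 * np.pi * 1000 * t)  # one second of a 1 kHz test tone
    image = grayscale_spectrogram(tone)
    print(image.shape, image.dtype)
```

An image like this depends only on the time-frequency content of the utterance, not on a transcript of it, which is consistent with the abstract's remark that spectrogram input suits text-independent recognition. In the pipeline described above, such an image would be the input to the CNN.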

Pages: 1085 - 1090

Page count: 6

50 records in total

[1] Unvoiced Speech Recognition Algorithm Based on Myoelectric Signal
He, Jianrong
Wang, Xin'an
Zhang, Xing
Wang, Bo
Li, Qiuping
Qiu, Changpei
[J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 450 - 456
[2] The research for speech signal enhancement based on MLMS algorithm
Yan, Xu
Xiaosuo, Wu
Tianbing, He
[J]. 2007 International Symposium on Computer Science & Technology, Proceedings, 2007, : 258 - 262
[3] Speech Signal Recognition Based on Genetic Algorithm and Fisher Projection
Wang, Xu
Han, Zhiyan
Wang, Jian
Li, Kaiyu
[J]. 2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2546 - 2549
[4] Research on the Algorithm of Tibetan Speech Recognition based on DBN
Pan, Xiuqin
Xu, Xiaona
Zhang, Hong
Zhao, Yue
Cao, Yongcun
[J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 412 - 415
[5] Research on speech recognition algorithm based on HTK toolbox
Wang, Lei
[J]. PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 184 - 186
[6] An algorithm for robust signal modelling in speech recognition
Vergin, R
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 969 - 972
[7] Research on English language learning algorithm based on speech recognition
[J]. Liu, Jinping, 1600, TeknoScienze, Viale Brianza,22, Milano, 20127, Italy (28):
[8] Research on English Language Learning Algorithm Based on Speech Recognition
Liu, Jinping
[J]. AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03) : 2653 - 2656
[9] Geometry Analysis and Recognition Research of Speech Signal
Wan Xianbao
Xu Chunyan
Chen Yong
Pan Xiaoxia
Wang Shoujue
[J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1219 - 1222
[10] Sound signal analysis in Japanese speech recognition based on deep learning algorithm
Yang, Xiaoxing
[J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023,
