Research on Speech Accurate Recognition Technology Based on Deep Learning DNN-HMM

被引:0
|
作者
Xia Wanyu [1 ,2 ]
Qiu Wu [3 ]
Feng Xiancheng [1 ,2 ]
机构
[1] Wuhan Inst Technol, Coll Elect & Elect Engn, Wuhan 430205, Peoples R China
[2] Hubei Engn Res Ctr Video Image & HD Project, Wuhan 430205, Peoples R China
[3] Hubei Yingtong Telecommun Cable Co Ltd, Tongcheng 437400, Peoples R China
关键词
Speech Accurate Recognition; Deep Learning; DNN-HMM; Speech quality assessment;
D O I
10.1117/12.2539467
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, with the rapid development of artificial intelligence technology, human auditory intelligence perception has received extensive attention. The human-like auditory intelligent speech separation of robots in complex acoustic environment is studied. Through in-depth learning of key technologies such as DNN-HMM, a new deep network cluster structure, optimization objectives and deep learning algorithm capable of denoising in complex frequency domain are proposed to improve the accuracy of speech recognition, solve the problem of speech separation in human-like hearing in harsh environments, realize high-quality auditory perception in real environments, and enhance intelligence in far-field and complex acoustic environments. Human-computer interaction performance.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] DNN-HMM based Automatic Speech Recognition for HRI Scenarios
    Novoa, Jose
    Wuth, Jorge
    Pablo Escudero, Juan
    Fredes, Josue
    Mahu, Rodrigo
    Becerra Yoma, Nestor
    [J]. HRI '18: PROCEEDINGS OF THE 2018 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2018, : 150 - 159
  • [2] Hybrid Deep Neural Network - Hidden Markov Model (DNN-HMM) Based Speech Emotion Recognition
    Li, Longfei
    Zhao, Yong
    Jiang, Dongmei
    Zhang, Yanning
    Wang, Fengna
    Gonzalez, Isabel
    Valentin, Enescu
    Sahli, Hichem
    [J]. 2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 312 - 317
  • [3] Contaminated speech training methods for robust DNN-HMM distant speech recognition
    Ravanelli, Mirco
    Omologo, Maurizio
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 756 - 760
  • [4] Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework
    Peng, Yizhou
    Zhang, Jicheng
    Zhang, Haobo
    Xu, Haihua
    Huang, Hao
    Li, Sheng
    Chng, Eng Siong
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1043 - 1048
  • [5] Comparison of syllable-based and phoneme-based DNN-HMM in Japanese Speech Recognition
    Seki, Hiroshi
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    [J]. 2014 INTERNATIONAL CONFERENCE OF ADVANCED INFORMATICS: CONCEPT, THEORY AND APPLICATION (ICAICTA), 2014, : 249 - 254
  • [6] Comparison of DCT and Autoencoder-based Features for DNN-HMM Multimodal Silent Speech Recognition
    Liu, Licheng
    Ji, Yan
    Wang, Hongcui
    Denby, Bruce
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [7] Labeling Unsegmented Sequence Data with DNN-HMM and Its Application for Speech Recognition
    Li, Xiangang
    Wu, Xihong
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 10 - 14
  • [8] Phonotactic Language Recognition Based on DNN-HMM Acoustic Model
    Liu, Wei-Wei
    Cai, Meng
    Yuan, Hua
    Shi, Xiao-Bei
    Zhang, Wei-Qiang
    Liu, Jia
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 153 - +
  • [9] Syllable based DNN-HMM Cantonese Speech-to-Text System
    Wong, Timothy
    Li, Claire W. Y.
    Lam, Sam
    Chiu, Billy
    Lu, Qin
    Li, Minglei
    Xiong, Dan
    Yu, Roy S.
    Ng, Vincent T. Y.
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3856 - 3862
  • [10] Large Vocabulary Children's Speech Recognition with DNN-HMM and SGMM Acoustic Modeling
    Giuliani, Diego
    BabaAli, Bagher
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1635 - 1639