Convolution Neural Network Based Visual Speech Recognition System for Syllable Identification

被引:0
|
作者
Pahuja H. [1 ,2 ]
Ranjan P. [1 ]
Ujlayan A. [3 ]
Goyal A. [4 ]
机构
[1] Department of Electronics and Communication Engineering, Amity University, Uttar Pradesh, Noida
[2] Department of Electronics and Communication Engineering, KIET Group of Institutions, Delhi-NCR, Uttar Pradesh, Ghaziabad
[3] Department of Mathematics, Gautam Buddha University, Uttar Pradesh, Greater Noida
[4] Department of Electrical Engineering and Computer Science, Texas A&M University-Kingsville, Kingsville, 78363, Texas, TX
关键词
Convolution Neural Network (CNN); Random Test Image (RTI); Region Of Interest (ROI); Separate Test Image (STI); Syllables; Visual Speech Recognition (VSR);
D O I
10.2174/2666255813999200917142628
中图分类号
学科分类号
摘要
Introduction: This paper introduces a novel and reliable approach for people with speech impairment to assist them in communicating effectively in real-time. A deep learning technique named as convolution neural network is used as its classifier. With the help of this algorithm, words are recognized from an input which is visual speech, disregarding the audible or acoustic property. Methods: This network extracts the features from mouth movements and different images, respec-tively. With the help of a source, non-audible mouth movements are taken as an input and then seg-regated as subsets to get the desired output. The Complete Datum is then arranged to recognize the word as an affricate. Results: Convolution neural network is one of the most effective algorithms that extract features, perform classification and provides the desired output from the input images for the speech recognition system. Conclusion: Recognizing the syllables at real-time from visual mouth movement input is the main objective of the proposed method. When the proposed system was tested, datum accuracy and quan-tity of training sets proved to be satisfactory. A small set of datum is taken as the first step of learn-ing. In future, a large set of datum can be considered for analyzing the data. Discussion: On the basis of the type of datum, the network proposed in this paper is tested for its precision level. A network is maintained to identify the syllables, but it fails when syllables are of the same set. There is a requirement of a higher end graphics processing units to reduce the time con-sumption and increase the efficiency of a network. © 2022 Bentham Science Publishers.
引用
收藏
页码:139 / 150
页数:11
相关论文
共 50 条
  • [21] A Syllable-Based Turkish Speech Recognition System by Using Time Delay Neural Networks (TDNNs)
    Can, Burcu
    Artuner, Harun
    [J]. 2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 219 - 224
  • [22] Facial Expression Recognition Based on Convolution Neural Network
    Duan, Yue
    Zhou, Linli
    Wu, Yue
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 339 - 343
  • [23] Buckwheat Disease Recognition Based on Convolution Neural Network
    Liu, Xiaojuan
    Zhou, Shangbo
    Chen, Shanxiong
    Yi, Zelin
    Pan, Hongyu
    Yao, Rui
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [24] Emotion Recognition Algorithm Based on Convolution Neural Network
    Cheng, Chunling
    Wei, Xianwei
    Jian, Zhou
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [25] The Ancient Pictogram Recognition Based on Convolution Neural Network
    Cui, Qiao
    Zheng, Yutong
    [J]. 2017 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2017, : 97 - 99
  • [26] Hand gesture recognition based on convolution neural network
    Gongfa Li
    Heng Tang
    Ying Sun
    Jianyi Kong
    Guozhang Jiang
    Du Jiang
    Bo Tao
    Shuang Xu
    Honghai Liu
    [J]. Cluster Computing, 2019, 22 : 2719 - 2729
  • [27] Hand gesture recognition based on convolution neural network
    Li, Gongfa
    Tang, Heng
    Sun, Ying
    Kong, Jianyi
    Jiang, Guozhang
    Jiang, Du
    Tao, Bo
    Xu, Shuang
    Liu, Honghai
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02): : S2719 - S2729
  • [28] Deep Neural Networks for Syllable based Acoustic Modeling in Chinese Speech Recognition
    Li, Xiangang
    Hong, Caifu
    Yang, Yuning
    Wu, Xihong
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [29] Audio-Visual (Multimodal) Speech Recognition System Using Deep Neural Network
    Paulin, Hebsibah
    Milton, R. S.
    JanakiRaman, S.
    Chandraprabha, K.
    [J]. JOURNAL OF TESTING AND EVALUATION, 2019, 47 (06) : 3963 - 3974
  • [30] A hybrid neural network based speech recognition system for pervasive environments
    Sehgal, MSB
    Gondal, I
    Dooley, L
    [J]. INMIC 2004: 8th International Multitopic Conference, Proceedings, 2004, : 309 - 314