Lipreading model based on a two-way convolutional neural network and feature fusion

被引:1
|
作者
Zhu, Meili [1 ]
Wang, Qingqing [2 ]
Ge, Yingying [1 ]
机构
[1] Jilin Animat Inst, Sch Game, Changchun, Peoples R China
[2] Jilin Animat Inst, Sch Animat Art, Changchun, Peoples R China
关键词
visual speech recognition; bidirectional dynamic image; histogram of oriented gradients; convolutional neural network; RECOGNITION; CLASSIFICATION; IMAGE;
D O I
10.1117/1.JEI.30.6.063003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Lipreading feature extraction is essentially the feature extraction of continuous video frame sequences. A lipreading model based on a two-way convolutional neural network and features is proposed to obtain more reasonable visual-spatial-temporal characteristics. Unlike other lipreading methods based on deep learning, the rank pooling method transforms lip video into a standard RGB image that can be directly input into the convolutional neural network, which effectively reduces the input dimension. In addition, to compensate for the lack of spatial information, the apparent shape and depth features are fused, and then the joint cost function is used to guide the network model learning to obtain more distinguishing features. The experimental results were evaluated on the public GRID database and OuluVS2 database. It shows that the accuracy of the proposed method can reach more than 93%, which validates the effectiveness of the method. (C) 2021 SPIE and IS&T
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Feature Cloning and Feature Fusion Based Transportation Mode Detection Using Convolutional Neural Network
    Alam, Md. Golam Rabiul
    Haque, Mahmudul
    Hassan, Md. Rafiul
    Huda, Shamsul
    Hassan, Mohammad Mehedi
    Strickland, Fred L. L.
    AlQahtani, Salman A. A.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 4671 - 4681
  • [22] A Convolutional Graph Neural Network Model for Water Distribution Network Leakage Detection Based on Segment Feature Fusion Strategy
    Li, Xuan
    Wu, Yongqiang
    [J]. Water (Switzerland), 2024, 16 (24)
  • [23] Bearing Fault Diagnosis with a Feature Fusion Method Based on an Ensemble Convolutional Neural Network and Deep Neural Network
    Li, Hongmei
    Huang, Jinying
    Ji, Shuwei
    [J]. SENSORS, 2019, 19 (09)
  • [24] An optimal 3D convolutional neural network based lipreading method
    He, Lun
    Ding, Biyun
    Wang, Hao
    Zhang, Tao
    [J]. IET IMAGE PROCESSING, 2022, 16 (01) : 113 - 122
  • [25] A Multi-Feature Fusion Model Based on Denoising Convolutional Neural Network and Attention Mechanism for Image Classification
    Zhang, Jingsi
    Yu, Xiaosheng
    Lei, Xiaoliang
    Wu, Chengdong
    [J]. INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2023, 14 (02)
  • [26] TA-CNN: Two-way attention models in deep convolutional neural network for plant recognition
    Zhu, Youxiang
    Sun, Weiming
    Cao, Xiangying
    Wang, Chunyan
    Wu, Dongyang
    Yang, Yin
    Ye, Ning
    [J]. NEUROCOMPUTING, 2019, 365 : 191 - 200
  • [27] Advanced Feature Fusion Algorithm Based on Multiple Convolutional Neural Network for Scene Recognition
    Chen, Lei
    Bo, Kanghu
    Lee, Feifei
    Chen, Qiu
    [J]. CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2020, 122 (02): : 505 - 523
  • [28] Rolling bearing fault diagnosis based on feature fusion with parallel convolutional neural network
    Liang, Mingxuan
    Cao, Pei
    Tang, J.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2021, 112 (3-4): : 819 - 831
  • [29] Flower growth status recognition method based on feature fusion convolutional neural network
    Liu, Haiming
    Guan, Shixuan
    Lu, Weizhong
    Li, Haiou
    Wu, Hongjie
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2021, 21 (06) : 1935 - 1946
  • [30] SAR Ship Detection Based on Convolutional Neural Network with Deep Multiscale Feature Fusion
    Long, Yang
    Juan, Su
    Hua, Huang
    Xiang, Li
    [J]. ACTA OPTICA SINICA, 2020, 40 (02)