Lip-reading via a DNN-HMM Hybrid System Using Combination of The Image-based and Model-based Features

Authors
Rahmani, Mohammad Hasan [1]
Almasganj, Farshad [1]
Affiliations
[1] Amirkabir Univ Technol, Tehran Polytech, Biomed Engn Dept, Tehran, Iran
Keywords
lip-reading; feature extraction; deep auto-encoder; DBNF; neural networks
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Finding features that better represent a speaker's visual information during speech production remains an open issue that strongly affects the quality of lip-reading and Audio-Visual Speech Recognition (AVSR). In this paper, three types of visual features, drawn from both image-based and model-based approaches, are investigated in a professional lip-reading task. The three feature sets compared are the raw gray-level information of the lip Region of Interest (ROI), a geometric representation of the lip shape, and the Deep Bottleneck Features (DBNFs) extracted from a 6-layer Deep Auto-encoder Neural Network (DANN). Two recognition systems, the conventional GMM-HMM and the state-of-the-art DNN-HMM hybrid, are used to perform an isolated and connected digit recognition task. The results indicate that the high-level information extracted from the deep layers over the lip ROI can represent the visual modality with the advantage of a "high amount of information in a low dimension feature vector". Moreover, over the test data, the DBNFs showed an average relative improvement of 15.4% over the shape features, and the shape features showed an average relative improvement of 20.4% over the ROI features.
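To make the idea of bottleneck features concrete, here is a minimal NumPy sketch of how a low-dimensional feature vector could be read from the narrow middle layer of a deep auto-encoder fed with flattened lip-ROI frames. It is an illustration only, not the authors' implementation: the layer sizes, the 32x32 ROI, the sigmoid activation, the way the six layers are counted, and the function names `init_autoencoder`, `bottleneck_features`, and `reconstruct` are all assumptions, and the weights are random rather than trained.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic activation (an assumed choice)."""
    return 1.0 / (1.0 + np.exp(-x))


def init_autoencoder(layer_sizes, seed=0):
    """Create small random weights for each consecutive pair of layer sizes."""
    rng = np.random.default_rng(seed)
    params = []
    for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:]):
        weights = rng.normal(0.0, 0.01, size=(n_in, n_out))
        bias = np.zeros(n_out)
        params.append((weights, bias))
    return params


def bottleneck_features(frames, params, bottleneck_index):
    """Run only the encoder half and return the bottleneck activations."""
    hidden = frames
    for weights, bias in params[:bottleneck_index]:
        hidden = sigmoid(hidden @ weights + bias)
    return hidden


def reconstruct(frames, params):
    """Run the full auto-encoder (the path used when training on reconstruction)."""
    hidden = frames
    for weights, bias in params:
        hidden = sigmoid(hidden @ weights + bias)
    return hidden


# Hypothetical sizes: a 32x32 gray-level ROI (1024 values) squeezed to 40 values.
# Six weight layers in total: three encoding, three decoding.
LAYER_SIZES = [1024, 512, 128, 40, 128, 512, 1024]

params = init_autoencoder(LAYER_SIZES)
roi_frames = np.random.default_rng(1).random((10, 1024))  # 10 stand-in video frames

dbnf = bottleneck_features(roi_frames, params, bottleneck_index=3)
recon = reconstruct(roi_frames, params)
print(dbnf.shape)   # (10, 40)
print(recon.shape)  # (10, 1024)
```

In a setup like the one the abstract describes, the per-frame bottleneck vectors would then serve as observations for the GMM-HMM or DNN-HMM recognizer, in place of the much larger raw ROI vector.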
Pages: 195-199
Page count: 5