Visual speech recognition using wavelet transform and moment based features

Authors
Yau, Wai C. [1 ]
Kumar, Dinesh K. [1 ]
Arjunan, Sridhar P. [1 ]
Kumar, Sanjay [1 ]
Affiliation
[1] RMIT Univ, Sch Elect & Comp Engn, GPO Box 2476V, Melbourne, Vic 3001, Australia
Keywords
Visual Speech Recognition; Motion History Image; Discrete Stationary Wavelet Transform; Image Moments; Artificial Neural Network
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a novel vision-based approach to identifying utterances consisting of consonants. A view-based method is adopted to represent the 3-D image sequence of the mouth movement in a 2-D space using a grayscale image called a motion history image (MHI). The MHI is produced by applying an accumulative image differencing technique to the image sequence, which implicitly captures the temporal information of the mouth movement. The proposed technique combines the discrete stationary wavelet transform (SWT) and image moments to classify the MHI. A level-1 2-D SWT decomposes the MHI into one approximation sub-image and three detail sub-images. The paper reports the classification accuracy of three moment-based features, namely Zernike moments, geometric moments and Hu moments, computed from the approximation sub-image of the MHI. A supervised feed-forward multilayer perceptron (MLP) artificial neural network (ANN) trained with the back-propagation learning algorithm is used to classify the moment-based features. The performance and image representation ability of the three moment features are compared. Preliminary results show that all three moments achieve a high recognition rate in classifying 3 consonants.
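The feature-extraction steps named in the abstract (MHI by accumulative image differencing, level-1 SWT approximation, moment features) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not state the differencing threshold, the wavelet, or the moment orders, so the threshold of 0.1, the Haar wavelet with periodic borders, the restriction to the first two Hu invariants, and all function names below are assumptions chosen for illustration. The input is a synthetic moving block, not mouth video.

```python
import numpy as np

def motion_history_image(frames, threshold=0.1):
    """Accumulate binarised frame differences; more recent motion is brighter.
    The threshold value is an assumption, not taken from the paper."""
    frames = np.asarray(frames, dtype=float)
    n = frames.shape[0]
    mhi = np.zeros(frames.shape[1:])
    for t in range(1, n):
        moving = np.abs(frames[t] - frames[t - 1]) > threshold
        mhi[moving] = t  # overwrite with the most recent time index
    return mhi / (n - 1)  # scale intensities to [0, 1]

def haar_swt_approximation(image):
    """Level-1 undecimated (stationary) Haar approximation sub-image.
    Haar and periodic border handling are assumptions."""
    right = np.roll(image, -1, axis=1)
    down = np.roll(image, -1, axis=0)
    diag = np.roll(right, -1, axis=0)
    # Separable low-pass filter [1/sqrt(2), 1/sqrt(2)] along rows and columns.
    return (image + right + down + diag) / 2.0

def hu_first_two(image):
    """First two Hu invariants from normalised central geometric moments."""
    y, x = np.mgrid[:image.shape[0], :image.shape[1]]
    m00 = image.sum()
    xc = (x * image).sum() / m00
    yc = (y * image).sum() / m00

    def eta(p, q):
        mu = (((x - xc) ** p) * ((y - yc) ** q) * image).sum()
        return mu / m00 ** (1 + (p + q) / 2.0)

    phi1 = eta(2, 0) + eta(0, 2)
    phi2 = (eta(2, 0) - eta(0, 2)) ** 2 + 4 * eta(1, 1) ** 2
    return np.array([phi1, phi2])

# Synthetic stand-in for a mouth video: a bright block drifting to the right.
frames = np.zeros((5, 32, 32))
for t in range(5):
    frames[t, 12:18, 8 + t:12 + t] = 1.0

mhi = motion_history_image(frames)
approx = haar_swt_approximation(mhi)
features = hu_first_two(approx)
```

In a full pipeline, a feature vector like `features` (or Zernike or raw geometric moments computed from the same approximation sub-image) would be passed to an MLP classifier; that stage is omitted here because the abstract gives no network details.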
Pages: 340-345
Page count: 6