Visual words for lip-reading

被引：2

作者：

Hassanat, Ahmad B. A. ^{[1
]}

Jassim, Sabah ^{[1
]}

机构：

[1] Univ Buckingham, Buckingham, England

来源：

MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010 | 2010年 / 7708卷

关键词：

lip reading; visual speech recognition; speech reading; VSR; KNN; DTW; visual feature extraction; visual words;

D O I：

10.1117/12.850635

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, the automatic lip reading problem is investigated, and an innovative approach to providing solutions to this problem has been proposed. This new VSR approach is dependent on the signature of the word itself, which is obtained from a hybrid feature extraction method dependent on geometric, appearance, and image transform features. The proposed VSR approach is termed "visual words". The visual words approach consists of two main parts, 1) Feature extraction/selection, and 2) Visual speech feature recognition. After localizing face and lips, several visual features for the lips where extracted. Such as the height and width of the mouth, mutual information and the quality measurement between the DWT of the current ROI and the DWT of the previous ROI, the ratio of vertical to horizontal features taken from DWT of ROI, The ratio of vertical edges to horizontal edges of ROI, the appearance of the tongue and the appearance of teeth. Each spoken word is represented by 8 signals, one of each feature. Those signals maintain the dynamic of the spoken word, which contains a good portion of information. The system is then trained on these features using the KNN and DTW. This approach has been evaluated using a large database for different people, and large experiment sets. The evaluation has proved the visual words efficiency, and shown that the VSR is a speaker dependent problem.

引用

页数：12

共 50 条

[1] Visual Lip-Reading for Quranic Arabic Alphabets and Words Using Deep Learning
Aljohani, Nada Faisal
Jaha, Emad Sami
[J]. Computer Systems Science and Engineering, 2023, 46 (03): : 3037 - 3058
[2] LIP-READING
Lindquist, Ida P.
[J]. VOLTA REVIEW, 1917, 19 (04) : 188 - 188
[3] LIP-READING
Naber, Joseph E.
[J]. VOLTA REVIEW, 1920, 22 (08) : 527 - 528
[4] LIP-READING
Wilson, Ida H.
[J]. VOLTA REVIEW, 1920, 22 (04) : 221 - 222
[5] LIP-READING
Wadleigh, Grace K.
[J]. VOLTA REVIEW, 1921, 23 (01) : 46 - 47
[6] LIP-READING
不详
[J]. VOLTA REVIEW, 1919, 21 (12) : 800 - 800
[7] Visual units and confusion modelling for automatic lip-reading
Howell, Dominic
Cox, Stephen
Theobald, Barry
[J]. IMAGE AND VISION COMPUTING, 2016, 51 : 1 - 12
[8] ROI Processing for Visual Features Extraction in Lip-reading
Wang, Xiaoping
Hao, Yufeng
Fu, Degang
Yuan, Chunwei
[J]. 2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 178 - +
[9] Convolutional Neural Networks for Predicting Words: A lip-Reading System
Sindhura, P., V
Preethi, S. J.
Krupa, Niranjana B.
[J]. 2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT - 2018), 2018, : 929 - 933
[10] ADVENTURES IN LIP-READING
McKenna, Alice
[J]. VOLTA REVIEW, 1921, 23 (05) : 213 - 215

← 1 2 3 4 5 →