We study the incorporation of facial depth data in the task of isolated-word visual speech recognition. We propose novel features based on unsupervised training of a single-layer autoencoder. The features are extracted from both the video and depth channels obtained by a Microsoft Kinect device. We perform all experiments on our database of 54 speakers, each uttering 50 words. We compare our autoencoder features to traditional methods such as DCT or PCA. The features are further processed by a simplified variant of hierarchical linear discriminant analysis in order to capture the speech dynamics. Classification is performed using a multi-stream Hidden Markov Model for various combinations of the audio, video, and depth channels. We also evaluate the visual features in joint audio-visual isolated-word recognition in noisy environments.
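
To give a concrete idea of the unsupervised single-layer autoencoder features mentioned above, the following is a minimal NumPy sketch of a tied-weight autoencoder trained on mouth-region frames, with the per-frame hidden activations used as static features; the ROI size, hidden width, and training hyperparameters are illustrative assumptions, not the settings used in this work.

```python
# Minimal sketch: single-layer, tied-weight autoencoder trained by gradient
# descent to reconstruct mouth-ROI frames (video or depth channel).
# All sizes and hyperparameters below are assumptions for illustration only.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_autoencoder(X, n_hidden=64, lr=0.1, epochs=100):
    """X: (n_frames, n_pixels) mouth-ROI frames scaled to [0, 1]."""
    n_vis = X.shape[1]
    W = rng.normal(0.0, 0.01, (n_vis, n_hidden))  # tied encoder/decoder weights
    b_h = np.zeros(n_hidden)
    b_v = np.zeros(n_vis)
    for _ in range(epochs):
        H = sigmoid(X @ W + b_h)              # hidden code (the features)
        R = H @ W.T + b_v                     # linear reconstruction
        err = R - X                           # reconstruction error
        d_h = (err @ W) * H * (1.0 - H)       # backprop through the encoder
        dW = (err.T @ H + X.T @ d_h) / len(X) # gradient for the tied weights
        W -= lr * dW
        b_v -= lr * err.mean(axis=0)
        b_h -= lr * d_h.mean(axis=0)
    return W, b_h

def extract_features(X, W, b_h):
    """Frame-wise hidden activations used as the static visual features."""
    return sigmoid(X @ W + b_h)

# Usage with random stand-in data: 200 frames of a 32x32 mouth ROI.
frames = rng.random((200, 32 * 32))
W, b_h = train_autoencoder(frames)
features = extract_features(frames, W, b_h)   # shape (200, 64)
```

In the full pipeline these per-frame features would then be passed through the dynamic post-processing (the simplified hierarchical LDA) before entering the multi-stream HMM classifier.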