Development of a Low-Cost, Noninvasive, Portable Visual Speech Recognition Program
Cited by: 3
Authors:
Kohlberg, Gavriel D. [1]
Gal, Ya'akov [2]
Lalwani, Anil K. [1]
Affiliations:
[1] Columbia Univ Coll Phys & Surg, Dept Otolaryngol Head & Neck Surg, 630 W 168th St, New York, NY 10032 USA
[2] Ben Gurion Univ Negev, Dept Informat Syst Engn, Beer Sheva, Israel
communication aids for disabled;
lipreading;
user computer interface;
visual speech recognition;
silent speech interface;
TOTAL LARYNGECTOMY;
CARE;
COMMUNICATION;
NETWORK;
DOI:
10.1177/0003489416650689
Chinese Library Classification number:
R76 [Otorhinolaryngology];
Discipline classification code:
100213;
Abstract:
Objectives: Loss of speech following tracheostomy and laryngectomy severely limits communication to simple gestures and facial expressions that are largely ineffective. To facilitate communication in these patients, we seek to develop a low-cost, noninvasive, portable, and simple visual speech recognition program (VSRP) to convert articulatory facial movements into speech. Methods: A Microsoft Kinect-based VSRP was developed to capture spatial coordinates of lip movements and translate them into speech. The articulatory speech movements associated with 12 sentences were used to train an artificial neural network classifier. The accuracy of the classifier was then evaluated on a separate, previously unseen set of articulatory speech movements. Results: The VSRP was successfully implemented and tested in 5 subjects. It achieved an accuracy rate of 77.2% (65.0%-87.6% for the 5 speakers) on a 12-sentence data set. The mean time to classify an individual sentence was 2.03 milliseconds (1.91-2.16). Conclusion: We have demonstrated the feasibility of a low-cost, noninvasive, portable VSRP based on Kinect to accurately predict speech from articulation movements in clinically trivial time. This VSRP could be used as a novel communication device for aphonic patients.