3DPalsyNet: A Facial Palsy Grading and Motion Recognition Framework Using Fully 3D Convolutional Neural Networks

被引：27

作者：

Storey, Gary ^{[1
]}

Jiang, Richard ^{[2
]}

Keogh, Shelagh ^{[1
]}

Bouridane, Ahmed ^{[1
]}

Li, Chang-Tsun ^{[3
]}

机构：

[1] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England

[2] Univ Lancaster, Dept Comp & Commun, Lancaster LA1 4WA, England

[3] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

英国工程与自然科学研究理事会;

关键词：

Computer vision; face detection; facial action recognition; machine learning; PATTERNS;

D O I：

10.1109/ACCESS.2019.2937285

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The capability to perform facial analysis from video sequences has significant potential to positively impact in many areas of life. One such area relates to the medical domain to specifically aid in the diagnosis and rehabilitation of patients with facial palsy. With this application in mind, this paper presents an end-to-end framework, named 3DPalsyNet, for the tasks of mouth motion recognition and facial palsy grading. 3DPalsyNet utilizes a 3D CNN architecture with a ResNet backbone for the prediction of these dynamic tasks. Leveraging transfer learning from a 3D CNNs pre-trained on the Kinetics data set for general action recognition, the model is modified to apply joint supervised learning using center and softmax loss concepts. 3DPalsyNet is evaluated on a test set consisting of individuals with varying ranges of facial palsy and mouth motions and the results have shown an attractive level of classification accuracy in these tasks of 82% and 86% respectively. The frame duration and the loss function affect was studied in terms of the predictive qualities of the proposed 3DPalsyNet, where it was found shorter frame duration's of 8 performed best for this specific task. Centre loss and softmax have shown improvements in spatio-temporal feature learning than softmax loss alone, this is in agreement with earlier work involving the spatial domain.

引用

页码：121655 / 121664

页数：10

共 50 条

[41] Robust 3D Face Alignment with Efficient Fully Convolutional Neural Networks
Jiang, Lei
Wu, Xiao-Jun
Kittler, Josef
IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 266 - 277
[42] Using Motion History Images With 3D Convolutional Networks in Isolated Sign Language Recognition
Mercanoglu Sincan, Ozge
Keles, Hacer Yalim
IEEE ACCESS, 2022, 10 : 18608 - 18618
[43] 3D Pose Regression using Convolutional Neural Networks
Mahendran, Siddharth
Ali, Haider
Vidal, Rene
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 494 - 495
[44] Violence Detection using 3D Convolutional Neural Networks
Su, Jiayi
Her, Paris
Clemens, Erik
Yaz, Edwin
Schneider, Susan
Medeiros, Henry
2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
[45] Video Steganography Using 3D Convolutional Neural Networks
Abdolmohammadi, Mahdi
Toroghi, Rahil Mahdian
Bastanfard, Azam
PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 1144 : 149 - 161
[46] 3D Pose Regression using Convolutional Neural Networks
Mahendran, Siddharth
Ali, Haider
Vidal, Rene
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2174 - 2182
[47] 3D object understanding with 3D Convolutional Neural Networks
Leng, Biao
Liu, Yu
Yu, Kai
Zhang, Xiangyang
Xiong, Zhang
INFORMATION SCIENCES, 2016, 366 : 188 - 201
[48] PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEO USING 3D CONVOLUTIONAL NEURAL NETWORKS
Wu, Chengrui
Chen, Shicheng
Sheng, Guorui
Roussel, Pierre
Denby, Bruce
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5764 - 5768
[49] Recognition, location, measurement, and 3D reconstruction of concealed cracks using convolutional neural networks
Tong, Zheng
Gao, Jie
Zhang, Haitao
CONSTRUCTION AND BUILDING MATERIALS, 2017, 146 : 775 - 787
[50] Using Convolutional 3D Neural Networks for User-Independent Continuous Gesture Recognition
Camgoz, Necati Cihan
Hadfield, Simon
Koller, Oscar
Bowden, Richard
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 49 - 54

← 1 2 3 4 5 →