3DPalsyNet: A Facial Palsy Grading and Motion Recognition Framework Using Fully 3D Convolutional Neural Networks

被引:27
|
作者
Storey, Gary [1 ]
Jiang, Richard [2 ]
Keogh, Shelagh [1 ]
Bouridane, Ahmed [1 ]
Li, Chang-Tsun [3 ]
机构
[1] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[2] Univ Lancaster, Dept Comp & Commun, Lancaster LA1 4WA, England
[3] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
基金
英国工程与自然科学研究理事会;
关键词
Computer vision; face detection; facial action recognition; machine learning; PATTERNS;
D O I
10.1109/ACCESS.2019.2937285
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The capability to perform facial analysis from video sequences has significant potential to positively impact in many areas of life. One such area relates to the medical domain to specifically aid in the diagnosis and rehabilitation of patients with facial palsy. With this application in mind, this paper presents an end-to-end framework, named 3DPalsyNet, for the tasks of mouth motion recognition and facial palsy grading. 3DPalsyNet utilizes a 3D CNN architecture with a ResNet backbone for the prediction of these dynamic tasks. Leveraging transfer learning from a 3D CNNs pre-trained on the Kinetics data set for general action recognition, the model is modified to apply joint supervised learning using center and softmax loss concepts. 3DPalsyNet is evaluated on a test set consisting of individuals with varying ranges of facial palsy and mouth motions and the results have shown an attractive level of classification accuracy in these tasks of 82% and 86% respectively. The frame duration and the loss function affect was studied in terms of the predictive qualities of the proposed 3DPalsyNet, where it was found shorter frame duration's of 8 performed best for this specific task. Centre loss and softmax have shown improvements in spatio-temporal feature learning than softmax loss alone, this is in agreement with earlier work involving the spatial domain.
引用
收藏
页码:121655 / 121664
页数:10
相关论文
共 50 条
  • [41] Robust 3D Face Alignment with Efficient Fully Convolutional Neural Networks
    Jiang, Lei
    Wu, Xiao-Jun
    Kittler, Josef
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 266 - 277
  • [42] Using Motion History Images With 3D Convolutional Networks in Isolated Sign Language Recognition
    Mercanoglu Sincan, Ozge
    Keles, Hacer Yalim
    IEEE ACCESS, 2022, 10 : 18608 - 18618
  • [43] 3D Pose Regression using Convolutional Neural Networks
    Mahendran, Siddharth
    Ali, Haider
    Vidal, Rene
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 494 - 495
  • [44] Violence Detection using 3D Convolutional Neural Networks
    Su, Jiayi
    Her, Paris
    Clemens, Erik
    Yaz, Edwin
    Schneider, Susan
    Medeiros, Henry
    2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
  • [45] Video Steganography Using 3D Convolutional Neural Networks
    Abdolmohammadi, Mahdi
    Toroghi, Rahil Mahdian
    Bastanfard, Azam
    PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 1144 : 149 - 161
  • [46] 3D Pose Regression using Convolutional Neural Networks
    Mahendran, Siddharth
    Ali, Haider
    Vidal, Rene
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2174 - 2182
  • [47] 3D object understanding with 3D Convolutional Neural Networks
    Leng, Biao
    Liu, Yu
    Yu, Kai
    Zhang, Xiangyang
    Xiong, Zhang
    INFORMATION SCIENCES, 2016, 366 : 188 - 201
  • [48] PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEO USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Wu, Chengrui
    Chen, Shicheng
    Sheng, Guorui
    Roussel, Pierre
    Denby, Bruce
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5764 - 5768
  • [49] Recognition, location, measurement, and 3D reconstruction of concealed cracks using convolutional neural networks
    Tong, Zheng
    Gao, Jie
    Zhang, Haitao
    CONSTRUCTION AND BUILDING MATERIALS, 2017, 146 : 775 - 787
  • [50] Using Convolutional 3D Neural Networks for User-Independent Continuous Gesture Recognition
    Camgoz, Necati Cihan
    Hadfield, Simon
    Koller, Oscar
    Bowden, Richard
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 49 - 54