A Generative Approach for Face Mask Removal Using Audio and Appearance

Cited by: 1
Authors
Coelho, Luiz E. L. [1]
Prates, Raphael [1]
Schwartz, William Robson [1]
Affiliations
[1] Univ Fed Minas Gerais, Dept Comp Sci, Smart Sense Lab, Belo Horizonte, MG, Brazil
DOI
10.1109/SIBGRAPI54419.2021.00040
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Since the COVID-19 pandemic, wearing face masks in public spaces and at gatherings has become common. Journalists, reporters, and interviewees therefore frequently wear a mask, following public health measures to contain the pandemic. However, a mask worn while speaking or presenting can be uncomfortable for viewers. It also prevents lip reading, which can harm speech comprehension for people with hearing impairment. This work therefore aims to artificially remove masks in videos while recovering the lip movements from the audio and the uncovered face features. We use the audio to infer lip movement that matches the uttered phrase: from the audio, we estimate landmarks representing the mouth structure. Finally, the landmarks (i.e., uncovered and estimated) are the input to a generative adversarial network (GAN) that reconstructs the full face image with the mouth in the correct shape. We present quantitative results in the form of evaluation metrics and qualitative results in the form of visual examples.
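The landmark-combination step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the common 68-point facial landmark layout, in which indices 48 to 67 outline the mouth; the record does not state which layout the paper uses. The function name `merge_landmarks` and the sample coordinates are hypothetical, and the audio-to-landmark estimator and the GAN itself are left out.

```python
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]

# Assumption: 68-point landmark layout; indices 48..67 outline the mouth.
NUM_LANDMARKS = 68
MOUTH_INDICES = range(48, 68)


def merge_landmarks(
    visible: Dict[int, Point], estimated_mouth: List[Point]
) -> List[Optional[Point]]:
    """Combine landmarks detected on the uncovered face with mouth
    landmarks estimated from audio (hypothetical helper)."""
    if len(estimated_mouth) != len(MOUTH_INDICES):
        raise ValueError("expected one estimated point per mouth landmark")

    # Landmarks that are neither visible nor estimated stay None.
    merged: List[Optional[Point]] = [None] * NUM_LANDMARKS
    for idx, point in visible.items():
        merged[idx] = point

    # Audio-estimated mouth points take precedence in the masked region.
    for idx, point in zip(MOUTH_INDICES, estimated_mouth):
        merged[idx] = point
    return merged


# Usage with made-up coordinates: two visible eye corners and a flat mouth line.
visible = {36: (30.0, 40.0), 45: (70.0, 40.0)}
mouth = [(50.0 + i, 80.0) for i in range(20)]
full = merge_landmarks(visible, mouth)
```

In a pipeline like the one the abstract outlines, a merged landmark set of this kind would be the conditioning input passed to the generator.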
Pages: 239-246
Number of pages: 8