Automating Manga Character Analysis: A Robust Deep Vision-Transformer Approach to Facial Landmark Detection

被引:0
|
作者
Vachmanus, Sirawich [1 ]
Phinklao, Noppanan [1 ]
Phongsarnariyakul, Naruparn [1 ]
Plongcharoen, Thanat [1 ]
Hotta, Seiji [2 ]
Tuarob, Suppawong [1 ]
机构
[1] Mahidol Univ, Fac Informat & Commun Technol, Nakhon Pathom 73170, Thailand
[2] Tokyo Univ Agr & Technol, Inst Engn, Tokyo 1838538, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Face recognition; Cultural aspects; Feature extraction; Accuracy; Heating systems; Neural networks; Transformers; Deep learning; Face landmarks; deep learning; manga; cultural heritage;
D O I
10.1109/ACCESS.2024.3459419
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comics, particularly Japanese manga, are a powerful medium that blends images and text to convey ideas and encapsulate a unique cultural heritage. Going beyond mere entertainment, manga merges diverse styles and content deeply rooted in Japanese cultural heritage. This study utilizes computer vision analysis, with a specific focus on facial landmark detection, acknowledging the growing significance of technology in analyzing manga images. Through a comprehensive exploration of various methods, the research identifies the extended version of Bidirectional Encoder Representations from Transformers (BERT), BERT Pre-Training of Image Transformers (BEiT), model as a standout performer due to its efficiency and effectiveness. The BEiT model's success lies in its ability to extract facial features, consequently establishing itself as a go-to solution for landmark detection on manga faces. The outcomes achieved the lowest Failure Rate compared to other landmark detection networks, with a Failure Rate of approximately 9.4% and a Mean Average Error of about 4.6 pixels. Beyond its technical accomplishments, this study carries a cultural significance, contributing to the ongoing narrative of manga in Japan.
引用
收藏
页码:131284 / 131295
页数:12
相关论文
共 19 条
  • [1] TransMarker: A Pure Vision Transformer for Facial Landmark Detection
    Wu, Wenyan
    Cai, Yici
    Zhou, Qiang
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3580 - 3587
  • [2] Deep coupling neural network for robust facial landmark detection
    Wu, Wenyan
    Wu, Xingzhe
    Cai, Yici
    Zhou, Qiang
    COMPUTERS & GRAPHICS-UK, 2019, 82 : 286 - 294
  • [3] Robust Facial Landmark Detection by Multiorder Multiconstraint Deep Networks
    Wan, Jun
    Lai, Zhihui
    Li, Jing
    Zhou, Jie
    Gao, Can
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 2181 - 2194
  • [4] DEEP GEOMETRY EMBEDDING NETWORKS FOR ROBUST FACIAL LANDMARK DETECTION
    Zhu, Meilu
    Shi, Daming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1222 - 1227
  • [5] Robust Facial Landmark Detection via Occlusion-adaptive Deep Networks
    Zhu, Meilu
    Shi, Daming
    Zheng, Mingjie
    Sadiq, Muhammad
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3481 - 3491
  • [6] Deep Neural Networks and Computer Vision for Real-Time Facial Landmark Detection
    Ciobotaru, Alexandru
    Gota, Dan Ioan
    Puscasiu, Adela
    Stan, Ovidiu
    Fanca, Alexandra
    Domnuta, Claudiu
    Valean, Honoriu
    Miclea, Liviu
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 424 - 429
  • [7] A robust occlusion-adaptive attention-based deep network for facial landmark detection
    Sadiq, Muhammad
    Shi, D.
    Liang, Junwei
    APPLIED INTELLIGENCE, 2022, 52 (08) : 9320 - 9333
  • [8] A robust occlusion-adaptive attention-based deep network for facial landmark detection
    Muhammad Sadiq
    D. Shi
    Junwei Liang
    Applied Intelligence, 2022, 52 : 9320 - 9333
  • [9] Robust facial landmark detection by cross-order cross-semantic deep network
    Wan, Jun
    Lai, Zhihui
    Shen, Linlin
    Zhou, Jie
    Gao, Can
    Xiao, Gang
    Hou, Xianxu
    NEURAL NETWORKS, 2021, 136 : 233 - 243
  • [10] Improving Video Vision Transformer for Deepfake Video Detection Using Facial Landmark, Depthwise Separable Convolution and Self Attention
    Ramadhani, Kurniawan Nur
    Munir, Rinaldi
    Utama, Nugraha Priya
    IEEE ACCESS, 2024, 12 : 8932 - 8939