Lightweight real-time hand segmentation leveraging MediaPipe landmark detection

被引:0
|
作者
Guillermo Sánchez-Brizuela
Ana Cisnal
Eusebio de la Fuente-López
Juan-Carlos Fraile
Javier Pérez-Turiel
机构
[1] Universidad de Valladolid,ITAP (Instituto de las Tecnologías Avanzadas de la Producción)
来源
Virtual Reality | 2023年 / 27卷
关键词
Augmented reality; Hand segmentation; MediaPipe; Online processing; Semantic segmentation;
D O I
暂无
中图分类号
学科分类号
摘要
Real-time hand segmentation is a key process in applications that require human–computer interaction, such as gesture recognition or augmented reality systems. However, the infinite shapes and orientations that hands can adopt, their variability in skin pigmentation and the self-occlusions that continuously appear in images make hand segmentation a truly complex problem, especially with uncontrolled lighting conditions and backgrounds. The development of robust, real-time hand segmentation algorithms is essential to achieve immersive augmented reality and mixed reality experiences by correctly interpreting collisions and occlusions. In this paper, we present a simple but powerful algorithm based on the MediaPipe Hands solution, a highly optimized neural network. The algorithm processes the landmarks provided by MediaPipe using morphological and logical operators to obtain the masks that allow dynamic updating of the skin color model. Different experiments were carried out comparing the influence of the color space on skin segmentation, with the CIELab color space chosen as the best option. An average intersection over union of 0.869 was achieved on the demanding Ego2Hands dataset running at 90 frames per second on a conventional computer without any hardware acceleration. Finally, the proposed segmentation procedure was implemented in an augmented reality application to add hand occlusion for improved user immersion. An open-source implementation of the algorithm is publicly available at https://github.com/itap-robotica-medica/lightweight-hand-segmentation.
引用
收藏
页码:3125 / 3132
页数:7
相关论文
共 50 条
  • [1] Lightweight real-time hand segmentation leveraging MediaPipe landmark detection
    Sanchez-Brizuela, Guillermo
    Cisnal, Ana
    de la Fuente-Lopez, Eusebio
    Fraile, Juan-Carlos
    Perez-Turiel, Javier
    VIRTUAL REALITY, 2023, 27 (04) : 3125 - 3132
  • [2] MediaPipe with LSTM Architecture for Real-Time Hand Gesture Recognization
    Biswas, Sougatamoy
    Nandy, Anup
    Naskar, Asim Kumar
    Saw, Rahul
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 422 - 431
  • [3] Real-Time Hand Detection Method Based on Lightweight Network
    Jin, Fangrui
    Wang, Yangping
    Yong, Jiu
    Computer Engineering and Applications, 2023, 59 (14) : 192 - 200
  • [4] TinySegformer: A lightweight visual segmentation model for real-time agricultural pest detection
    Zhang, Yan
    Lv, Chunli
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 218
  • [5] Real-Time Hand Gesture Monitoring Model Based on MediaPipe's Registerable System
    Meng, Yuting
    Jiang, Haibo
    Duan, Nengquan
    Wen, Haijun
    SENSORS, 2024, 24 (19)
  • [6] Lightweight Bilateral Network for Real-Time Semantic Segmentation
    Wang, Pengtao
    Li, Lihong
    Pan, Feiyang
    Wang, Lin
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (04) : 673 - 682
  • [7] Real-Time Detection of Hand Gestures
    Muzyka, Piotr
    Frydrysiak, Marek
    Roszkowska, Elzbieta
    2016 21ST INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2016, : 168 - 173
  • [8] Real-time face and hand detection
    González-Ortega, D
    Díaz-Pernas, FJ
    Díez-Higuera, JF
    Martínez-Zarzuela, M
    Boto-Giralda, D
    PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, 2005, : 77 - 82
  • [9] Real-time pavement surface crack detection based on lightweight semantic segmentation model
    Yu, Huayang
    Deng, Yihao
    Guo, Feng
    TRANSPORTATION GEOTECHNICS, 2024, 48
  • [10] Real-time Efficient Facial Landmark Detection Algorithms
    Xiong, Hanying
    Lu, Tongwei
    Zhang, Hongzhi
    AIPR 2020: 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2020, : 191 - 195