Fingertips Detection in Egocentric Video Frames using Deep Neural Networks

被引:2
|
作者
Mishra, Purnendu [1 ]
Sarawadekar, Kishor [1 ]
机构
[1] Indian Inst Technol BHU, Dept Elect Engn, Varanasi 221005, Uttar Pradesh, India
关键词
Computer Vision; Fingertip; RGB; HCI; Ego-centric; Multi-gesture; POINT DETECTION;
D O I
10.1109/ivcnz48456.2019.8960957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, there has been much advancement in Augmented Reality technologies. Also, there has been a rise in the usage of wearable cameras. These technologies allow us to interact with the virtual world and the real world simultaneously. Hand gestures or finger gestures can be used to provide input instructions replacing conventional tools like a keyboard or a mouse. This paper introduces an improvement over the YOLSE (You Only Look what You Should See) model towards multiple fingertip position estimation. We propose a regression-based technique to locate fingertip(s) in a multi-gesture condition. First, the hand gesture is segmented from the scene using a deep neural network (DNN) based object detection model. Next, fingertip(s) positions are estimated using MobileNetv2 architecture. It is difficult to use direct regression when the varying number of visible fingertips are present in different egocentric hand gestures. We used the multi-label classification concept to identify all the visible extended fingers in the image. Average errors on RGB image with a resolution of 640 x 480 is 6.1527 pixels. The processing time of 9.072 ms is achieved on Nvidia GeForce GTX 1080 GPU.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Speech Activity Detection Using Deep Neural Networks
    Shahsavari, Sajad
    Sameti, Hossein
    Hadian, Hossein
    [J]. 2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1564 - 1568
  • [32] Scalable Object Detection using Deep Neural Networks
    Erhan, Dumitru
    Szegedy, Christian
    Toshev, Alexander
    Anguelov, Dragomir
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2155 - 2162
  • [33] Object Detection Using Deep Convolutional Neural Networks
    Qian, Huimin
    Xu, Jiawei
    Zhou, Jun
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1151 - 1156
  • [34] Power Theft Detection Using Deep Neural Networks
    Mangat, Gagandeep
    Divya, Divya
    Gupta, Varun
    Sambyal, Nitigya
    [J]. ELECTRIC POWER COMPONENTS AND SYSTEMS, 2021, 49 (4-5) : 458 - 473
  • [35] Detection-based video surveillance using Deep Neural Networks on STM32 microcontroller
    Canepa, Alessio
    Ragusa, Edoardo
    Zunino, Rodolfo
    Gastaldo, Paolo
    [J]. 2022 29TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (IEEE ICECS 2022), 2022,
  • [36] Abnormal Events Detection Using Deep Networks for Video Surveillance
    Meng, Binghao
    Zhang, Lu
    Jin, Fan
    Yang, Lu
    Cheng, Hong
    Wang, Qian
    [J]. COGNITIVE SYSTEMS AND SIGNAL PROCESSING, ICCSIP 2016, 2017, 710 : 197 - 204
  • [37] Novelty detection in video surveillance using hierarchical neural networks
    Owens, J
    Hunter, A
    Fletcher, E
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 1249 - 1254
  • [38] Video sequence boundary detection using neural gas networks
    Cao, X
    Suganthan, PN
    [J]. ARTIFICIAL NEURAL NETWORKS-ICANN 2001, PROCEEDINGS, 2001, 2130 : 1048 - 1053
  • [39] Generalizable Deep Video Inpainting Detection Based on Constrained Convolutional Neural Networks
    Li, Jinchuan
    Zhao, Xianfeng
    Cao, Yun
    [J]. DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 125 - 138
  • [40] Toward Efficient and Adaptive Design of Video Detection System with Deep Neural Networks
    Mao, Jiachen
    Yang, Qing
    Li, Ang
    Nixon, Kent W.
    Li, Hai
    Chen, Yiran
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (03)