Fingertips Detection in Egocentric Video Frames using Deep Neural Networks

被引:2
|
作者
Mishra, Purnendu [1 ]
Sarawadekar, Kishor [1 ]
机构
[1] Indian Inst Technol BHU, Dept Elect Engn, Varanasi 221005, Uttar Pradesh, India
关键词
Computer Vision; Fingertip; RGB; HCI; Ego-centric; Multi-gesture; POINT DETECTION;
D O I
10.1109/ivcnz48456.2019.8960957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, there has been much advancement in Augmented Reality technologies. Also, there has been a rise in the usage of wearable cameras. These technologies allow us to interact with the virtual world and the real world simultaneously. Hand gestures or finger gestures can be used to provide input instructions replacing conventional tools like a keyboard or a mouse. This paper introduces an improvement over the YOLSE (You Only Look what You Should See) model towards multiple fingertip position estimation. We propose a regression-based technique to locate fingertip(s) in a multi-gesture condition. First, the hand gesture is segmented from the scene using a deep neural network (DNN) based object detection model. Next, fingertip(s) positions are estimated using MobileNetv2 architecture. It is difficult to use direct regression when the varying number of visible fingertips are present in different egocentric hand gestures. We used the multi-label classification concept to identify all the visible extended fingers in the image. Average errors on RGB image with a resolution of 640 x 480 is 6.1527 pixels. The processing time of 9.072 ms is achieved on Nvidia GeForce GTX 1080 GPU.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Video Dynamics Detection Using Deep Neural Networks
    Zheng, Keji
    Yan, Wei Qi
    Nand, Parma
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2018, 2 (03): : 224 - 234
  • [2] Video Saliency Detection Using Deep Convolutional Neural Networks
    Zhou, Xiaofei
    Liu, Zhi
    Gong, Chen
    Li, Gongyang
    Huang, Mengke
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 308 - 319
  • [3] Emotion Intensity Estimation from Video Frames using Deep Hybrid Convolutional Neural Networks
    Thuseethan, Selvarajah
    Rajasegarar, Sutharshan
    Yearwood, John
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] Visibility Loss Detection for Video Camera Using Deep Convolutional Neural Networks
    Ivanov, Alexey
    Yudin, Dmitry
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'18), VOL 1, 2019, 874 : 434 - 443
  • [5] Recognition of Depression from Video Frames by using Convolutional Neural Networks
    Wang, Jianwen
    Sha, Xiao
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 1137 - 1148
  • [6] Spatio-temporal based video anomaly detection using deep neural networks
    Chaurasia R.K.
    Jaiswal U.C.
    [J]. International Journal of Information Technology, 2023, 15 (3) : 1569 - 1581
  • [7] Video deepfake detection using Particle Swarm Optimization improved deep neural networks
    Cunha, Leandro
    Zhang, Li
    Lim, Chee Peng
    Sowan, Bilal
    Kong, Yinghui
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (15): : 8417 - 8453
  • [8] Video deepfake detection using Particle Swarm Optimization improved deep neural networks
    Leandro Cunha
    Li Zhang
    Bilal Sowan
    Chee Peng Lim
    Yinghui Kong
    [J]. Neural Computing and Applications, 2024, 36 : 8417 - 8453
  • [9] Salient object detection in video using deep non-local neural networks
    Shokri, Mohammad
    Harati, Ahad
    Taba, Kimya
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 68
  • [10] Deep-fake video detection approaches using convolutional - recurrent neural networks
    Suratkar, Shraddha
    Bhiungade, Sayali
    Pitale, Jui
    Soni, Komal
    Badgujar, Tushar
    Kazi, Faruk
    [J]. JOURNAL OF CONTROL AND DECISION, 2023, 10 (02) : 198 - 214