Deep Attention Models for Human Tracking Using RGBD

被引:12
|
作者
Rasoulidanesh, Maryamsadat [1 ]
Yadav, Srishti [1 ]
Herath, Sachini [2 ]
Vaghei, Yasaman [3 ]
Payandeh, Shahram [1 ]
机构
[1] Simon Fraser Univ, Sch Engn Sci, Networked Robot & Sensing Lab, Burnaby, BC V5A 1S6, Canada
[2] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[3] Simon Fraser Univ, Sch Mechatron Syst Engn, Burnaby, BC V5A 1S6, Canada
关键词
computer vision; visual tracking; attention model; RGBD; Kinect; deep network; convolutional neural network; Long Short-Term Memory; DEPTH;
D O I
10.3390/s19040750
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Visual tracking performance has long been limited by the lack of better appearance models. These models fail either where they tend to change rapidly, like in motion-based tracking, or where accurate information of the object may not be available, like in color camouflage (where background and foreground colors are similar). This paper proposes a robust, adaptive appearance model which works accurately in situations of color camouflage, even in the presence of complex natural objects. The proposed model includes depth as an additional feature in a hierarchical modular neural framework for online object tracking. The model adapts to the confusing appearance by identifying the stable property of depth between the target and the surrounding object(s). The depth complements the existing RGB features in scenarios when RGB features fail to adapt, hence becoming unstable over a long duration of time. The parameters of the model are learned efficiently in the Deep network, which consists of three modules: (1) The spatial attention layer, which discards the majority of the background by selecting a region containing the object of interest; (2) the appearance attention layer, which extracts appearance and spatial information about the tracked object; and (3) the state estimation layer, which enables the framework to predict future object appearance and location. Three different models were trained and tested to analyze the effect of depth along with RGB information. Also, a model is proposed to utilize only depth as a standalone input for tracking purposes. The proposed models were also evaluated in real-time using KinectV2 and showed very promising results. The results of our proposed network structures and their comparison with the state-of-the-art RGB tracking model demonstrate that adding depth significantly improves the accuracy of tracking in a more challenging environment (i.e., cluttered and camouflaged environments). Furthermore, the results of depth-based models showed that depth data can provide enough information for accurate tracking, even without RGB information.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Object Tracking Using Deep Convolutional Neural Networks and Visual Appearance Models
    Mocanu, Bogdan
    Tapu, Ruxandra
    Zaharia, Titus
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017), 2017, 10617 : 114 - 125
  • [42] Enhancing Sparse Index-Tracking Portfolios Using Deep Learning Models
    Carlos Andres Zapata Quimbayo
    Daniel Aragón Urrego
    John Freddy Moreno Trujillo
    Oscar Eduardo Reyes Nieto
    SN Computer Science, 6 (3)
  • [43] Video Summarization with LSTM and Deep Attention Models
    Casas, Luis Lebron
    Koblents, Eugenia
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 67 - 79
  • [44] RGBD Salient Object Detection using Spatially Coherent Deep Learning Framework
    Huang, Posheng
    Shen, Chin-Han
    Hsiao, Hsu-Feng
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [45] Object tracking in infrared images using a deep learning model and a target-attention mechanism
    Mahboub Parhizkar
    Gholamreza Karamali
    Bahram Abedi Ravan
    Complex & Intelligent Systems, 2023, 9 : 1495 - 1506
  • [46] Object tracking in infrared images using a deep learning model and a target-attention mechanism
    Parhizkar, Mahboub
    Karamali, Gholamreza
    Ravan, Bahram Abedi
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (02) : 1495 - 1506
  • [47] Tracking Human Migration from Online Attention
    Vaca-Ruiz, Carmen
    Quercia, Daniele
    Maria Aiello, Luca
    Fraternali, Piero
    CITIZEN IN SENSOR NETWORKS, 2014, 8313 : 73 - 83
  • [48] Robust RGBD Tracking via Weighted Convolution Operators
    Liu, Weichun
    Tang, Xiaoan
    Zhao, Chenglin
    IEEE SENSORS JOURNAL, 2020, 20 (08) : 4496 - 4503
  • [49] REDEEM MYSELF: Purifying Backdoors in Deep Learning Models using Self Attention Distillation
    Gong, Xueluan
    Chen, Yanjiao
    Yang, Wang
    Wang, Qian
    Gu, Yuzhe
    Huang, Huayang
    Shen, Chao
    2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023, : 755 - 772
  • [50] Computer-aided autism diagnosis based on visual attention models using eye tracking
    Jessica S. Oliveira
    Felipe O. Franco
    Mirian C. Revers
    Andréia F. Silva
    Joana Portolese
    Helena Brentani
    Ariane Machado-Lima
    Fátima L. S. Nunes
    Scientific Reports, 11