Deep Object Detector With Attentional Spatiotemporal LSTM for Space Human-Robot Interaction

被引:17
|
作者
Yu, Jiahui [1 ,2 ,3 ]
Gao, Hongwei [5 ]
Chen, Yongquan [1 ,2 ,3 ]
Zhou, Dalin [4 ]
Liu, Jinguo [6 ]
Ju, Zhaojie [4 ]
机构
[1] Chinese Univ Hong Kong, Inst Robot & Intelligent Mfg, Shenzhen 518172, Peoples R China
[2] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518035, Peoples R China
[4] Univ Portsmouth, Sch Comp, Portsmouth PO1 3HE, Hants, England
[5] Shenyang Ligong Univ, Sch Automat & Elect Engn, Shenyang 110159, Peoples R China
[6] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Inst Robot & Intelligent Mfg, Shenyang 110016, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Feature extraction; Videos; Detectors; Robots; Semantics; Object detection; Real-time systems; Attention model; single shot multibox detector (SSD); space human-robot interaction (SHRI); video object detection; GESTURE RECOGNITION; MODEL; DYNAMICS;
D O I
10.1109/THMS.2022.3144951
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Global temporal information and local semantic information are essential cues for high-performance online object detection in videos. However, despite their promising detection accuracy in most cases, most state-of-the-art approaches have following two limitations: invalid background/scale suppression and inadequate temporal information mining between frames. Many jobs currently focus on temporal information learning based on a single frame. In this article, we propose an attentional global-local information learning network; this is one of the first attempts to fully use both types of information between frames. Attention maps are creatively utilized to transfer temporal contexts between frames. This also effectively alleviates the adverse effects of scale changes. Furthermore, empowered by a detailed framework, a proposed detector effectively uses multilevel feature extraction. Given these contributions, the proposed detector achieves state-of-the-art performance on challenging benchmarks. Finally, practical experiments are conducted on a space human-robot interaction platform.
引用
收藏
页码:784 / 793
页数:10
相关论文
共 50 条
  • [1] Attentional Human-Robot Interaction in Simple Manipulation Tasks
    Burattini, Ernesto
    Finzi, Alberto
    Rossi, Silvia
    Staffa, Mariacarla
    [J]. HRI'12: PROCEEDINGS OF THE SEVENTH ANNUAL ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2012, : 129 - 130
  • [2] Human object tracking algorithm for human-robot interaction
    Zhang, Tie
    Ma, Qiong-Xiong
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2015, 49 (08): : 1213 - 1219
  • [3] Flexible Task Execution and Attentional Regulations in Human-Robot Interaction
    Caccavale, Riccardo
    Finzi, Alberto
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2017, 9 (01) : 68 - 79
  • [4] Plan Execution and Attentional Regulations for Flexible Human-Robot Interaction
    Caccavale, Riccardo
    Finzi, Alberto
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2453 - 2458
  • [5] Space, Speech, and Gesture in Human-Robot Interaction
    Mead, Ross
    [J]. ICMI '12: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2012, : 333 - 336
  • [6] A Review of NASA Human-Robot Interaction in Space
    Kimberly Hambuchen
    Jessica Marquez
    Terrence Fong
    [J]. Current Robotics Reports, 2021, 2 (3): : 265 - 272
  • [7] Fast Object Detection for Human-Robot Interaction Control
    Shieh, M. Y.
    Chen, Y. H.
    Li, J. H.
    Pai, N. S.
    Chiou, J. S.
    [J]. 2013 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2013, : 616 - 619
  • [8] Human-Robot Interaction for Learning and Adaptation of Object Movements
    Muehlig, Manuel
    Gienger, Michael
    Steil, Jochen J.
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 4901 - 4907
  • [9] Object recognition through human-robot interaction by speech
    Kurnia, R
    Hossain, A
    Nakamura, A
    Kuno, Y
    [J]. RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 619 - 624
  • [10] Spatiotemporal Coordination Supports a Sense of Commitment in Human-Robot Interaction
    Vignolo, Alessia
    Sciutti, Alessandra
    Rea, Francesco
    Michael, John
    [J]. SOCIAL ROBOTICS, ICSR 2019, 2019, 11876 : 34 - 43