Attention-based active visual search for mobile robots

Cited by: 14
Authors
Rasouli, Amir [1, 2]
Lanillos, Pablo [3]
Cheng, Gordon [3]
Tsotsos, John K. [1, 2]
Affiliations
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[2] York Univ, Ctr Vis Res, Toronto, ON, Canada
[3] Tech Univ Munich, ICS, Arcisstr 21, D-80333 Munich, Germany
Funding
European Union Horizon 2020; Natural Sciences and Engineering Research Council of Canada
Keywords
Active visual search; Visual attention; Probabilistic lost target search; Top-down modulation; Search and rescue; TOP-DOWN; OBJECT; COMPLEXITY; MODEL; TARGET; ENVIRONMENTS; FRAMEWORK; VISION; PATH;
DOI
10.1007/s10514-019-09882-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an active visual search model for finding objects in unknown environments. The proposed algorithm guides the robot towards the sought object using the relevant stimuli provided by the visual sensors. Existing search strategies are either purely reactive or use simplified sensor models that do not exploit all the visual information available. In this paper, we propose a new model that actively extracts visual information via visual attention techniques and, in conjunction with a non-myopic decision-making algorithm, leads the robot to search more relevant areas of the environment. The attention module couples both top-down and bottom-up attention models enabling the robot to search regions with higher importance first. The proposed algorithm is evaluated on a mobile robot platform in a 3D simulated environment. The results indicate that the use of visual attention significantly improves search, but the degree of improvement depends on the nature of the task and the complexity of the environment. In our experiments, we found that performance enhancements of up to 42% in structured and 38% in highly unstructured cluttered environments can be achieved using visual attention mechanisms.
Pages: 131-146
Page count: 16
Related papers
50 in total
  • [41] Attention-Based Audio-Visual Fusion for Video Summarization
    Fang, Yinghong
    Zhang, Junpeng
    Lu, Cewu
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
  • [42] Leveraging attention-based visual clue extraction for image classification
    Cui, Yunbo
    Du, Youtian
    Wang, Xue
    Wang, Hang
    Su, Chang
    [J]. IET IMAGE PROCESSING, 2021, 15 (12) : 2937 - 2947
  • [43] Relational attention-based Markov logic network for visual navigation
    Kang Zhou
    Chi Guo
    Huyin Zhang
    [J]. The Journal of Supercomputing, 2022, 78 : 9907 - 9933
  • [44] Visual attention-based robot navigation using information sampling
    Winters, N
    Santos-Victor, J
    [J]. IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 1670 - 1675
  • [45] Attention-based Pyramid Aggregation Network for Visual Place Recognition
    Zhu, Yingying
    Wang, Jiong
    Xie, Lingxi
    Zheng, Liang
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 99 - 107
  • [46] Multiple mobile robots real-time visual search algorithm
    Yan, Caixia
    Zhan, Qiang
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND PATTERN RECOGNITION IN INDUSTRIAL ENGINEERING, 2010, 7820
  • [47] Visual Coverage Using Autonomous Mobile Robots for Search and Rescue Applications
    Del Bue, A.
    Tamassia, M.
    Signorini, F.
    Murino, V.
    Farinelli, A.
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2013,
  • [48] Relational attention-based Markov logic network for visual navigation
    Zhou, Kang
    Guo, Chi
    Zhang, Huyin
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (07): : 9907 - 9933
  • [49] Attention-Based Keyword Localisation in Speech using Visual Grounding
    Olaleye, Kayode
    Kamper, Herman
    [J]. INTERSPEECH 2021, 2021, : 2991 - 2995
  • [50] Global motion compensated visual attention-based video watermarking
    Oakes, Matthew
    Bhowmik, Deepayan
    Abhayaratne, Charith
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (06)