Attention-based active visual search for mobile robots

被引：14

作者：

Rasouli, Amir ^{[1
,2
]}

Lanillos, Pablo ^{[3
]}

Cheng, Gordon ^{[3
]}

Tsotsos, John K. ^{[1
,2
]}

机构：

[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada

[2] York Univ, Ctr Vis Res, Toronto, ON, Canada

[3] Tech Univ Munich, ICS, Arcisstr 21, D-80333 Munich, Germany

来源：

AUTONOMOUS ROBOTS | 2020年 / 44卷 / 02期

基金：

欧盟地平线“2020”; 加拿大自然科学与工程研究理事会;

关键词：

Active visual search; Visual attention; Probabilistic lost target search; Top-down modulation; Search and rescue; TOP-DOWN; OBJECT; COMPLEXITY; MODEL; TARGET; ENVIRONMENTS; FRAMEWORK; VISION; PATH;

D O I：

10.1007/s10514-019-09882-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present an active visual search model for finding objects in unknown environments. The proposed algorithm guides the robot towards the sought object using the relevant stimuli provided by the visual sensors. Existing search strategies are either purely reactive or use simplified sensor models that do not exploit all the visual information available. In this paper, we propose a new model that actively extracts visual information via visual attention techniques and, in conjunction with a non-myopic decision-making algorithm, leads the robot to search more relevant areas of the environment. The attention module couples both top-down and bottom-up attention models enabling the robot to search regions with higher importance first. The proposed algorithm is evaluated on a mobile robot platform in a 3D simulated environment. The results indicate that the use of visual attention significantly improves search, but the degree of improvement depends on the nature of the task and the complexity of the environment. In our experiments, we found that performance enhancements of up to 42% in structured and 38% in highly unstructured cluttered environments can be achieved using visual attention mechanisms.

引用

页码：131 / 146

页数：16

共 50 条

[41] Attention-Based Audio-Visual Fusion for Video Summarization
Fang, Yinghong
Zhang, Junpeng
Lu, Cewu
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
[42] Leveraging attention-based visual clue extraction for image classification
Cui, Yunbo
Du, Youtian
Wang, Xue
Wang, Hang
Su, Chang
[J]. IET IMAGE PROCESSING, 2021, 15 (12) : 2937 - 2947
[43] Relational attention-based Markov logic network for visual navigation
Kang Zhou
Chi Guo
Huyin Zhang
[J]. The Journal of Supercomputing, 2022, 78 : 9907 - 9933
[44] Visual attention-based robot navigation using information sampling
Winters, N
Santos-Victor, J
[J]. IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 1670 - 1675
[45] Attention-based Pyramid Aggregation Network for Visual Place Recognition
Zhu, Yingying
Wang, Jiong
Xie, Lingxi
Zheng, Liang
[J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 99 - 107
[46] Multiple mobile robots real-time visual search algorithm
Yan, Caixia
Zhan, Qiang
[J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND PATTERN RECOGNITION IN INDUSTRIAL ENGINEERING, 2010, 7820
[47] Visual Coverage Using Autonomous Mobile Robots for Search and Rescue Applications
Del Bue, A.
Tamassia, M.
Signorini, F.
Murino, V.
Farinelli, A.
[J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2013,
[48] Relational attention-based Markov logic network for visual navigation
Zhou, Kang
Guo, Chi
Zhang, Huyin
[J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (07): : 9907 - 9933
[49] Attention-Based Keyword Localisation in Speech using Visual Grounding
Olaleye, Kayode
Kamper, Herman
[J]. INTERSPEECH 2021, 2021, : 2991 - 2995
[50] Global motion compensated visual attention-based video watermarking
Oakes, Matthew
Bhowmik, Deepayan
Abhayaratne, Charith
[J]. JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (06)

← 1 2 3 4 5 →