Part-based multi-task deep network for autonomous indoor drone navigation

Cited by: 7

Authors:

Zhang, Xiangzhu [1]

Zhang, Lijia [2]

Pei, Hailong [1]

Lewis, Frank L. [3]

Affiliations:

[1] South China Univ Technol, Unmanned Aerial Vehicle Syst Engn Technol Res Ctr, Minist Educ, Key Lab Autonomous Syst & Networked Control, Wushan Rd, Guangzhou 510640, Peoples R China

[2] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China

[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76019 USA

Source:

TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL | 2020 / Vol. 42 / No. 16

Keywords:

Aerial robotics; monocular vision; indoor navigation; obstacle avoidance; multi-task deep network;

DOI:

10.1177/0142331220947507

Chinese Library Classification (CLC):

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Two common methods exist for solving indoor autonomous navigation and obstacle-avoidance problems using monocular vision: the traditional simultaneous localization and mapping (SLAM) method, which requires complex hardware and heavy computation and is prone to errors in low-texture or dynamic environments; and deep-learning algorithms, which use fully connected layers for classification or regression, resulting in more model parameters and a tendency to over-fit. Among the latter, the most advanced indoor navigation algorithm divides a single image frame into multiple parts for prediction, which doubles the inference time. To solve these problems, we propose a multi-task deep network based on feature-map region division for monocular indoor autonomous navigation. We divide the feature map instead of the original image to avoid repeated information processing. To reduce model parameters, we use convolution instead of fully connected layers to predict the navigable probability of the left, middle, and right parts. The linear velocity is determined by combining the three predicted probabilities to reduce collision risk. Experimental evaluation shows that the proposed model is nine times smaller than the previous state-of-the-art methods; further, its processing speed and navigation capability increase by more than five and 1.6 times, respectively.
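The idea of dividing the feature map rather than the input image can be illustrated with a minimal sketch. This is not the authors' implementation: the nested-list feature map layout, the function names, the 1x1 convolution followed by global average pooling, and especially the weighted-average velocity rule are all assumptions made for illustration, since the abstract does not specify them.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def split_columns(feature_map, parts=3):
    """Split a C x H x W feature map (nested lists) into vertical bands.

    The backbone runs once on the full image; only its output is divided,
    so no pixels are processed twice.
    """
    width = len(feature_map[0][0])
    bounds = [i * width // parts for i in range(parts + 1)]
    return [
        [[row[bounds[i]:bounds[i + 1]] for row in channel] for channel in feature_map]
        for i in range(parts)
    ]


def navigable_probability(region, weights, bias=0.0):
    """Assumed head: 1x1 convolution over channels, global average pool, sigmoid.

    The head needs only len(weights) + 1 parameters, independent of H and W,
    whereas a fully connected layer would need C * H * W weights per output.
    """
    channels = len(region)
    height = len(region[0])
    width = len(region[0][0])
    total = 0.0
    for y in range(height):
        for x in range(width):
            total += sum(weights[c] * region[c][y][x] for c in range(channels)) + bias
    return sigmoid(total / (height * width))


def linear_velocity(p_left, p_mid, p_right, v_max=1.0):
    """Hypothetical combination rule: weighted average favouring the middle part."""
    return v_max * (0.25 * p_left + 0.5 * p_mid + 0.25 * p_right)


# Toy 1-channel, 1 x 6 feature map: blocked on the left, free on the right.
toy_map = [[[-10.0, -10.0, 0.0, 0.0, 10.0, 10.0]]]
left, mid, right = split_columns(toy_map)
probs = [navigable_probability(r, weights=[1.0]) for r in (left, mid, right)]
speed = linear_velocity(*probs)
```

In this toy case the left probability is close to 0, the right is close to 1, and the middle is 0.5, so the assumed rule yields a moderate forward speed. Any real system would replace the weights and the velocity rule with those learned or specified in the paper.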


Pages: 3243-3253

Page count: 11
