Part-based multi-task deep network for autonomous indoor drone navigation

被引:7
|
作者
Zhang, Xiangzhu [1 ]
Zhang, Lijia [2 ]
Pei, Hailong [1 ]
Lewis, Frank L. [3 ]
机构
[1] South China Univ Technol, Unmanned Aerial Vehicle Syst Engn Technol Res Ctr, Minist Educ, Key Lab Autonomous Syst & Networked Control, Wushan Rd, Guangzhou 510640, Peoples R China
[2] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China
[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76019 USA
关键词
Aerial robotics; monocular vision; indoor navigation; obstacle avoidance; multi-task deep network;
D O I
10.1177/0142331220947507
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Two common methods exist for solving indoor autonomous navigation and obstacle-avoidance problems using monocular vision: the traditional simultaneous localization and mapping (SLAM) method, which requires complex hardware, heavy calculations, and is prone to errors in low texture or dynamic environments; and deep-learning algorithms, which use the fully connected layer for classification or regression, resulting in more model parameters and easy over-fitting. Among the latter ones, the most advanced indoor navigation algorithm divides a single image frame into multiple parts for prediction, resulting in doubled reasoning time. To solve these problems, we propose a multi-task deep network based on feature map region division for monocular indoor autonomous navigation. We divide the feature map instead of the original image to avoid repeated information processing. To reduce model parameters, we use convolution instead of the fully connected layer to predict the navigable probability of the left, middle, and right parts. We propose that the linear velocity is determined by combining three prediction probabilities to reduce collision risk. Experimental evaluation shows that the proposed method is nine times smaller than the previous state-of-the-art methods; further, its processing speed and navigation capability increase more than five and 1.6 times, respectively.
引用
收藏
页码:3243 / 3253
页数:11
相关论文
共 50 条
  • [1] Deep Convolutional Neural Network Based Autonomous Drone Navigation
    Amer, Karim
    Samy, Mohamed
    Shaker, Mahmoud
    Elhelw, Mohamed
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [2] Multi-task person re-identification via attribute and part-based learning
    Yuqing Peng
    Wei Li
    Yingjun Li
    Yixin Pei
    Yongfang Guo
    [J]. Multimedia Tools and Applications, 2022, 81 : 11221 - 11237
  • [3] Multi-task person re-identification via attribute and part-based learning
    Peng, Yuqing
    Li, Wei
    Li, Yingjun
    Pei, Yixin
    Guo, Yongfang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (08) : 11221 - 11237
  • [4] Autonomous unmanned aerial vehicle flight control using multi-task deep neural network for exploring indoor environments
    Duc Bui, Viet
    Shirakawa, Tomohiro
    Sato, Hiroshi
    [J]. SICE Journal of Control, Measurement, and System Integration, 2022, 15 (02) : 130 - 144
  • [5] Perception, Guidance, and Navigation for Indoor Autonomous Drone Racing Using Deep Learning
    Jung, Sunggoo
    Hwang, Sunyou
    Shin, Heemin
    Shim, David Hyunchul
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (03): : 2539 - 2544
  • [6] Wi-Fi Indoor Localization based on Multi-Task Deep Learning
    Lin, Wei-Yuan
    Huang, Ching-Chun
    Nguyen-Tran Duc
    Hung-Nguyen Manh
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [7] Shared Multi-Task Imitation Learning for Indoor Self-Navigation
    Xu, Junhong
    Liu, Qiwei
    Guo, Hanqing
    Kageza, Aaron
    AlQarni, Saeed
    Wu, Shaoen
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [8] Consistent Online Multi-object Tracking with Part-Based Deep Network
    Xu, Chuanzhi
    Zhou, Yue
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 180 - 192
  • [9] LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles
    Yan, Fuwu
    Wang, Kewei
    Zou, Bin
    Tang, Luqi
    Li, Wenbo
    Lv, Chen
    [J]. IEEE ACCESS, 2020, 8 : 86753 - 86764
  • [10] Drone-Based Tower Survey by Multi-Task Learning
    Sami, Mirza Tanzim
    Yan, Da
    Huang, Huang
    Liang, Xinyu
    Guo, Guimu
    Jiang, Zhe
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 6011 - 6013