Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning

被引:10
|
作者
Guo, Jinghua [1 ]
Wang, Jingyao [2 ]
Wang, Huinian [1 ]
Xiao, Baoping [1 ]
He, Zhifei [1 ]
Li, Lubin [1 ]
机构
[1] Xiamen Univ, Dept Mech & Elect Engn, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
关键词
autonomous vehicles; visual perception; multi-task learning; traffic object detection; drivable area detection; lane line detection;
D O I
10.3390/s23136238
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Road scene understanding is crucial to the safe driving of autonomous vehicles. Comprehensive road scene understanding requires a visual perception system to deal with a large number of tasks at the same time, which needs a perception model with a small size, fast speed, and high accuracy. As multi-task learning has evident advantages in performance and computational resources, in this paper, a multi-task model YOLO-Object, Drivable Area, and Lane Line Detection (YOLO-ODL) based on hard parameter sharing is proposed to realize joint and efficient detection of traffic objects, drivable areas, and lane lines. In order to balance tasks of YOLO-ODL, a weight balancing strategy is introduced so that the weight parameters of the model can be automatically adjusted during training, and a Mosaic migration optimization scheme is adopted to improve the evaluation indicators of the model. Our YOLO-ODL model performs well on the challenging BDD100K dataset, achieving the state of the art in terms of accuracy and computational efficiency.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Multi-Task Deep Learning Design and Training Tool for Unified Visual Driving Scene Understanding
    Won, Woong-Jae
    Kim, Tae Hun
    Kwon, Soon
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 356 - 360
  • [22] Statistically correlated multi-task learning for autonomous driving
    Abbas, Waseem
    Khan, Muhammad Fakhir
    Taj, Murtaza
    Mahmood, Arif
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (19): : 12921 - 12938
  • [23] MTMamba: Enhancing Multi-task Dense Scene Understanding by Mamba-Based Decoders
    Lin, Baijiong
    Jiang, Weisen
    Chen, Pengguang
    Zhang, Yu
    Liu, Shu
    Chen, Ying-Cong
    COMPUTER VISION - ECCV 2024, PT LXX, 2025, 15128 : 314 - 330
  • [24] Scalable Multi-Task Imitation Learning with Autonomous Improvement
    Singh, Avi
    Jang, Eric
    Irpan, Alexander
    Kappler, Daniel
    Dalal, Murtaza
    Levinev, Sergey
    Khansari, Mohi
    Finn, Chelsea
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2167 - 2173
  • [25] Optimal Configuration of Multi-Task Learning for Autonomous Driving
    Jun, Woomin
    Son, Minjun
    Yoo, Jisang
    Lee, Sungjin
    SENSORS, 2023, 23 (24)
  • [26] Modeling On-road Trajectories with Multi-task Learning
    Liu, Kaijun
    Ruan, Sijie
    Long, Cheng
    Yu, Liang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2025, 19 (01)
  • [27] Multi-Task Learning and Multimodal Fusion for Road Segmentation
    Cheng, Bowen
    Tian, Miaomiao
    Jiang, Shuai
    Liu, Weiwei
    Pang, Yalong
    IEEE ACCESS, 2023, 11 : 18947 - 18959
  • [28] Research on NER Based on Register Migration and Multi-task Learning
    Ma, Haoran
    Ding, Zhaoyun
    Zhou, Dongsheng
    Wang, Jinhua
    Niu, ShuoShuo
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 657 - 666
  • [29] Bidirectional Transformer Based Multi-Task Learning for Natural Language Understanding
    Tripathi, Suraj
    Singh, Chirag
    Kumar, Abhay
    Pandey, Chandan
    Jain, Nishant
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 54 - 65
  • [30] Event Detection via Context Understanding Based on Multi-task Learning
    Xia, Jing
    Li, Xiaolong
    Tan, Yongbin
    Zhang, Wu
    Li, Dajun
    Xiong, Zhengkun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)