Multimodal Perception and Decision-Making Systems for Complex Roads Based on Foundation Models

被引:0
|
作者
Fan, Lili [1 ]
Wang, Yutong [2 ]
Zhang, Hui [3 ]
Zeng, Changxian [4 ]
Li, Yunjie [5 ]
Gou, Chao [6 ]
Yu, Hui [7 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[3] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[4] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China
[5] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[6] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen 518107, Peoples R China
[7] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2DJ, Hants, England
基金
中国国家自然科学基金;
关键词
Autonomous driving; camera and four-dimensional (4-D) millimeter wave radar; ChatGPT; Industry; 5.0; multimodal; perception and decision making; LOCALIZATION; FUSION; LIDAR;
D O I
10.1109/TSMC.2024.3444277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since the inception of Industry 5.0 in 2021, a growing number of researchers have begun to pay their attention to the revolutionary shift it brings. The principles of Industry 5.0, including human-centric, sustainability, and emphasis on ecological and social values, will become the new paradigm for future industrial development. In this transformative landscape, artificial intelligence (AI) plays a pivotal role, and foundation models based on ChatGPT are set to reshape the organizational structure of industries. In this article, we introduce a multimodal perception and decision-making system built upon a foundational model. This system integrates image and point cloud data to enhance perception accuracy and provide ample information for decision making. It is designed to achieve a deep integration of AI and human-centric autonomous driving within the context of Industry 5.0. We introduce a cross-domain learning approach in the system architecture, along with a model training method from foundation models to handle complex road conditions. The proposed method enables road drivable area segmentation on complex unstructured roads. To address the issue of increased variance caused by the residual structure employed in previous works, this article introduces a distribution correction module, which effectively mitigates this problem. Furthermore, to achieve high-performance perception systems in intricate road scenarios, we put forth a multimodal perception fusion method in this study. The experiments demonstrate the superiority of this approach over single-sensor perception. This work contributes to the ongoing discourse on the convergence of AI, human-centric values, and advanced driving systems within the framework of Industry 5.0.
引用
收藏
页码:6561 / 6569
页数:9
相关论文
共 50 条
  • [41] DYNAMIC DECISION-MAKING - HUMAN CONTROL OF COMPLEX-SYSTEMS
    BREHMER, B
    ACTA PSYCHOLOGICA, 1992, 81 (03) : 211 - 241
  • [42] Estimating Local Decision-Making Behavior in Complex Evolutionary Systems
    Sha, Zhenghui
    Panchal, Jitesh H.
    JOURNAL OF MECHANICAL DESIGN, 2014, 136 (06)
  • [43] Emergy evaluation for decision-making in complex multifunctional farming systems
    Fonseca, Ana Margarida P.
    Marques, Carlos A. F.
    Pinto-Correia, Teresa
    Guiomar, Nuno
    Campbell, Daniel E.
    AGRICULTURAL SYSTEMS, 2019, 171 : 1 - 12
  • [44] Model-Based Support for Decision-Making in Architecture Evolution of Complex Software Systems
    Plakidas, Konstantinos
    Schall, Daniel
    Zdun, Uwe
    ECSA 2018: PROCEEDINGS OF THE 12TH EUROPEAN CONFERENCE ON SOFTWARE ARCHITECTURE: COMPANION PROCEEDINGS, 2018,
  • [45] Multiagent trajectory decision-making for complex systems based on transformer large model structure
    Ma, Ziyuan
    Gong, Huajun
    Wang, Xinhua
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024,
  • [46] An applied complex systems framework for risk-based decision-making in infrastructure engineering
    Gomez, C.
    Sanchez-Silva, M.
    Duenas-Osorio, L.
    STRUCTURAL SAFETY, 2014, 50 : 66 - 77
  • [47] Intelligent Assistance of Decision-Making in the Management of Multifactor Systems Based on Fuzzy Cognitive Models
    Ilyasov, Baryi
    Makarova, Elena
    Zakieva, Elena
    Gabdullina, Elvira
    PROCEEDINGS OF THE 7TH SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGIES FOR INTELLIGENT DECISION MAKING SUPPORT (ITIDS 2019), 2019, 166 : 1 - 7
  • [48] Cellular perception and misperception: Internal models for decision-making shaped by evolutionary experience
    Mitchell, Amir
    Lim, Wendell
    BIOESSAYS, 2016, 38 (09) : 845 - 849
  • [49] Decision making in systems engineering: The foundation
    Roth, Gregory L.
    CTS 2007: PROCEEDINGS OF THE 2007 INTERNATIONAL SYMPOSIUM ON COLLABORATIVE TECHNOLOGIES AND SYSTEMS, 2007, : 236 - 246
  • [50] DECISION-MAKING - YES OR NO - PROBLEM OF DECISION-MAKING IN LIGHT OF CATHOLIC SOCIAL MODELS
    STEGMANN, FJ
    POLITISCHE STUDIEN, 1970, 21 (192) : 385 - 405