Software/Hardware Co-design for Multi-modal Multi-task Learning in Autonomous Systems

Cited by: 13
Authors
Hao, Cong [1]
Chen, Deming [2]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Univ Illinois, Urbana, IL USA
DOI
10.1109/AICAS51828.2021.9458577
CLC number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Optimizing the quality of result (QoR) and the quality of service (QoS) of AI-empowered autonomous systems simultaneously is very challenging. First, there are multiple input sources, e.g., multi-modal data from different sensors, requiring diverse data preprocessing, sensor fusion, and feature aggregation. Second, there are multiple tasks that require various AI models to run simultaneously, e.g., perception, localization, and control. Third, the computing and control system is heterogeneous, composed of hardware components with varied features, such as embedded CPUs, GPUs, FPGAs, and dedicated accelerators. Therefore, autonomous systems essentially require multi-modal multi-task (MMMT) learning that is aware of hardware performance and implementation strategies. While MMMT learning has attracted intense research interest, its applications in autonomous systems are still underexplored. In this paper, we first discuss the opportunities of applying MMMT techniques in autonomous systems, and then discuss the unique challenges that must be solved. In addition, we discuss the necessity and opportunities of MMMT model and hardware co-design, which is critical for autonomous systems, especially those with power- or resource-limited or heterogeneous platforms. We formulate the co-design of the MMMT model and its heterogeneous hardware implementation as a differentiable optimization problem, with the objective of improving the solution quality and reducing the overall power consumption and critical path latency. We advocate for further exploration of MMMT in autonomous systems and of software/hardware co-design solutions.
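The abstract names a differentiable co-design objective but does not give its form. The following is a minimal sketch of what such an objective could look like, not the authors' formulation. It assumes each task's device choice is relaxed into a softmax over per-device logits, that tasks run in parallel so the critical path is the slowest task, and that the maximum is smoothed with log-sum-exp. All function names, weights, and cost values are hypothetical.

```python
import math

def softmax(logits):
    """Turn raw preferences into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def smooth_max(values, temperature=0.1):
    """Log-sum-exp upper bound on max(values); smooth in every input."""
    m = max(values)
    return m + temperature * math.log(
        sum(math.exp((v - m) / temperature) for v in values)
    )

def codesign_cost(task_loss, logits, power, latency,
                  w_power=0.1, w_latency=0.1):
    """Hypothetical combined objective (assumption, not from the paper).

    logits[t][d] : preference of task t for device d
    power[d]     : assumed power cost of running one task on device d
    latency[d]   : assumed latency of running one task on device d
    """
    probs = [softmax(row) for row in logits]
    # Expected total power, summed over all tasks.
    exp_power = sum(p * c for row in probs for p, c in zip(row, power))
    # Expected latency per task; the smoothed maximum stands in for
    # the critical path under the parallel-tasks assumption.
    exp_latency = [sum(p * c for p, c in zip(row, latency)) for row in probs]
    return task_loss + w_power * exp_power + w_latency * smooth_max(exp_latency)

# Two tasks, two devices, no preference yet; all numbers are made up.
logits = [[0.0, 0.0], [0.0, 0.0]]
power = [2.0, 5.0]     # e.g. embedded CPU vs. GPU
latency = [8.0, 3.0]
cost = codesign_cost(1.0, logits, power, latency)
```

Because every term is a smooth function of the logits, a gradient-based optimizer could in principle update model weights and device preferences together, which is the property the word "differentiable" points to.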
Pages: 5