Highly Interactive Self-Supervised Learning for Multi-Modal Trajectory Prediction

被引:0
|
作者
Xie, Wenda [1 ]
Liu, Yahui [1 ]
Zhao, Hongxia [1 ]
Guo, Chao [1 ]
Dai, Xingyuan [1 ,2 ]
Lv, Yisheng [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Changan Univ, Minist Educ, Engn Res Ctr Highway Infrastruct Digitalizat, Xian 710064, Peoples R China
来源
IFAC PAPERSONLINE | 2024年 / 58卷 / 10期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Automatic driving; Self-supervised learning; Trajectory prediction; Deep learning; Intelligent Transportation;
D O I
10.1016/j.ifacol.2024.07.345
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To ensure the safety of autonomous vehicles, trajectory prediction is critical as it enables vehicles to anticipate the movements of surrounding agents, thereby facilitating the planning of secure and strategic driving routes. However, striking a trade-off between predictive accuracy and training costs has always been an intricate challenge. This paper introduces a groundbreaking framework for trajectory prediction known as Highly Interactive Self-Supervised Learning (HI-SSL), a methodology based on self-supervised learning (SSL) that has yet to be thoroughly investigated in the realm of trajectory prediction. The cornerstone of the aforementioned framework is Interactive Masking, which leverages a novel trajectory masking strategy facilitating self-supervised learning tasks that not only enhance prediction accuracy but also eliminate the need for manual annotations. Experiments conducted on the Argoverse motion forecasting dataset demonstrate that our approach achieves competitive performance to prior methods that depend on supervised learning without additional annotation costs. Copyright (C) 2024 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页码:231 / 236
页数:6
相关论文
共 50 条
  • [1] Self-Supervised Distilled Learning for Multi-modal Misinformation Identification
    Mu, Michael
    Das Bhattacharjee, Sreyasee
    Yuan, Junsong
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2818 - 2827
  • [2] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
    Liu, Bo
    Song, Ruoyi
    Xiang, Yuejia
    Du, Junbo
    Ruan, Weijian
    Hu, Jinhui
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (11) : 2031 - 2033
  • [3] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
    Bo Liu
    Ruoyi Song
    Yuejia Xiang
    Junbo Du
    Weijian Ruan
    Jinhui Hu
    IEEE/CAA Journal of Automatica Sinica, 2022, 9 (11) : 2031 - 2033
  • [4] Self-Supervised Multi-Modal Learning for Collaborative Robotic Grasp-Throw
    Hou, Yanxu
    Fang, Zihan
    Li, Jun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4250 - 4256
  • [5] Self-supervised multi-modal fusion network for multi-modal thyroid ultrasound image diagnosis
    Xiang, Zhuo
    Zhuo, Qiuluan
    Zhao, Cheng
    Deng, Xiaofei
    Zhu, Ting
    Wang, Tianfu
    Jiang, Wei
    Lei, Baiying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [6] Self-supervised Multi-Modal Video Forgery Attack Detection
    Zhao, Chenhui
    Li, Xiang
    Younes, Rabih
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [7] Self-supervised opinion summarization with multi-modal knowledge graph
    Lingyun Jin
    Jingqiang Chen
    Journal of Intelligent Information Systems, 2024, 62 : 191 - 208
  • [8] Self-supervised opinion summarization with multi-modal knowledge graph
    Jin, Lingyun
    Chen, Jingqiang
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (01) : 191 - 208
  • [9] Self-supervised learning based multi-modal intra-hour irradiance forecast
    Shan, Shuo
    Dou, Weijin
    Zhang, Kanjian
    Wei, Haikun
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2549 - 2553
  • [10] Multi-modal representation learning in retinal imaging using self-supervised learning for enhanced clinical predictions
    Emese Sükei
    Elisabeth Rumetshofer
    Niklas Schmidinger
    Andreas Mayr
    Ursula Schmidt-Erfurth
    Günter Klambauer
    Hrvoje Bogunović
    Scientific Reports, 14 (1)