Multi-Modal Data-Based Semi-Supervised Learning for Vehicle Positioning

被引:0
|
作者
Huan, Ouwen [1 ]
Yang, Yang [2 ]
Luo, Tao [1 ]
Chen, Mingzhe [3 ,4 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Lab Adv Informat Network, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing Key Lab Network Syst Architecture & Conver, Beijing 100876, Peoples R China
[3] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33146 USA
[4] Univ Miami, Frost Inst Data Sci & Comp, Coral Gables, FL 33146 USA
基金
中国国家自然科学基金;
关键词
Cameras; Radio frequency; Data models; Azimuth; Vectors; Fingerprint recognition; Training; Semi-supervised learning; vehicle positioning; multi-modal data; LOCALIZATION; CAMERA;
D O I
10.1109/TCOMM.2024.3459848
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a multi-modal data based semi-supervised learning (SSL) framework that jointly use channel state information (CSI) data and RGB images for vehicle positioning is designed. In particular, an outdoor positioning system where the vehicle locations are determined by a base station (BS) is considered. The BS equipped with several cameras can collect a large amount of unlabeled CSI data and a small number of labeled CSI data of vehicles, and the images taken by cameras. Although the collected images contain partial information of vehicles (i.e. azimuth angles of vehicles), the relationship between the unlabeled CSI data and its azimuth angle, and the distances between the BS and the vehicles captured by images are both unknown. Therefore, the images cannot be directly used as the labels of unlabeled CSI data to train a positioning model. To exploit unlabeled CSI data and images, a SSL framework that consists of a pretraining stage and a downstream training stage is proposed. In the pretraining stage, the azimuth angles obtained from the images are considered as the labels of unlabeled CSI data to pretrain the positioning model. In the downstream training stage, a small sized labeled dataset in which the accurate vehicle positions are considered as labels is used to retrain the model. Simulation results show that the proposed method can reduce the positioning error by up to 30% compared to a baseline where the model is not pretrained.
引用
收藏
页码:1663 / 1676
页数:14
相关论文
共 50 条
  • [1] Comprehensive Semi-Supervised Multi-Modal Learning
    Yang, Yang
    Wang, Ke-Tao
    Zhan, De-Chuan
    Xiong, Hui
    Jiang, Yuan
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4092 - 4098
  • [2] Semi-Supervised Multi-Modal Learning with Incomplete Modalities
    Yang, Yang
    Zhan, De-Chuan
    Sheng, Xiang-Rong
    Jiang, Yuan
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2998 - 3004
  • [3] Semi-Supervised Learning of Geospatial Objects Through Multi-Modal Data Integration
    Yang, Yi
    Newsam, Shawn
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4062 - 4067
  • [4] Multi-Modal Curriculum Learning for Semi-Supervised Image Classification
    Gong, Chen
    Tao, Dacheng
    Maybank, Stephen J.
    Liu, Wei
    Kang, Guoliang
    Yang, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3249 - 3260
  • [5] Semi-Supervised Multi-Modal Learning with Balanced Spectral Decomposition
    Hu, Peng
    Zhu, Hongyuan
    Peng, Xi
    Lin, Jie
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 99 - 106
  • [6] Semi-supervised Grounding Alignment for Multi-modal Feature Learning
    Chou, Shih-Han
    Fan, Zicong
    Little, James J.
    Sigal, Leonid
    2022 19TH CONFERENCE ON ROBOTS AND VISION (CRV 2022), 2022, : 48 - 57
  • [7] Semi-supervised image clustering with multi-modal information
    Jianqing Liang
    Yahong Han
    Qinghua Hu
    Multimedia Systems, 2016, 22 : 149 - 160
  • [8] Semi-supervised image clustering with multi-modal information
    Liang, Jianqing
    Han, Yahong
    Hu, Qinghua
    MULTIMEDIA SYSTEMS, 2016, 22 (02) : 149 - 160
  • [9] Failure Analysis of a Complex Learning Framework Incorporating Multi-Modal and Semi-Supervised Learning
    Pullum, Laura L.
    Symons, Christopher T.
    2011 IEEE 17TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING (PRDC), 2011, : 308 - 313
  • [10] A multi-modal dental dataset for semi-supervised deep learning image segmentation
    Wang, Yaqi
    Ye, Fan
    Chen, Yifei
    Wang, Chengkai
    Wu, Chengyu
    Xu, Feng
    Ma, Zhean
    Liu, Yi
    Zhang, Yifan
    Cao, Mingguo
    Chen, Xiaodiao
    SCIENTIFIC DATA, 2025, 12 (01)