Multi-Modal Data-Based Semi-Supervised Learning for Vehicle Positioning

被引:0
|
作者
Huan, Ouwen [1 ]
Yang, Yang [2 ]
Luo, Tao [1 ]
Chen, Mingzhe [3 ,4 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Lab Adv Informat Network, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing Key Lab Network Syst Architecture & Conver, Beijing 100876, Peoples R China
[3] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33146 USA
[4] Univ Miami, Frost Inst Data Sci & Comp, Coral Gables, FL 33146 USA
基金
中国国家自然科学基金;
关键词
Cameras; Radio frequency; Data models; Azimuth; Vectors; Fingerprint recognition; Training; Semi-supervised learning; vehicle positioning; multi-modal data; LOCALIZATION; CAMERA;
D O I
10.1109/TCOMM.2024.3459848
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a multi-modal data based semi-supervised learning (SSL) framework that jointly use channel state information (CSI) data and RGB images for vehicle positioning is designed. In particular, an outdoor positioning system where the vehicle locations are determined by a base station (BS) is considered. The BS equipped with several cameras can collect a large amount of unlabeled CSI data and a small number of labeled CSI data of vehicles, and the images taken by cameras. Although the collected images contain partial information of vehicles (i.e. azimuth angles of vehicles), the relationship between the unlabeled CSI data and its azimuth angle, and the distances between the BS and the vehicles captured by images are both unknown. Therefore, the images cannot be directly used as the labels of unlabeled CSI data to train a positioning model. To exploit unlabeled CSI data and images, a SSL framework that consists of a pretraining stage and a downstream training stage is proposed. In the pretraining stage, the azimuth angles obtained from the images are considered as the labels of unlabeled CSI data to pretrain the positioning model. In the downstream training stage, a small sized labeled dataset in which the accurate vehicle positions are considered as labels is used to retrain the model. Simulation results show that the proposed method can reduce the positioning error by up to 30% compared to a baseline where the model is not pretrained.
引用
收藏
页码:1663 / 1676
页数:14
相关论文
共 50 条
  • [21] Multi-modal Recognition of Mental Workload Using Empirical Mode Decomposition and Semi-Supervised Learning
    Zhang, Jianhua
    Li, Jianrong
    2019 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2019, : 215 - 218
  • [22] Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching
    Liang, Jingjun
    Li, Ruichen
    Jin, Qin
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2852 - 2861
  • [23] Semi-Supervised Unpaired Multi-Modal Learning for Label-Efficient Medical Image Segmentation
    Zhu, Lei
    Yang, Kaiyuan
    Zhang, Meihui
    Chan, Ling Ling
    Ng, Teck Khim
    Ooi, Beng Chin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 394 - 404
  • [24] Cancer immunotherapy response prediction from multi-modal clinical and image data using semi-supervised deep learning
    Wang, Xi
    Jiang, Yuming
    Chen, Hao
    Zhang, Taojun
    Han, Zhen
    Chen, Chuanli
    Yuan, Qingyu
    Xiong, Wenjun
    Wang, Wei
    Li, Guoxin
    Heng, Pheng-Ann
    Li, Ruijiang
    RADIOTHERAPY AND ONCOLOGY, 2023, 186
  • [25] Semi-supervised Learning for WLAN Positioning
    Pulkkinen, Teemu
    Roos, Teemu
    Myllymaki, Petri
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT I, 2011, 6791 : 355 - 362
  • [26] VSM: A Versatile Semi-supervised Model for Multi-modal Cell Instance Segmentation
    Cai, Xiaochen
    Cai, Hengxing
    Tu, Weiwei
    Xu, Kele
    Li, Wu-Jun
    COMPETITIONS IN NEURAL INFORMATION PROCESSING SYSTEMS, VOL 212, 2022, 212
  • [27] Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
    Wang, Luyao
    Qi, Pengnian
    Bao, Xigang
    Zhou, Chunlai
    Qin, Biao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 9116 - 9124
  • [28] Heterogeneous Features Integration via Semi-supervised Multi-modal Deep Networks
    Zhao, Lei
    Hu, Qinghua
    Zhou, Yucan
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 11 - 19
  • [29] SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition
    Lian, Zheng
    Liu, Bin
    Tao, Jianhua
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2415 - 2429
  • [30] Semi-supervised Convolutional Neural Networks for Flood Mapping using Multi-modal Remote Sensing Data
    Viet-Hung Luu
    Minh-Son Dao
    Thi Nhat-Thanh Nguyen
    Perry, Stuart
    Zettsu, Koji
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 342 - 347