Multi-Modal Data-Based Semi-Supervised Learning for Vehicle Positioning

被引：0

作者：

Huan, Ouwen ^{[1
]}

Yang, Yang ^{[2
]}

Luo, Tao ^{[1
]}

Chen, Mingzhe ^{[3
,4
]}

机构：

[1] Beijing Univ Posts & Telecommun, Beijing Lab Adv Informat Network, Beijing 100876, Peoples R China

[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing Key Lab Network Syst Architecture & Conver, Beijing 100876, Peoples R China

[3] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33146 USA

[4] Univ Miami, Frost Inst Data Sci & Comp, Coral Gables, FL 33146 USA

来源：

IEEE TRANSACTIONS ON COMMUNICATIONS | 2025年 / 73卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Cameras; Radio frequency; Data models; Azimuth; Vectors; Fingerprint recognition; Training; Semi-supervised learning; vehicle positioning; multi-modal data; LOCALIZATION; CAMERA;

D O I：

10.1109/TCOMM.2024.3459848

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a multi-modal data based semi-supervised learning (SSL) framework that jointly use channel state information (CSI) data and RGB images for vehicle positioning is designed. In particular, an outdoor positioning system where the vehicle locations are determined by a base station (BS) is considered. The BS equipped with several cameras can collect a large amount of unlabeled CSI data and a small number of labeled CSI data of vehicles, and the images taken by cameras. Although the collected images contain partial information of vehicles (i.e. azimuth angles of vehicles), the relationship between the unlabeled CSI data and its azimuth angle, and the distances between the BS and the vehicles captured by images are both unknown. Therefore, the images cannot be directly used as the labels of unlabeled CSI data to train a positioning model. To exploit unlabeled CSI data and images, a SSL framework that consists of a pretraining stage and a downstream training stage is proposed. In the pretraining stage, the azimuth angles obtained from the images are considered as the labels of unlabeled CSI data to pretrain the positioning model. In the downstream training stage, a small sized labeled dataset in which the accurate vehicle positions are considered as labels is used to retrain the model. Simulation results show that the proposed method can reduce the positioning error by up to 30% compared to a baseline where the model is not pretrained.

引用

页码：1663 / 1676

页数：14

共 50 条

[21] Multi-modal Recognition of Mental Workload Using Empirical Mode Decomposition and Semi-Supervised Learning
Zhang, Jianhua
Li, Jianrong
2019 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2019, : 215 - 218
[22] Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching
Liang, Jingjun
Li, Ruichen
Jin, Qin
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2852 - 2861
[23] Semi-Supervised Unpaired Multi-Modal Learning for Label-Efficient Medical Image Segmentation
Zhu, Lei
Yang, Kaiyuan
Zhang, Meihui
Chan, Ling Ling
Ng, Teck Khim
Ooi, Beng Chin
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 394 - 404
[24] Cancer immunotherapy response prediction from multi-modal clinical and image data using semi-supervised deep learning
Wang, Xi
Jiang, Yuming
Chen, Hao
Zhang, Taojun
Han, Zhen
Chen, Chuanli
Yuan, Qingyu
Xiong, Wenjun
Wang, Wei
Li, Guoxin
Heng, Pheng-Ann
Li, Ruijiang
RADIOTHERAPY AND ONCOLOGY, 2023, 186
[25] Semi-supervised Learning for WLAN Positioning
Pulkkinen, Teemu
Roos, Teemu
Myllymaki, Petri
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT I, 2011, 6791 : 355 - 362
[26] VSM: A Versatile Semi-supervised Model for Multi-modal Cell Instance Segmentation
Cai, Xiaochen
Cai, Hengxing
Tu, Weiwei
Xu, Kele
Li, Wu-Jun
COMPETITIONS IN NEURAL INFORMATION PROCESSING SYSTEMS, VOL 212, 2022, 212
[27] Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Wang, Luyao
Qi, Pengnian
Bao, Xigang
Zhou, Chunlai
Qin, Biao
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 9116 - 9124
[28] Heterogeneous Features Integration via Semi-supervised Multi-modal Deep Networks
Zhao, Lei
Hu, Qinghua
Zhou, Yucan
NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 11 - 19
[29] SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition
Lian, Zheng
Liu, Bin
Tao, Jianhua
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2415 - 2429
[30] Semi-supervised Convolutional Neural Networks for Flood Mapping using Multi-modal Remote Sensing Data
Viet-Hung Luu
Minh-Son Dao
Thi Nhat-Thanh Nguyen
Perry, Stuart
Zettsu, Koji
PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 342 - 347

← 1 2 3 4 5 →