Wireless Deep Video Semantic Transmission

被引：70

作者：

Wang, Sixian ^{[1
]}

Dai, Jincheng ^{[1
]}

Liang, Zijian ^{[1
]}

Niu, Kai ^{[1
,2
]}

Si, Zhongwei ^{[1
]}

Dong, Chao ^{[1
]}

Qin, Xiaoqi ^{[3
]}

Zhang, Ping ^{[3
]}

机构：

[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Beijing 100876, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China

[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

来源：

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS | 2023年 / 41卷 / 01期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Semantic communications; video transmission; nonlinear transform; joint source-channel coding; rate-distortion; JOINT SOURCE;

D O I：

10.1109/JSAC.2022.3221977

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we design a new class of high-efficiency deep joint source-channel coding methods to achieve end-to-end video transmission over wireless channels. The proposed methods exploit nonlinear transform and conditional coding architecture to adaptively extract semantic features across video frames, and transmit semantic feature domain representations over wireless channels via deep joint source-channel coding. Our framework is collected under the name deep video semantic transmission (DVST). In particular, benefiting from the strong temporal prior provided by the feature domain context, the learned nonlinear transform function becomes temporally adaptive, resulting in a richer and more accurate entropy model guiding the transmission of current frame. Accordingly, a novel rate adaptive transmission mechanism is developed to customize deep joint source-channel coding for video sources. It learns to allocate the limited channel bandwidth within and among video frames to maximize the overall transmission performance. The whole DVST design is formulated as an optimization problem whose goal is to minimize the end-to-end transmission rate-distortion performance under perceptual quality metrics or machine vision task performance metrics. Across standard video source test sequences and various communication scenarios, experiments show that our DVST can generally surpass traditional wireless video coded transmission schemes. The proposed DVST framework can well support future semantic communications due to its video content-aware and machine vision task integration abilities.

引用

页码：214 / 229

页数：16

共 50 条

[1] Deep-Learning-Aided Wireless Video Transmission
Tung, Tze-Yang
Gunduz, Deniz
2022 IEEE 23RD INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATION (SPAWC), 2022,
[2] Deep Learning Enabled Semantic Communication Systems for Video Transmission
Zhang, Zhenguo
Yang, Qianqian
He, Shibo
Chen, Jiming
2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
[3] Semantic Communication-Enabled Wireless Adaptive Panoramic Video Transmission
Gao, Haixiao
Sun, Mengying
Xu, Xiaodong
Han, Shujun
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[4] DeepWiVe: Deep-Learning-Aided Wireless Video Transmission
Tung, Tze-Yang
Gunduz, Deniz
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (09) : 2570 - 2583
[5] Predictive and Adaptive Deep Coding for Wireless Image Transmission in Semantic Communication
Zhang, Wenyu
Zhang, Haijun
Ma, Hui
Shao, Hua
Wang, Ning
Leung, Victor C. M.
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) : 5486 - 5501
[6] Free-Ride Transmission of Semantic Features in Wireless Video Surveillance Systems
Chen, Junjie
Wang, Yinchu
Wang, Qianfan
Wan, Hai
Ma, Xiao
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[7] Wireless Video Transmission
Wang, Jiangzhou
Fan, Mingxi
You, Xiaohu
Zhang, Xi
Liu, Hui
Steinbach, Eckehard
Milstein, Laurence B.
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2010, 28 (03) : 297 - 298
[8] Wireless Semantic Communications for Video Conferencing
Jiang, Peiwen
Wen, Chao-Kai
Jin, Shi
Li, Geoffrey Ye
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) : 230 - 244
[9] Deep Joint Source-Channel Coding for Wireless Image Transmission with Semantic Importance
Sun, Qizheng
Guo, Caili
Yang, Yang
Chen, Jiujiu
Tang, Rui
Liu, Chuanhong
2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
[10] Deep Video Dehazing With Semantic Segmentation
Ren, Wenqi
Zhang, Jingang
Xu, Xiangyu
Ma, Lin
Cao, Xiaochun
Meng, Gaofeng
Liu, Wei
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1895 - 1908

← 1 2 3 4 5 →