Wireless Deep Video Semantic Transmission

被引:70
|
作者
Wang, Sixian [1 ]
Dai, Jincheng [1 ]
Liang, Zijian [1 ]
Niu, Kai [1 ,2 ]
Si, Zhongwei [1 ]
Dong, Chao [1 ]
Qin, Xiaoqi [3 ]
Zhang, Ping [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Beijing 100876, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Semantic communications; video transmission; nonlinear transform; joint source-channel coding; rate-distortion; JOINT SOURCE;
D O I
10.1109/JSAC.2022.3221977
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we design a new class of high-efficiency deep joint source-channel coding methods to achieve end-to-end video transmission over wireless channels. The proposed methods exploit nonlinear transform and conditional coding architecture to adaptively extract semantic features across video frames, and transmit semantic feature domain representations over wireless channels via deep joint source-channel coding. Our framework is collected under the name deep video semantic transmission (DVST). In particular, benefiting from the strong temporal prior provided by the feature domain context, the learned nonlinear transform function becomes temporally adaptive, resulting in a richer and more accurate entropy model guiding the transmission of current frame. Accordingly, a novel rate adaptive transmission mechanism is developed to customize deep joint source-channel coding for video sources. It learns to allocate the limited channel bandwidth within and among video frames to maximize the overall transmission performance. The whole DVST design is formulated as an optimization problem whose goal is to minimize the end-to-end transmission rate-distortion performance under perceptual quality metrics or machine vision task performance metrics. Across standard video source test sequences and various communication scenarios, experiments show that our DVST can generally surpass traditional wireless video coded transmission schemes. The proposed DVST framework can well support future semantic communications due to its video content-aware and machine vision task integration abilities.
引用
收藏
页码:214 / 229
页数:16
相关论文
共 50 条
  • [1] Deep-Learning-Aided Wireless Video Transmission
    Tung, Tze-Yang
    Gunduz, Deniz
    2022 IEEE 23RD INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATION (SPAWC), 2022,
  • [2] Deep Learning Enabled Semantic Communication Systems for Video Transmission
    Zhang, Zhenguo
    Yang, Qianqian
    He, Shibo
    Chen, Jiming
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [3] Semantic Communication-Enabled Wireless Adaptive Panoramic Video Transmission
    Gao, Haixiao
    Sun, Mengying
    Xu, Xiaodong
    Han, Shujun
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [4] DeepWiVe: Deep-Learning-Aided Wireless Video Transmission
    Tung, Tze-Yang
    Gunduz, Deniz
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (09) : 2570 - 2583
  • [5] Predictive and Adaptive Deep Coding for Wireless Image Transmission in Semantic Communication
    Zhang, Wenyu
    Zhang, Haijun
    Ma, Hui
    Shao, Hua
    Wang, Ning
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) : 5486 - 5501
  • [6] Free-Ride Transmission of Semantic Features in Wireless Video Surveillance Systems
    Chen, Junjie
    Wang, Yinchu
    Wang, Qianfan
    Wan, Hai
    Ma, Xiao
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [7] Wireless Video Transmission
    Wang, Jiangzhou
    Fan, Mingxi
    You, Xiaohu
    Zhang, Xi
    Liu, Hui
    Steinbach, Eckehard
    Milstein, Laurence B.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2010, 28 (03) : 297 - 298
  • [8] Wireless Semantic Communications for Video Conferencing
    Jiang, Peiwen
    Wen, Chao-Kai
    Jin, Shi
    Li, Geoffrey Ye
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) : 230 - 244
  • [9] Deep Joint Source-Channel Coding for Wireless Image Transmission with Semantic Importance
    Sun, Qizheng
    Guo, Caili
    Yang, Yang
    Chen, Jiujiu
    Tang, Rui
    Liu, Chuanhong
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [10] Deep Video Dehazing With Semantic Segmentation
    Ren, Wenqi
    Zhang, Jingang
    Xu, Xiangyu
    Ma, Lin
    Cao, Xiaochun
    Meng, Gaofeng
    Liu, Wei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1895 - 1908