SAW: Semantic-Aware WebRTC Transmission Using Diffusion-Based Scalable Video Coding

Cited by: 0
Authors
Wen, Yihan [1, 2]
Zhang, Zheng [3]
Sun, Jiayi [1]
Li, Jinglei [4]
Chen, Chung Shue [5]
Niu, Guanchong [1]
Affiliations
[1] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510000, Peoples R China
[2] Hong Kong Polytech Univ, Dept Land Surveying & Geoinformat, Hong Kong, Peoples R China
[3] Dalian Univ Technol, Sch Software, Dalian 116024, Peoples R China
[4] Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
[5] Nokia Bell Labs, Dept Machine Learning & Syst, F-91300 Massy, France
Source
IEEE Internet of Things Journal | 2025 / Vol. 12 / No. 05
Funding
National Natural Science Foundation of China
Keywords
Computer vision; network adaptability; scalable video coding (SVC); service-aware WebRTC (SAW); video streaming; recurrent neural-networks; image; performance; compression; impact
DOI
10.1109/JIOT.2024.3486725
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As video transmission systems expand into increasingly complex scenarios, real-time video coding methods are essential for maintaining low latency and high perceptual quality across varying network conditions. In this work, we propose service-aware Web real-time communication (WebRTC), a semantic-assisted WebRTC system built on scalable video coding (SVC). The system is structured in three layers: 1) L-1 extracts and down-samples semantic information at the encoder and applies a novel super-resolution (SR) method named BUS-DDIM at the decoder, improving transmission efficiency and the machine vision recognition rate; 2) L-2 adaptively compresses high-quality video by discarding frames with little motion at the encoder, which addresses latency under poor network conditions, and restores the video at the decoder with an interpolation model called the adjacent frame-guided denoised diffusion implicit model; and 3) L-3 transmits high-quality video for users with high-definition video requirements and favorable network conditions. Together, these layers dynamically enhance the visual experience and ensure low latency across various network environments. Experiments are conducted on diverse videos to validate the effectiveness of the proposed framework. The performance evaluation under real-time scenarios indicates significant enhancements in video quality and transmission efficiency, showing compatibility and versatility across various applications.
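The L-2 idea of discarding frames with little motion at the encoder can be illustrated with a minimal sketch. The abstract does not say how motion is measured or what threshold is used, so the mean absolute pixel difference, the `motion_threshold` parameter, and the function names below are all hypothetical stand-ins rather than the authors' method. The decoder-side restoration of dropped frames is not shown.

```python
from typing import List, Sequence

# A grayscale frame as rows of pixel intensities (illustrative representation).
Frame = Sequence[Sequence[float]]


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average absolute per-pixel difference between two same-sized frames."""
    total = 0.0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += abs(pa - pb)
            count += 1
    return total / count if count else 0.0


def select_frames(frames: List[Frame], motion_threshold: float) -> List[int]:
    """Return the indices of the frames that would be transmitted.

    Assumption: a frame is kept only when it differs from the last kept
    frame by more than motion_threshold. The first frame is always kept.
    """
    if not frames:
        return []
    kept = [0]
    for i in range(1, len(frames)):
        if mean_abs_diff(frames[kept[-1]], frames[i]) > motion_threshold:
            kept.append(i)
    return kept


if __name__ == "__main__":
    static = [[0, 0], [0, 0]]
    moved = [[10, 10], [10, 10]]
    print(select_frames([static, static, moved, moved, static], 1.0))
```

Comparing against the last kept frame, rather than the immediately preceding one, is a design choice of this sketch: it keeps slow cumulative motion from being dropped indefinitely.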
Pages: 5346-5359
Number of pages: 14
Related papers
50 in total
  • [1] SAW: Semantic-Aware WebRTC Transmission Using Diffusion-Based Scalable Video Coding
    Wen, Yihan
    Zhang, Zheng
    Sun, Jiayi
    Li, Jinglei
    Chen, Chung Shue
    Niu, Guanchong
    IEEE Internet of Things Journal, 2024,
  • [2] Media aware FEC for Scalable Video Coding Transmission
    Kondrad, Lukasz
    Bouazizi, Imed
    Gabbouj, Moncef
    ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 7 - +
  • [3] Motion-Based Rate Adaptation in WebRTC Videoconferencing Using Scalable Video Coding
    Bakar, Gonca
    Kirmizioglu, Riza Arda
    Tekalp, A. Murat
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (02) : 429 - 441
  • [4] A Semantic-Aware Transmission With Adaptive Control Scheme for Volumetric Video Service
    Zhu, Yuanwei
    Huang, Yakun
    Qiao, Xiuquan
    Tan, Zhijie
    Bai, Boyuan
    Ma, Huadong
    Dustdar, Schahram
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7160 - 7172
  • [5] Improving Learning-Based Semantic Coding Efficiency for Image Transmission via Shared Semantic-Aware Codebook
    Zhang, Hongwei
    Tao, Meixia
    Sun, Yaping
    Letaief, Khaled B.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2025, 73 (02) : 1217 - 1232
  • [6] Deep Separate Source-channel Coding for Semantic-aware Image Transmission
    Huang, Jianhao
    Li, Dongxu
    Huang, Chuan
    Qin, Xiaoqi
    Zhang, Wei
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5626 - 5631
  • [7] Mobile video transmission using Scalable Video Coding
    Schierl, Thomas
    Stockhammer, Thomas
    Wiegand, Thomas
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (09) : 1204 - 1217
  • [8] A wireless video transmission scheme based on network coding and scalable video coding
    Feng, Dechun
    Gao, Shaoshuai
    2013 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2013,
  • [9] CONGESTION-AWARE TRANSMISSION RATE CONTROL USING MEDIUM GRAIN SCALABILITY OF SCALABLE VIDEO CODING
    Hannuksela, Miska M.
    Zhu, Haibo
    Li, Houqiang
    Gabbouj, Moncef
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2929 - 2932
  • [10] Adaptive transmission of medical image and video using scalable coding and context-aware wireless medical networks
    Doukas, Charalampos
    Maglogiannis, Ilias
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2008, 2008 (1)