Data-Driven Video Scene Importance Estimation for Adaptive Video Streaming

被引:0
|
作者
Choi, Wangyu [1 ]
Yoon, Jongwon [1 ]
机构
[1] Hanyang Univ, Dept Comp Sci & Engn, Ansan, South Korea
来源
2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024 | 2024年
基金
新加坡国家研究基金会;
关键词
Scene importance estimation; Adaptive video streaming; Deep learning;
D O I
10.1109/ICUFN61752.2024.10624977
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, many video streaming services have adopted adaptive bitrate algorithms as their optimization algorithm. Traditionally, ABR algorithms strive to provide an accurate estimate of network conditions. In recent years, ABR algorithms have incorporated the content of interest of the video into the algorithm based on the fact that users are interested in certain segments of the video when watching, and they have achieved significant performance improvements. However, these efforts are expensive in terms of time and cost and are difficult to adapt to new videos. To overcome these limitations, we propose a system for estimating scene saliency for new videos. To do so, we first build a dataset from a large-scale video streaming service, which is then trained on a deep learning model consisting of a 3D CNN and a Transformer. As a result, our proposed model achieves a significantly lower prediction error rate on unseen videos and achieves a significant QoE improvement when incorporated with the ABR algorithm.
引用
收藏
页码:348 / 351
页数:4
相关论文
共 50 条
  • [31] A Data-driven Approach for Facial Expression Synthesis in Video
    Li, Kai
    Xu, Feng
    Wang, Jue
    Dai, Qionghai
    Liu, Yebin
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 57 - 64
  • [33] Data-driven multichannel superresolution with application to video sequences
    Shekarforoush, H
    Chellappa, R
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1999, 16 (03) : 481 - 492
  • [34] Data-driven approaches for social image and video tagging
    Ballan, Lamberto
    Bertini, Marco
    Uricchio, Tiberio
    Del Bimbo, Alberto
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (04) : 1443 - 1468
  • [35] Data-driven background representation method to video surveillance
    Li, Zhihui
    Xia, Yingji
    Qu, Zhaowei
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2017, 34 (02) : 193 - 202
  • [36] Data-driven approaches for social image and video tagging
    Lamberto Ballan
    Marco Bertini
    Tiberio Uricchio
    Alberto Del Bimbo
    Multimedia Tools and Applications, 2015, 74 : 1443 - 1468
  • [37] A Data-Driven Approach for Facial Expression Retargeting in Video
    Li, Kai
    Dai, Qionghai
    Wang, Ruiping
    Liu, Yebin
    Xu, Feng
    Wang, Jue
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (02) : 299 - 310
  • [38] Available Bandwidth Estimation for Adaptive Video Streaming in Mobile Ad Hoc
    W. Castellanos
    J. C. Guerri
    P. Arce
    International Journal of Wireless Information Networks, 2019, 26 : 218 - 229
  • [39] Eliminating bandwidth estimation from adaptive video streaming in wireless networks
    Hwang, Jaehyun
    Lee, Junghwan
    Yoo, Chuck
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 242 - 251
  • [40] On the trajectory of video quality transition in HTTP adaptive video streaming
    Sani, Yusuf
    Mauthe, Andreas
    Edwards, Christopher
    MULTIMEDIA SYSTEMS, 2018, 24 (03) : 327 - 340