Deep Learning and Video Quality Analysis: Towards A Unified VQA

被引:0
|
作者
Topiwala, P. [1 ]
Dai, W. [1 ]
Pian, J. [1 ]
机构
[1] FastVDO LLC, 3097 Cortona Dr, Melbourne, FL 32940 USA
关键词
video quality assessment; video compression; full reference video quality; no reference video quality;
D O I
10.1117/12.2571309
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Video makes up 80% of internet traffic today and is still rising. Most of it is meant for human consumption. But for 40 years, the video coding industry has been using mean-squared error-based PSNR, effectively the most basic full reference (FR) video quality measure, as the main tool for assessing video quality despite long known poor correlation to subjective video ratings. Moreover, in the encoder, the sum of absolute differences (SAD) is used instead of MSE to save multiplications. Meanwhile, many current video applications such as YouTube do not have access to a pristine reference and have had to develop ad hoc methods to attempt to monitor the volumes of video in their servers in a challenging no reference (NR) setting. For this, they have in part leaned on the Gaussianity of natural scene statistics (NSS), and evaluating how video distortions affect or alter those statistics to create a measure of quality. An entire cottage industry has sprung up to create both full-reference and no-reference video quality assessment (FR-, NR-VQA) measures, that can adequately meet the needs for monitoring and stream selection, in the massive worldwide video services industry. These two fields have so far gone their separate ways, as there seemed no sensible way to bring them under one roof. In this paper, we attempt a first synthesis of FR and NR VQA, which we simply call FastVDO Quality (FVQ). It incorporates all the lessons learned from the Video Multi-Assessment Fusion (VMAF) algorithm introduced by Netflix in 2016, the NSS-based assessment concepts developed by Univ. of Texas and Google to treat the NR case, culminating in the algorithms VIIDEO and SLEEQ, as well as our own research over the past several years in using learning-based methods in VQA. We provide some early indications that this approach can bear fruit for both NR and FR-VQA and may even offer state-of-the-art results in each field.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] VMAF And Variants: Towards A Unified VQA
    Topiwala, P.
    Dai, W.
    Pian, J.
    Biondi, K.
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [2] Towards Unified Deep Learning Model for NSFW Image and Video Captioning
    Ko, Jong-Won
    Hwang, Dong-Hyun
    [J]. ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 57 - 63
  • [3] Deep Learning and Video Quality Analysis
    Topiwala, P.
    Krishnan, M.
    Dai, W.
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
  • [4] A UNIFIED VIDEO SUMMARIZATION FOR VIDEO ANOMALIES THROUGH DEEP LEARNING
    Muchtar, Kahlil
    Munggaran, Muhammad Rizky
    Mahendra, Adhiguna
    Anwar, Khairul
    Lin, Chih-Yang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
  • [5] Deep Learning Techniques in Video Coding and Quality Analysis
    Topiwala, Pankaj
    Krishnan, Madhu
    Dai, Wei
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLI, 2018, 10752
  • [6] Towards a Semantic Video Analysis using Deep Learning and Ontology
    Bornia, Jemai
    Mahmoudi, Sidi Ahmed
    Frihida, Ali
    Manneback, Pierre
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES AND APPLICATIONS (CLOUDTECH), 2018,
  • [7] Towards Unified Data and Lifecycle Management for Deep Learning
    Miao, Hui
    Li, Ang
    Davis, Larry S.
    Deshpande, Amol
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 571 - 582
  • [8] Towards A Unified Deep Model for Trajectory Analysis
    Musleh, Mashaal
    [J]. 30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 762 - 763
  • [9] Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning
    Lubana, Ekdeep Singh
    Dick, Robert P.
    Tanaka, Hidenori
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Deep learning for diplomatic video analysis
    Hongyan Zhao
    Fangxin Zhou
    Huaping Liu
    [J]. Multimedia Tools and Applications, 2020, 79 : 4811 - 4830