Deep Learning and Video Quality Analysis: Towards A Unified VQA

被引:0
|
作者
Topiwala, P. [1 ]
Dai, W. [1 ]
Pian, J. [1 ]
机构
[1] FastVDO LLC, 3097 Cortona Dr, Melbourne, FL 32940 USA
关键词
video quality assessment; video compression; full reference video quality; no reference video quality;
D O I
10.1117/12.2571309
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Video makes up 80% of internet traffic today and is still rising. Most of it is meant for human consumption. But for 40 years, the video coding industry has been using mean-squared error-based PSNR, effectively the most basic full reference (FR) video quality measure, as the main tool for assessing video quality despite long known poor correlation to subjective video ratings. Moreover, in the encoder, the sum of absolute differences (SAD) is used instead of MSE to save multiplications. Meanwhile, many current video applications such as YouTube do not have access to a pristine reference and have had to develop ad hoc methods to attempt to monitor the volumes of video in their servers in a challenging no reference (NR) setting. For this, they have in part leaned on the Gaussianity of natural scene statistics (NSS), and evaluating how video distortions affect or alter those statistics to create a measure of quality. An entire cottage industry has sprung up to create both full-reference and no-reference video quality assessment (FR-, NR-VQA) measures, that can adequately meet the needs for monitoring and stream selection, in the massive worldwide video services industry. These two fields have so far gone their separate ways, as there seemed no sensible way to bring them under one roof. In this paper, we attempt a first synthesis of FR and NR VQA, which we simply call FastVDO Quality (FVQ). It incorporates all the lessons learned from the Video Multi-Assessment Fusion (VMAF) algorithm introduced by Netflix in 2016, the NSS-based assessment concepts developed by Univ. of Texas and Google to treat the NR case, culminating in the algorithms VIIDEO and SLEEQ, as well as our own research over the past several years in using learning-based methods in VQA. We provide some early indications that this approach can bear fruit for both NR and FR-VQA and may even offer state-of-the-art results in each field.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] HIDRO-VQA: High Dynamic Range Oracle for Video Quality Assessment
    Saini, Shreshth
    Saha, Avinab
    Bovik, Alan C.
    [J]. 2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 469 - 479
  • [22] BH-VQA: Blind High Frame Rate Video Quality Assessment
    Lu, Wei
    Sun, Wei
    Zhang, Zicheng
    Tu, Danyang
    Min, Xiongkuo
    Zhai, Guangtao
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2501 - 2506
  • [23] A deep learning framework for quality assessment and restoration in video endoscopy
    Ali, Sharib
    Zhou, Felix
    Bailey, Adam
    Braden, Barbara
    East, James E.
    Lu, Xin
    Rittscher, Jens
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 68
  • [24] Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
    Kuang, Qi
    Jin, Xin
    Zhao, Qinping
    Zhou, Bin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2623 - 2634
  • [25] Preserving Low-Quality Video through Deep Learning
    Galteri, Leonardo
    Seidenari, Lorenzo
    Uricchio, Tiberio
    Bertini, Marco
    Del Bimbo, Alberto
    [J]. INTERNATIONAL CONFERENCE FLORENCE HERI-TECH: THE FUTURE OF HERITAGE SCIENCE AND TECHNOLOGIES, 2020, 949
  • [26] Deep learning based on multimedia encoding to enhance video quality
    Challa, Nagendra Panini
    Shanmuganathan, C.
    Shobana, M.
    Deepthi, Ch. Venkata Sasi
    Bharathiraja, N.
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2023, 29 (05) : 475 - 492
  • [27] Deep Learning for Image/Video Compression and Visual Quality Assessment
    Pan, Zhaoqing
    Jeon, Byeungwoo
    Ling, Nam
    Peng, Bo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42483 - 42483
  • [28] Deep Learning for Image/Video Compression and Visual Quality Assessment
    [J]. Multimedia Tools and Applications, 2022, 81 : 42483 - 42483
  • [29] A comparative evaluation of machine learning and deep learning algorithms for question categorization of VQA datasets
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 57829 - 57859
  • [30] Deep learning-based video quality enhancement for the new versatile video coding
    Soulef Bouaafia
    Randa Khemiri
    Seifeddine Messaoud
    Olfa Ben Ahmed
    Fatma Ezahra Sayadi
    [J]. Neural Computing and Applications, 2022, 34 : 14135 - 14149