On the Importance of Spatio-Temporal Learning for Video Quality Assessment

被引:1
|
作者
Fontanel, Dario [1 ,2 ]
Higham, David [2 ]
Vallade, Benoit Quentin Arthur [2 ]
机构
[1] Politecn Torino, Turin, Italy
[2] Amazon Prime Video, London, England
关键词
PREDICTION;
D O I
10.1109/WACVW58289.2023.00053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video quality assessment (VQA) has sparked a lot of interest in the computer vision community, as it plays a critical role in services that provide customers with high quality video content. Due to the lack of high quality reference videos and the difficulties in collecting subjective evaluations, assessing video quality is a challenging and still unsolved problem. Moreover, most of the public research efforts focus only on user-generated content (UGC), making it unclear if reliable solutions can be adopted for assessing the quality of production-related videos. The goal of this work is to assess the importance of spatial and temporal learning for production-related VQA. In particular, it assesses state-of-the-art UGC video quality assessment perspectives on LIVE-APV dataset, demonstrating the importance of learning contextual characteristics from each video frame, as well as capturing temporal correlations between them.
引用
收藏
页码:481 / 487
页数:7
相关论文
共 50 条
  • [31] Spatio-temporal feature learning for enhancing video quality based on screen content characteristics
    Huang, Ziyin
    Chan, Yui-Lam
    Tsang, Sik-Ho
    Kwong, Ngai-Wing
    Lam, Kin-Man
    Ling, Wing-Kuen
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [32] Video Packet Priority Assignment based on Spatio-temporal Perceptual Importance
    Reddy, Sadu Sadhana
    Manasa, K.
    Channappayya, Sumohana S.
    [J]. 2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [33] ANALYSIS OF VIDEO QUALITY INDUCED SPATIO-TEMPORAL SALIENCY SHIFTS
    Wu, Xinbo
    Dong, Zhengyan
    Zhang, Fan
    Rosin, Paul L.
    Liu, Hantao
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1581 - 1585
  • [34] Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement
    Deng, Jianing
    Wang, Li
    Pu, Shiliang
    Zhuo, Cheng
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10696 - 10703
  • [35] Using multiple spatio-temporal features to estimate video quality
    Freitas, Pedro Garcia
    Akamine, Welington Y. L.
    Farias, Mylene C. Q.
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 64 : 1 - 10
  • [36] Learning Spatio-temporal Representation by Channel Aliasing Video Perception
    Lin, Yiqi
    Wang, Jinpeng
    Zhang, Manlin
    Ma, Andy J.
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2317 - 2325
  • [37] Learning Feature Semantic Matching for Spatio-Temporal Video Grounding
    Zhang, Tong
    Fang, Hao
    Zhang, Hao
    Gao, Jialin
    Lu, Xiankai
    Nie, Xiushan
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9268 - 9279
  • [38] STEP: Spatio-Temporal Progressive Learning for Video Action Detection
    Yang, Xitong
    Yang, Xiaodong
    Liu, Ming-Yu
    Xiao, Fanyi
    Davis, Larry
    Kautz, Jan
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 264 - 272
  • [39] Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation
    Qiu, Zhaofan
    Yao, Ting
    Mei, Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (04) : 939 - 949
  • [40] Annoyance of spatio-temporal artifacts in segmentation quality assessment
    Gelasca, EDG
    Ebrahimi, T
    Farias, MCQ
    Carli, MC
    Mitra, SK
    [J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 345 - 348