Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment

Cited by: 35
Authors
Chen, Baoliang [1]
Zhu, Lingyu [1]
Li, Guo [2]
Lu, Fangbo [2]
Fan, Hongfei [2]
Wang, Shiqi [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Kingsoft Cloud, Beijing 100000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Quality assessment; Training; Video recording; Image quality; Streaming media; Nonlinear distortion; Video quality assessment; generalization capability; deep neural networks; temporal aggregation; IMAGE; STATISTICS; DATABASE;
DOI
10.1109/TCSVT.2021.3088505
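Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract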
In this work, we propose a no-reference video quality assessment method that aims to achieve high generalization capability in cross-content, cross-resolution, and cross-frame-rate quality prediction. In particular, we evaluate the quality of a video by learning effective feature representations in the spatial-temporal domain. In the spatial domain, to tackle resolution and content variations, we impose Gaussian distribution constraints on the quality features. The unified distribution can significantly reduce the domain gap between different video samples, resulting in a more generalized quality feature representation. Along the temporal dimension, inspired by the mechanism of visual perception, we propose a pyramid temporal aggregation module that involves short-term and long-term memory to aggregate the frame-level quality. Experiments show that our method outperforms state-of-the-art methods in cross-dataset settings and achieves comparable performance in intra-dataset configurations, demonstrating the high generalization capability of the proposed method. The code is released at https://github.com/Baoliang93/GSTVQA
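To make the idea of aggregating frame-level quality over short and long time spans more concrete, here is a minimal sketch. It is not the module described in the paper or the code in the linked repository: the function names `window_means` and `pyramid_pool`, the power-of-two window sizes, and the choice of taking the worst window at each scale are all assumptions made purely for illustration.

```python
from statistics import mean


def window_means(scores, window):
    """Average consecutive, non-overlapping windows of `window` frames."""
    return [mean(scores[i:i + window]) for i in range(0, len(scores), window)]


def pyramid_pool(frame_scores, levels=3):
    """Hypothetical multi-scale pooling of per-frame quality scores.

    Level k uses windows of 2**k frames. At each level the worst window
    mean is kept, so short windows react to brief quality drops while
    long windows smooth them out. The level results are then averaged.
    This is an illustrative assumption, not the paper's aggregation module.
    """
    if not frame_scores:
        raise ValueError("frame_scores must not be empty")
    level_scores = []
    for level in range(levels):
        window = 2 ** level
        level_scores.append(min(window_means(frame_scores, window)))
    return mean(level_scores)


# Eight frames with one brief quality drop at the third frame.
scores = [4.0, 4.0, 1.0, 4.0, 4.0, 4.0, 4.0, 4.0]
video_score = pyramid_pool(scores)
```

In this sketch the single bad frame dominates the finest level but is diluted at coarser levels, so the pooled score falls between the worst frame score and the plain average.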
Pages: 1903-1916
Number of pages: 14