KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

被引:28
|
作者
Goetz-Hahn, Franz [1 ]
Hosu, Vlad [1 ]
Lin, Hanhe [1 ]
Saupe, Dietmar [1 ]
机构
[1] Univ Konstanz, Dept Comp Sci, D-78464 Constance, Germany
关键词
Streaming media; Distortion; Feature extraction; Quality assessment; Video recording; Training; Cameras; Datasets; deep transfer learning; multi-level spatially-pooled features; video quality assessment; video quality dataset; PREDICTION;
D O I
10.1109/ACCESS.2021.3077642
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods under-perform in-the-wild. Deep learning approaches have had limited success due to the small size and diversity of existing VQA datasets, either artificial or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep-features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman rank-order correlation coefficient (SRCC) performance metric on the commonly used KoNViD-1k in-the-wild benchmark dataset to 0.82. It surpasses the best existing deep-learning model (0.80 SRCC) and hand-crafted feature-based method (0.78 SRCC). We further investigate how alternative approaches perform under different levels of label noise, and dataset size, showing that MLSP-VQA-FF is the overall best method for videos in-the-wild. Finally, we show that the MLSP-VQA models trained on KonVid-150k sets the new state-of-the-art for cross-test performance on KoNViD-1k and LIVE-Qualcomm with a 0.83 and 0.64 SRCC, respectively. For KoNViD-1k this inter-dataset testing outperforms intra-dataset experiments, showing excellent generalization.
引用
收藏
页码:72139 / 72160
页数:22
相关论文
共 50 条
  • [1] Multi-Dimensional Feature Fusion Network for No-Reference Quality Assessment of In-the-Wild Videos
    Jiang, Jiu
    Wang, Xianpei
    Li, Bowen
    Tian, Meng
    Yao, Hongtai
    SENSORS, 2021, 21 (16)
  • [2] Quality Assessment of In-the-Wild Videos
    Li, Dingquan
    Jiang, Tingting
    Jiang, Ming
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2351 - 2359
  • [3] In-the-wild No-Reference Image Quality Assessment using Deep Convolutional Neural Networks
    Shahreza, Hatef Otroshi
    Amini, Arash
    Behroozi, Hamid
    2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [4] Learning Based Hybrid No-reference Video Quality Assessment of Compressed Videos
    Fazliani, Yasamin
    Andrade, Ernesto
    Shirani, Shahram
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [5] No-Reference Video Shakiness Quality Assessment
    Cui, Zhaoxiong
    Jiang, Tingting
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 396 - 411
  • [6] COME for No-Reference Video Quality Assessment
    Wang, Chunfeng
    Su, Li
    Zhang, Weigang
    IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018, : 232 - 237
  • [7] Predictive no-reference assessment of video quality
    Vega, Maria Torres
    Mocanu, Decebal Constantin
    Stavrou, Stavros
    Liotta, Antonio
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 52 : 20 - 32
  • [8] Predictive no-reference assessment of video quality
    Torres Vega M.
    Mocanu D.C.
    Stavrou S.
    Liotta A.
    Torres Vega, Maria (m.torres.vega@tue.nl), 1600, Elsevier B.V., Netherlands (52): : 20 - 32
  • [9] Unified Quality Assessment of in-the-Wild Videos with Mixed Datasets Training
    Li, Dingquan
    Jiang, Tingting
    Jiang, Ming
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (04) : 1238 - 1257
  • [10] Unified Quality Assessment of in-the-Wild Videos with Mixed Datasets Training
    Dingquan Li
    Tingting Jiang
    Ming Jiang
    International Journal of Computer Vision, 2021, 129 : 1238 - 1257