KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

被引:28
|
作者
Goetz-Hahn, Franz [1 ]
Hosu, Vlad [1 ]
Lin, Hanhe [1 ]
Saupe, Dietmar [1 ]
机构
[1] Univ Konstanz, Dept Comp Sci, D-78464 Constance, Germany
关键词
Streaming media; Distortion; Feature extraction; Quality assessment; Video recording; Training; Cameras; Datasets; deep transfer learning; multi-level spatially-pooled features; video quality assessment; video quality dataset; PREDICTION;
D O I
10.1109/ACCESS.2021.3077642
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods under-perform in-the-wild. Deep learning approaches have had limited success due to the small size and diversity of existing VQA datasets, either artificial or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep-features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman rank-order correlation coefficient (SRCC) performance metric on the commonly used KoNViD-1k in-the-wild benchmark dataset to 0.82. It surpasses the best existing deep-learning model (0.80 SRCC) and hand-crafted feature-based method (0.78 SRCC). We further investigate how alternative approaches perform under different levels of label noise, and dataset size, showing that MLSP-VQA-FF is the overall best method for videos in-the-wild. Finally, we show that the MLSP-VQA models trained on KonVid-150k sets the new state-of-the-art for cross-test performance on KoNViD-1k and LIVE-Qualcomm with a 0.83 and 0.64 SRCC, respectively. For KoNViD-1k this inter-dataset testing outperforms intra-dataset experiments, showing excellent generalization.
引用
收藏
页码:72139 / 72160
页数:22
相关论文
共 50 条
  • [21] An improved model for no-reference image quality assessment and a no-reference video quality assessment model based on frame analysis
    Rohil, Mukesh Kumar
    Gupta, Neetika
    Yadav, Prakash
    SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (01) : 205 - 213
  • [22] MMW-AQA: Multimodal In-the-Wild Dataset for Action Quality Assessment
    Nagai, Takasuke
    Takeda, Shoichiro
    Suzuki, Satoshi
    Seshimo, Hitoshi
    IEEE ACCESS, 2024, 12 : 92062 - 92072
  • [23] DEEP NEURAL NETWORKS FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT
    You, Junyong
    Korhonen, Jari
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2349 - 2353
  • [24] NATURAL MOTION STATISTICS FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT
    Saad, Michele A.
    Bovik, Alan C.
    QOMEX: 2009 INTERNATIONAL WORKSHOP ON QUALITY OF MULTIMEDIA EXPERIENCE, 2009, : 163 - 167
  • [25] NO-REFERENCE VIDEO QUALITY ASSESSMENT USING MPEG ANALYSIS
    Sogaard, Jacob
    Forchhammer, Soren
    Korhonen, Jari
    2013 PICTURE CODING SYMPOSIUM (PCS), 2013, : 161 - 164
  • [26] No-reference model for video quality assessment based on SVM
    Wu, Lili
    Yu, Chunyan
    ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1024 - 1030
  • [27] No-Reference Video Quality Assessment by HEVC Codec Analysis
    Huang, Xin
    Sogaard, Jacob
    Forchhammer, Soren
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [28] A No-Reference Video Quality Assessment Metric Based On ROI
    Jia, Lixiu
    Zhong, Xuefei
    Tu, Yan
    Niu, Wenjuan
    IMAGE QUALITY AND SYSTEM PERFORMANCE XII, 2015, 9396
  • [29] A NO-REFERENCE VIDEO QUALITY ASSESSMENT BASED ON LAPLACIAN PYRAMIDS
    Zhu, Kongfeng
    Hirakawa, Keigo
    Asari, Vijayan
    Saupe, Dietmar
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 49 - 53
  • [30] Semantic Information Oriented No-Reference Video Quality Assessment
    Wu, Wei
    Li, Qinyao
    Chen, Zhenzhong
    Liu, Shan
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) : 204 - 208