YouTube UGC Dataset for Video Compression Research

被引:119
|
作者
Wang, Yilin [1 ]
Inguva, Sasi [1 ]
Adsumilli, Balu [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
关键词
User Generated Content; Video Compression; Video Quality Assessment;
D O I
10.1109/mmsp.2019.8901772
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Non-professional video, commonly known as User Generated Content (UGC) has become very popular in today's video sharing applications. However, traditional metrics used in compression and quality assessment, like BD-Rate and PSNR, are designed for pristine originals. Thus, their accuracy drops significantly when being applied on non-pristine originals (the majority of UGC). Understanding difficulties for compression and quality assessment in the scenario of UGC is important, but there are few public UGC datasets available for research. This paper introduces a large scale UGC dataset (1500 20 sec video clips) sampled from millions of YouTube videos. The dataset covers popular categories like Gaming, Sports, and new features like High Dynamic Range (HDR). Besides a novel sampling method based on features extracted from encoding, challenges for UGC compression and quality evaluation are also discussed. Shortcomings of traditional reference-based metrics on UGC are addressed. We demonstrate a promising way to evaluate UGC quality by no-reference objective quality metrics, and evaluate the current dataset with three no-reference metrics (Noise, Banding, and SLEEQ).
引用
收藏
页数:5
相关论文
共 50 条
  • [1] SUBJECTIVE QUALITY ASSESSMENT FOR YOUTUBE UGC DATASET
    Yim, Joong Gon
    Wang, Yilin
    Birkbeck, Neil
    Adsumilli, Balu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 131 - 135
  • [2] A SYNTHETIC VIDEO DATASET FOR VIDEO COMPRESSION EVALUATION
    Ma, Di
    Katsenou, Angeliki V.
    Bull, David R.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1094 - 1098
  • [3] BVI-CC: A Dataset for Research on Video Compression and Quality Assessment
    Katsenou, Angeliki
    Zhang, Fan
    Afonso, Mariana
    Dimitrov, Goce
    Bull, David R.
    [J]. FRONTIERS IN SIGNAL PROCESSING, 2022, 2
  • [4] The Research of Domestic Popular Travel Regions Based on UGC Dataset
    Wang, Zhen-Xuan
    Cao, Han
    [J]. 2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 123 - 128
  • [5] Performance Analysis of Object Detection Algorithms on YouTube Video Object Dataset
    Sharma, Chethan
    Singh, Siddharth
    Poornalatha, G.
    Shenoy, Ajitha K. B.
    [J]. ENGINEERING LETTERS, 2021, 29 (02) : 813 - 817
  • [6] Debunking a Video on YouTube as an Authentic Research Experience
    Davidowsky, Philip
    Rogers, Michael
    [J]. PHYSICS TEACHER, 2015, 53 (05): : 304 - 306
  • [7] Discovering popular and persistent tags from YouTube trending video big dataset
    Dokuz, Yesim
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10779 - 10797
  • [8] Discovering popular and persistent tags from YouTube trending video big dataset
    Yesim Dokuz
    [J]. Multimedia Tools and Applications, 2024, 83 : 10779 - 10797
  • [9] Sexism in Focus: An Annotated Dataset of YouTube Comments for Gender Bias Research
    Bertaglia, Thales
    Bartekova, Katarina
    Jongma, Rinske
    McCarthy, Stephen
    Iamnitchi, Adriana
    [J]. PROCEEDINGS OF THE 2023 WORKSHOP ON OPEN CHALLENGES IN ONLINE SOCIAL NETWORKS, OASIS 2023/ 34TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA, HT 2023, 2023, : 22 - 28
  • [10] Approach for Video Classification with Multi-label on YouTube-8M Dataset
    Shin, Kwangsoo
    Jeon, Junhyeong
    Lee, Seungbin
    Lim, Boyoung
    Jeong, Minsoo
    Nang, Jongho
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 317 - 324