Just Recognizable Distortion for Machine Vision Oriented Image and Video Coding

Cited by: 13
Authors
Zhang, Qi [1]
Wang, Shanshe [1]
Zhang, Xinfeng [2]
Ma, Siwei [1]
Gao, Wen [1, 3]
Affiliations
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image and video coding; Machine vision; Deep learning; Just noticeable distortion; DESCRIPTORS; MODELS;
DOI
10.1007/s11263-021-01505-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Machine visual intelligence has exploded in recent years. Large-scale, high-quality image and video datasets significantly empower learning-based machine vision models, especially deep-learning models. However, images and videos are usually compressed before being analyzed in practical situations where transmission or storage is limited, leading to a noticeable performance loss of vision models. In this work, we broadly investigate the impact on the performance of machine vision from image and video coding. Based on the investigation, we propose Just Recognizable Distortion (JRD) to present the maximum distortion caused by data compression that will reduce the machine vision model performance to an unacceptable level. A large-scale JRD-annotated dataset containing over 340,000 images is built for various machine vision tasks, where the factors for different JRDs are studied. Furthermore, an ensemble-learning-based framework is established to predict the JRDs for diverse vision tasks under few- and non-reference conditions, which consists of multiple binary classifiers to improve the prediction accuracy. Experiments prove the effectiveness of the proposed JRD-guided image and video coding to significantly improve compression and machine vision performance. Applying predicted JRD is able to achieve remarkably better machine vision task accuracy and save a large number of bits.
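The abstract does not give the procedure in detail, so the following Python sketch is only an illustration under stated assumptions. It reads the JRD as the highest compression level at which a vision model's output is still acceptable, and it combines per-level binary classifier votes by counting them, a common ordinal-style decoding that may differ from the authors' ensemble. The names `find_jrd`, `combine_binary_votes`, `is_recognized`, and the QP values are hypothetical.

```python
from typing import Callable, Optional, Sequence


def find_jrd(qps: Sequence[int],
             is_recognized: Callable[[int], bool]) -> Optional[int]:
    """Annotation sketch: scan compression levels from mild to strong and
    return the last level before the vision model's output first becomes
    unacceptable. Returns None if even the mildest level fails.

    `is_recognized` is a hypothetical callback that compresses the image
    at the given level, runs the vision model, and checks the result.
    """
    jrd = None
    for qp in sorted(qps):
        if not is_recognized(qp):
            break
        jrd = qp
    return jrd


def combine_binary_votes(qps: Sequence[int],
                         votes: Sequence[bool]) -> Optional[int]:
    """Prediction sketch: each vote answers "is the content still
    recognizable at this level?". Counting the positive votes gives the
    predicted rank of the JRD among the sorted levels (an assumed
    decoding rule, not necessarily the one used in the paper).
    """
    if len(qps) != len(votes):
        raise ValueError("one vote per compression level is required")
    levels = sorted(qps)
    positives = sum(bool(v) for v in votes)
    return levels[positives - 1] if positives > 0 else None


if __name__ == "__main__":
    levels = [22, 27, 32, 37, 42]  # hypothetical quantization parameters
    print(find_jrd(levels, lambda qp: qp <= 32))
    print(combine_binary_votes(levels, [True, True, False, True, False]))
```

Counting votes rather than taking the first negative vote makes the prediction tolerant of a single inconsistent classifier, which is one plausible reason to use several binary classifiers instead of a single multi-class predictor.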
Pages: 2889-2906
Number of pages: 18
Related papers
50 in total
  • [21] Fast synthesized and predicted just noticeable distortion maps for perceptual multiview video coding
    Gao, Yu
    Xiu, Xiaoyu
    Liang, Jie
    Lin, Weisi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (06) : 700 - 707
  • [22] Fast Mode Decision for Multiview Video Coding Based on Just Noticeable Distortion Profile
    Shang, Xiwu
    Wang, Yongfang
    Luo, Lidong
    Zuo, Yifan
    Zhang, Zhaoyang
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (01) : 301 - 320
  • [23] Perceptual stereoscopic video coding using disparity just-noticeable-distortion model
    Jung, Cheolkon
    Fu, Qingtao
    Xue, Fei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 195 - 204
  • [24] DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision
    Lin, Hongbin
    Chen, Bolin
    Zhang, Zhichen
    Lin, Jielian
    Wang, Xu
    Zhao, Tiesong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9205 - 9214
  • [25] On object-oriented video coding using the CNN universal machine
    Stoffels, A
    Roska, T
    Chua, LO
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-FUNDAMENTAL THEORY AND APPLICATIONS, 1996, 43 (11): : 948 - 952
  • [26] Hardware implementation of machine vision systems: image and video processing
    Guillermo Botella
    Carlos García
    Uwe Meyer-Bäse
    EURASIP Journal on Advances in Signal Processing, 2013
  • [27] Hardware implementation of machine vision systems: image and video processing
    Botella, Guillermo
    Garcia, Carlos
    Meyer-Baese, Uwe
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [28] Perceptual Video Coding Based on Saliency and Just Noticeable Distortion for H.265/HEVC
    Wang, Huiqi
    Wang, Lin
    Hu, Xuelin
    Tu, Qin
    Men, Aidong
    2014 INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2014, : 106 - 111
  • [29] PERCEPTUAL MULTIVIEW VIDEO CODING BASED ON FOVEATED JUST NOTICEABLE DISTORTION PROFILE IN DCT DOMAIN
    Shang, Xiwu
    Wang, Yongfang
    Luo, Lidong
    Zhang, Zhaoyang
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1914 - 1917
  • [30] Perceptual Video Coding Scheme Using Just Noticeable Distortion Model Based on Entropy Filter
    Cui, Xin
    Peng, Zongju
    Jiang, Gangyi
    Chen, Fen
    Yu, Mei
    ENTROPY, 2019, 21 (11)