Just Recognizable Distortion for Machine Vision Oriented Image and Video Coding

被引:13
|
作者
Zhang, Qi [1 ]
Wang, Shanshe [1 ]
Zhang, Xinfeng [2 ]
Ma, Siwei [1 ]
Gao, Wen [1 ,3 ]
机构
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Image and video coding; Machine vision; Deep learning; Just noticeable distortion; DESCRIPTORS; MODELS;
D O I
10.1007/s11263-021-01505-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine visual intelligence has exploded in recent years. Large-scale, high-quality image and video datasets significantly empower learning-based machine vision models, especially deep-learning models. However, images and videos are usually compressed before being analyzed in practical situations where transmission or storage is limited, leading to a noticeable performance loss of vision models. In this work, we broadly investigate the impact on the performance of machine vision from image and video coding. Based on the investigation, we propose Just Recognizable Distortion (JRD) to present the maximum distortion caused by data compression that will reduce the machine vision model performance to an unacceptable level. A large-scale JRD-annotated dataset containing over 340,000 images is built for various machine vision tasks, where the factors for different JRDs are studied. Furthermore, an ensemble-learning-based framework is established to predict the JRDs for diverse vision tasks under few- and non-reference conditions, which consists of multiple binary classifiers to improve the prediction accuracy. Experiments prove the effectiveness of the proposed JRD-guided image and video coding to significantly improve compression and machine vision performance. Applying predicted JRD is able to achieve remarkably better machine vision task accuracy and save a large number of bits.
引用
收藏
页码:2889 / 2906
页数:18
相关论文
共 50 条
  • [1] Just Recognizable Distortion for Machine Vision Oriented Image and Video Coding
    Qi Zhang
    Shanshe Wang
    Xinfeng Zhang
    Siwei Ma
    Wen Gao
    International Journal of Computer Vision, 2021, 129 : 2889 - 2906
  • [2] Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression
    Zhang, Yun
    Lin, Haoqin
    Sun, Jing
    Zhu, Linwei
    Kwong, Sam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5925 - 5938
  • [3] Just noticeable coding distortion model for HEVC video coding
    Xu, Sheng-Yang
    Yu, Mei
    Jiang, Gang-Yi
    Fang, Shu-Qing
    Shao, Feng
    Peng, Zong-Ju
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2015, 26 (12): : 2381 - 2392
  • [4] Just noticeable distortion model and its applications in video coding
    Yang, XK
    Ling, WS
    Lu, ZK
    Ong, EP
    Yao, SS
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2005, 20 (07) : 662 - 680
  • [5] JUST NOTICEABLE DISTORTION MAP PREDICTION FOR PERCEPTUAL MULTIVIEW VIDEO CODING
    Gao, Yu
    Xiu, Xiaoyu
    Liang, Jie
    Lin, Weisi
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1045 - 1048
  • [6] A survey on just noticeable distortion estimation and its applications in video coding
    Wang, Guoxiang
    Wang, Hongkui
    Li, Hui
    Yu, Li
    Yin, Haibing
    Xu, Haifeng
    Ye, Zhen
    Song, Junfeng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [7] Perceptual video coding with multi-just-noticeable-distortion level
    Wang J.
    Wan S.
    Gong Y.
    Zhao H.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (09): : 11 - 16
  • [8] PERCEPTUAL VIDEO CODING WITH BLOCK-LEVEL STAIRCASE JUST NOTICEABLE DISTORTION
    Zhang, Xinyu
    Wang, Hanli
    Tian, Tao
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4140 - 4144
  • [9] TEMPORAL COLOR JUST NOTICEABLE DISTORTION MODEL AND ITS APPLICATION FOR VIDEO CODING
    Chen, Hao
    Hu, Ruimin
    Hu, Jinhui
    Wang, Zhongyuan
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 713 - 718
  • [10] A just noticeable distortion based rate control algorithm for multiview video coding
    Jiang, G. (jianggangyi@126.com), 1600, Academy Publisher (08):