Fast Image Processing with Fully-Convolutional Networks

被引:186
|
作者
Chen, Qifeng [1 ]
Xu, Jia [1 ]
Koltun, Vladlen [1 ]
机构
[1] Intel Labs, Santa Clara, CA 95054 USA
关键词
D O I
10.1109/ICCV.2017.273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach to accelerating a wide variety of image processing operators. Our approach uses a fully-convolutional network that is trained on input-output pairs that demonstrate the operator's action. After training, the original operator need not be run at all. The trained network operates at full resolution and runs in constant time. We investigate the effect of network architecture on approximation accuracy, runtime, and memory footprint, and identify a specific architecture that balances these considerations. We evaluate the presented approach on ten advanced image processing operators, including multiple variational models, multiscale tone and detail manipulation, photographic style transfer, nonlocal dehazing, and nonphotorealistic stylization. All operators are approximated by the same model. Experiments demonstrate that the presented approach is significantly more accurate than prior approximation schemes. It increases approximation accuracy as measured by PSNR across the evaluated operators by 8.5 dB on the MIT-Adobe dataset (from 27.5 to 36 dB) and reduces DSSIM by a multiplicative factor of 3 compared to the most accurate prior approximation scheme, while being the fastest. We show that our models generalize across datasets and across resolutions, and investigate a number of extensions of the presented approach.
引用
收藏
页码:2516 / 2525
页数:10
相关论文
共 50 条
  • [31] Bayesian Fully Convolutional Networks for Brain Image Registration
    Cui, Kunpeng
    Fu, Panpan
    Li, Yinghao
    Lin, Yusong
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [32] Fully-Convolutional Intensive Feature Flow Neural Network for Text Recognition
    Zhang, Zhao
    Tang, Zemin
    Zhang, Zheng
    Wang, Yang
    Qin, Jie
    Wang, Meng
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1706 - 1713
  • [33] GCI DETECTION FROM RAW SPEECH USING A FULLY-CONVOLUTIONAL NETWORK
    Ardaillon, Luc
    Roebel, Axel
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6739 - 6743
  • [34] Hyper-feature based tracking with the fully-convolutional Siamese network
    Kuai, Yangliu
    Wen, Gongjian
    Li, Dongdong
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 157 - 163
  • [35] Facial image processing with convolutional neural networks
    Garcia, Christophe
    Duffner, Stefan
    PROGRESS IN PATTERN RECOGNITION, 2007, : 97 - +
  • [36] Image Sensing and Processing with Convolutional Neural Networks
    Coleman, Sonya
    Kerr, Dermot
    Zhang, Yunzhou
    SENSORS, 2022, 22 (10)
  • [37] A Unified Framework Integrating Recurrent Fully-Convolutional Networks and Optical Flow for Segmentation of the Left Ventricle in Echocardiography Data
    Jafari, Mohammad H.
    Girgis, Hany
    Liao, Zhibin
    Behnami, Delaram
    Abdi, Amir
    Vaseli, Hooman
    Luong, Christina
    Rohling, Robert
    Gin, Ken
    Tsang, Terasa
    Abolmaesumi, Purang
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, DLMIA 2018, 2018, 11045 : 29 - 37
  • [38] PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing
    Furuta, Ryosuke
    Inoue, Naoto
    Yamasaki, Toshihiko
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1704 - 1719
  • [39] Exploiting Fully Convolutional Neural Networks for Fast Road Detection
    Teodoro Mendes, Caio Cesar
    Fremont, Vincent
    Wolf, Denis Fernando
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 3174 - 3179
  • [40] Fast, Simple Calcium Imaging Segmentation with Fully Convolutional Networks
    Klibisz, Aleksander
    Rose, Derek
    Eicholtz, Matthew
    Blundon, Jay
    Zakharenko, Stanislav
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2017, 10553 : 285 - 293