Streaming Convolutional Neural Networks for End-to-End Learning With Multi-Megapixel Images

被引：32

作者：

Pinckaers, Hans ^{[1
]}

van Ginneken, Bram ^{[1
]}

Litjens, Geert ^{[1
]}

机构：

[1] Radboud Univ Nijmegen, Med Ctr, Diagnost Image Anal Grp, Radboud Inst Hlth Sci, NL-6525 GA Nijmegen, Netherlands

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 03期

关键词：

Memory management; Convolution; Convolutional neural networks; Backpropagation; Streaming media; Task analysis; Training; Deep learning; convolutional neural networks; image classification; high-resolution images;

D O I：

10.1109/TPAMI.2020.3019563

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to memory constraints on current hardware, most convolution neural networks (CNN) are trained on sub-megapixel images. For example, most popular datasets in computer vision contain images much less than a megapixel in size (0.09MP for ImageNet and 0.001MP for CIFAR-10). In some domains such as medical imaging, multi-megapixel images are needed to identify the presence of disease accurately. We propose a novel method to directly train convolutional neural networks using any input image size end-to-end. This method exploits the locality of most operations in modern convolutional neural networks by performing the forward and backward pass on smaller tiles of the image. In this work, we show a proof of concept using images of up to 66-megapixels (8192x8192), saving approximately 50GB of memory per image. Using two public challenge datasets, we demonstrate that CNNs can learn to extract relevant information from these large images and benefit from increasing resolution. We improved the area under the receiver-operating characteristic curve from 0.580 (4MP) to 0.706 (66MP) for metastasis detection in breast cancer (CAMELYON17). We also obtained a Spearman correlation metric approaching state-of-the-art performance on the TUPAC16 dataset, from 0.485 (1MP) to 0.570 (16MP). Code to reproduce a subset of the experiments is available at https://github.com/DIAGNijmegen/StreamingCNN.

引用

页码：1581 / 1590

页数：10

共 50 条

[1] Convolutional Dictionary Learning by End-To-End Training of Iterative Neural Networks
Kofler, Andreas
Wald, Christian
Schaeffter, Tobias
Haltmeier, Markus
Kolbitsch, Christoph
[J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1213 - 1217
[2] Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks
Lu, Yan
Fan, Haoyi
Li, Zuoyong
[J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 191 - 200
[3] CONVOLUTIONAL ANALYSIS OPERATOR LEARNING BY END-TO-END TRAINING OF ITERATIVE NEURAL NETWORKS
Kofler, Andreas
Wald, Christian
Schaeffter, Tobias
Haltmeier, Markus
Kolbitsch, Christoph
[J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
[4] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
Junho Jo
Hyung Il Koo
Jae Woong Soh
Nam Ik Cho
[J]. Multimedia Tools and Applications, 2020, 79 : 32137 - 32150
[5] End-to-End Text Recognition with Convolutional Neural Networks
Wang, Tao
Wu, David J.
Coates, Adam
Ng, Andrew Y.
[J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
[6] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
Jo, Junho
Koo, Hyung Il
Soh, Jae Woong
Cho, Nam Ik
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32137 - 32150
[7] End-to-End Performance Optimization for Training Streaming Convolutional Neural Networks using Billion-Pixel Whole-Slide Images
Tao, Liang-Wei
Hwu, An-Fong
Huang, Yu-Jen
Chen, Chi-Chung
Yeh, Chao-Yuan
Hung, Shih-Hao
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1127 - 1137
[8] An End-to-End Compression Framework Based on Convolutional Neural Networks
Jiang, Feng
Tao, Wen
Liu, Shaohui
Ren, Jie
Guo, Xun
Zhao, Debin
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 3007 - 3018
[9] An End-to-End Compression Framework Based on Convolutional Neural Networks
Tao, Wen
Jiang, Feng
Zhang, Shengping
Ren, Jie
Shi, Wuzhen
Zuo, Wangmeng
Guo, Xun
Zhao, Debin
[J]. 2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 463 - 463
[10] LEARNING ENVIRONMENTAL SOUNDS WITH END-TO-END CONVOLUTIONAL NEURAL NETWORK
Tokozume, Yuji
Harada, Tatsuya
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2721 - 2725

← 1 2 3 4 5 →