Speed up Object Detection on Gigapixel-level Images with Patch Arrangement

Cited by: 7
Authors
Fan, Jiahao [1]
Liu, Huabin [1]
Yang, Wenjie [1]
See, John [2]
Zhang, Aixin [1]
Lin, Weiyao [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Heriot Watt Univ, Putrajaya, Malaysia
Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR52688.2022.00461
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the emergence of super-high-resolution (e.g., gigapixel-level) images, efficient object detection on such images has become an important problem. Most existing approaches to efficient object detection on high-resolution images generate local patches where objects may exist and then run detection on every patch independently. However, when the image resolution reaches the gigapixel level, these approaches incur a huge time cost from detecting numerous patches. In contrast, we devise a novel patch arrangement framework for fast object detection on gigapixel-level images. Under this framework, a Patch Arrangement Network (PAN) is proposed to accelerate detection by determining which patches can be packed together into a compact canvas. Specifically, PAN consists of (1) a Patch Filter Module (PFM) and (2) a Patch Packing Module (PPM). PFM filters patch candidates by learning to select patches between two granularities. Subsequently, PPM determines how to pack the remaining patches into a smaller number of canvases and generates an ideal layout of the patches on each canvas. These canvases are fed to the detector to obtain the final results. Experiments show that our method can improve inference speed on gigapixel-level images by 5x while maintaining great performance.
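The packing idea in the abstract can be illustrated with a small sketch. This is not the paper's PAN: the PPM described above is a learned module, whereas the code below swaps in a plain greedy shelf-packing heuristic purely to show what "packing patches into fewer canvases" means. The names `Canvas` and `pack_patches`, the fixed canvas size, and the sort-by-height rule are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Patch = Tuple[int, int]  # (width, height) in pixels; hypothetical representation


@dataclass
class Canvas:
    """A fixed-size canvas filled left to right in horizontal shelves."""
    width: int
    height: int
    # Each placement is (patch_index, x, y, w, h).
    placements: List[Tuple[int, int, int, int, int]] = field(default_factory=list)
    shelf_x: int = 0  # next free x position on the current shelf
    shelf_y: int = 0  # top edge of the current shelf
    shelf_h: int = 0  # height of the tallest patch on the current shelf

    def try_place(self, idx: int, w: int, h: int) -> bool:
        """Place the patch on the current shelf or a new one; False if it does not fit."""
        fits_current = (self.shelf_x + w <= self.width
                        and self.shelf_y + h <= self.height)
        if not fits_current:
            # Try to open a new shelf directly below the current one.
            new_y = self.shelf_y + self.shelf_h
            if w > self.width or new_y + h > self.height:
                return False
            self.shelf_x, self.shelf_y, self.shelf_h = 0, new_y, 0
        self.placements.append((idx, self.shelf_x, self.shelf_y, w, h))
        self.shelf_x += w
        self.shelf_h = max(self.shelf_h, h)
        return True


def pack_patches(patches: List[Patch], canvas_w: int, canvas_h: int) -> List[Canvas]:
    """Greedy first-fit packing: tallest patches first, open a canvas only when needed."""
    order = sorted(range(len(patches)), key=lambda i: patches[i][1], reverse=True)
    canvases: List[Canvas] = []
    for idx in order:
        w, h = patches[idx]
        if w > canvas_w or h > canvas_h:
            raise ValueError(f"patch {idx} ({w}x{h}) is larger than the canvas")
        # any() stops at the first canvas that accepts the patch.
        if not any(c.try_place(idx, w, h) for c in canvases):
            canvas = Canvas(canvas_w, canvas_h)
            canvas.try_place(idx, w, h)
            canvases.append(canvas)
    return canvases


if __name__ == "__main__":
    # Hypothetical patch sizes that survived a filtering step.
    demo = [(400, 300), (400, 300), (400, 300), (400, 300), (200, 100)]
    for n, canvas in enumerate(pack_patches(demo, 800, 600)):
        print(n, canvas.placements)
```

In this sketch the detector would then run once per canvas rather than once per patch, and each stored `(x, y)` offset would be used to map detections back to the source patch. A learned packer, as the abstract describes, could choose layouts that a fixed heuristic like this one cannot.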
Pages: 4643-4651
Number of pages: 9