Contextual Information based Network with High-Frequency Feature Fusion for High Frame Rate and Ultra-Low Delay Small-Scale Object Detection

Cited by: 0
Authors
Huang, Dongmei [1 ]
Zhang, Jihan [1 ]
Hu, Tingting [1 ,2 ]
Fuchikami, Ryuji [2 ]
Ikenaga, Takeshi [1 ]
Affiliations
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan
[2] Panasonic Corp, Fukuoka 8128531, Japan
DOI
10.23919/MVA51890.2021.9511387
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High frame rate and ultra-low delay small-scale object detection plays an important role in factory automation because it enables timely and accurate reaction. Although many CNN-based detection methods have been proposed to improve the accuracy of small object detection, given the low resolution and the large gap between the object and the background, it remains difficult to achieve a trade-off between accuracy and speed. To pursue ultra-low delay processing on FPGA, this paper proposes: (A) an IoU and distance based loss function, (B) contextual information with high temporal correlation based parallel detection, and (C) high-frequency feature fusion for enhancing low-bit networks. The proposed methods achieve 45.3% mAP on the test sequences, which is only 0.7% mAP lower than the general method. Meanwhile, the model has been compressed to 1.94% of its original size and reaches a speed of 278 fps on FPGA and 15 fps on GPU.
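The abstract names an IoU and distance based loss function in item (A) but does not give its formula. As a rough illustration of the general idea only, the sketch below assumes a DIoU-style form: one minus IoU, plus the squared distance between box centers normalized by the squared diagonal of the smallest enclosing box. The function name `iou_distance_loss` and the box layout `(x1, y1, x2, y2)` are hypothetical choices for this example, and the paper's actual loss may differ.

```python
def iou_distance_loss(box_a, box_b):
    """DIoU-style loss sketch (an assumption, not the paper's formula).

    Each box is (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union area and IoU.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    iou = inter / (area_a + area_b - inter)

    # Squared distance between the two box centers.
    dx = (ax1 + ax2) / 2.0 - (bx1 + bx2) / 2.0
    dy = (ay1 + ay2) / 2.0 - (by1 + by2) / 2.0
    center_dist_sq = dx * dx + dy * dy

    # Squared diagonal of the smallest box enclosing both boxes.
    enc_w = max(ax2, bx2) - min(ax1, bx1)
    enc_h = max(ay2, by2) - min(ay1, by1)
    diag_sq = enc_w * enc_w + enc_h * enc_h

    return 1.0 - iou + center_dist_sq / diag_sq


# Two non-overlapping boxes: IoU is 0 in both cases, but the
# distance term still separates a near miss from a far miss.
near = iou_distance_loss((0, 0, 1, 1), (2, 0, 3, 1))
far = iou_distance_loss((0, 0, 1, 1), (4, 0, 5, 1))
```

A distance term of this kind is one plausible reason to pair IoU with distance for small objects: a plain IoU loss is flat whenever a small predicted box misses the target entirely, while the normalized center distance still decreases as the prediction moves closer.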
Pages: 5