Acceleration of DNN-Based Video Object Detection Using Temporal Dependency of the Object Size

被引:0
|
作者
Yoo, Jeong Yeop [1 ]
Ko, Jong Hwan [2 ]
机构
[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea
[2] Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon, South Korea
关键词
Deep Learning Acceleration; Video Object Detection; Object Detection Acceleration;
D O I
10.1109/ICTC52510.2021.9620830
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many studies have been proposed to improve the accuracy of Deep Neural Network based object detection. Some of them bring an increase in computation, which is a problem in tasks such as autonomous driving that requires high accuracy and low latency. Feature Pyramid Network (FPN) is a structure commonly used in improving the accuracy of object detection. However, it slows down the inference speed because of the high computation. To accelerate the inference while maintaining the accuracy of FPN, this paper proposes dynamic acceleration of object detection with FPN using temporal dependency of object sizes in a video. We modify FPN to have faster inference speed when targeting certain object sizes. By using the previous object sizes, the target object size is determined. The modified FPN is used in a dynamic manner, which speeds up the inference. In this method, we achieve 20.9% faster inference at the cost of a 0.06 mAP drop on the ImageNet VID validation dataset.
引用
收藏
页码:1182 / 1184
页数:3
相关论文
共 50 条
  • [41] Object-based video segmentation using spatio-temporal energy
    Bao, HQ
    Zhang, ZY
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1260 - 1263
  • [42] Detection of object abandonment using temporal logic
    Medha Bhargava
    Chia-Chih Chen
    M. S. Ryoo
    J. K. Aggarwal
    Machine Vision and Applications, 2009, 20 : 271 - 281
  • [43] Detection of object abandonment using temporal logic
    Bhargava, Medha
    Chen, Chia-Chih
    Ryoo, M. S.
    Aggarwal, J. K.
    MACHINE VISION AND APPLICATIONS, 2009, 20 (05) : 271 - 281
  • [44] Temporal Ensemble SSDLite: Exploiting Temporal Correlation in Video for Accurate Object Detection
    Nakamura, Lukas
    Awano, Hiromitsu
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (07) : 1082 - 1090
  • [45] Object Detection in Drone Video with Temporal Attention Gated Recurrent Unit Based on Transformer
    Zhou, Zihao
    Yu, Xianguo
    Chen, Xiangcheng
    DRONES, 2023, 7 (07)
  • [46] Face Detection and Recognition Based on General Purpose DNN Object Detector
    Ghenescu, Veta
    Mihaescu, Roxana Elena
    Carata, Serban-Vasile
    Ghenescu, Marian Traian
    Barnoviciu, Eduard
    Chindea, Mihai
    2018 13TH INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND TELECOMMUNICATIONS (ISETC), 2018, : 289 - 292
  • [47] Moving Object Detection Based on Temporal Information
    Wang, Zhihu
    Liao, Kai
    Xiong, Jiulong
    Zhang, Qi
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (11) : 1403 - 1407
  • [48] Video Object Detection Guided by Object Blur Evaluation
    Wu, Yujie
    Zhang, Hong
    Li, Yawei
    Yang, Yifan
    Yuan, Ding
    IEEE ACCESS, 2020, 8 : 208554 - 208565
  • [49] Video Temporal Alignment for Object Viewpoint
    Papazoglou, Anestis
    Del Pero, Luca
    Ferrari, Vittorio
    COMPUTER VISION - ACCV 2016, PT IV, 2017, 10114 : 273 - 288
  • [50] Group-of-Picture Mode Acceleration for Efficient Object Detection in Video Streams
    Chen, Kuan-Hung
    IEEE ACCESS, 2023, 11 : 71668 - 71682