A Scalable Architecture for Multi-Class Visual Object Detection

Cited by: 0
Authors
Advani, Siddharth [1]
Tanabe, Yasuki [2]
Irick, Kevin [3]
Sampson, Jack [1]
Narayanan, Vijaykrishnan [1]
Affiliations
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Toshiba, Tokyo, Japan
[3] SiliconScapes LLC, State Coll, PA USA
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
As high-fidelity, small form-factor cameras become increasingly available and affordable, vision-based applications that take advantage of the added visual information will grow in number and variety. The key challenge is for the embedded systems on which the bulk of these applications will be deployed to maintain real-time performance amid the exponential increase in spatial and temporal visual data. For example, a useful vision-based driver assistance system needs to locate and identify critical objects such as pedestrians, other vehicles, potholes, animals, and street signs with latency small enough to allow a human driver to react accordingly. In this work, we propose a digital accelerator architecture for a high-throughput, robust, scalable, and tunable visual object detection pipeline based on Histogram of Oriented Gradients (HOG) features. From a systems perspective, efficacy can be measured in terms of speed, accuracy, energy efficiency, and scalability in performing such visual tasks. Since each application dictates how critical each of these dimensions is, our proposed architecture exposes design-time parameters that can take advantage of domain-specific knowledge while supporting tunability through run-time configuration. To evaluate the effectiveness of our vision accelerator, we map the architecture to a modern FPGA and demonstrate full HD video processing at 30 frames per second (fps) at a conservative 100 MHz clock. Evaluations on a single object class show throughput improvements of 2x and 5x over GPU and multi-threaded CPU implementations, respectively. Furthermore, we provide a pathway for enhanced scalability for the many-class problem and achieve over 20x improvement over an equivalent CPU implementation for 5 object classes.
Pages: 8