Real-Time Mobile Acceleration of DNNs: From Computer Vision to Medical Applications

被引:6
|
作者
Li, Hongjia [1 ]
Yuan, Geng [1 ]
Niu, Wei [2 ]
Cai, Yuxuan [1 ]
Sun, Mengshu [1 ]
Li, Zhengang [1 ]
Ren, Bin [2 ]
Lin, Xue [1 ]
Wang, Yanzhi [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Coll William & Mary, Williamsburg, VA USA
基金
美国国家科学基金会;
关键词
computer vision; real-time; mobile acceleration;
D O I
10.1145/3394885.3431627
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the growth of mobile vision applications, there is a growing need to break through the current performance limitation of mobile platforms, especially for computationally intensive applications, such as object detection, action recognition, and medical diagnosis. To achieve this goal, we present our unified real-time mobile DNN inference acceleration framework, seamlessly integrating hardware-friendly, structured model compression with mobile-targeted compiler optimizations. We aim at an unprecedented, real-time performance of such large-scale neural network inference on mobile devices. A fine-grained block-based pruning scheme is proposed to be universally applicable to all types of DNN layers, such as convolutional layers with different kernel sizes and fully connected layers. Moreover, it is also successfully extended to 3D convolutions. With the assist of our compiler optimizations, the fine-grained block-based sparsity is fully utilized to achieve high model accuracy and high hardware acceleration simultaneously. To validate our framework, three representative fields of applications are implemented and demonstrated, object detection, activity detection, and medical diagnosis. All applications achieve real-time inference using an off-the-shelf smartphone, outperforming the representative mobile DNN inference acceleration frameworks by up to 6.7x in speed. The demonstrations of these applications can be found in the following link: https://bit.ly/39lWpYu.
引用
收藏
页码:581 / 586
页数:6
相关论文
共 50 条
  • [1] Real-time computer vision platform for mobile robot applications
    Natl Inst of Standards and, Technology, Gaithersburg, United States
    Real Time Imaging, 5 (315-327):
  • [2] A real-time computer vision platform for mobile robot applications
    Szabo, S
    Coombs, D
    Herman, M
    Camus, T
    Liu, HC
    REAL-TIME IMAGING, 1996, 2 (05) : 315 - 327
  • [3] Real-time computer vision system for mobile robot
    Persa, S
    Jonker, P
    INTELLIGENT ROBOTS AND COMPUTER VISION XX: ALGORITHMS, TECHNIQUES, AND ACTIVE VISION, 2001, 4572 : 105 - 114
  • [4] Multi-Tenant Mobile Offloading Systems for Real-Time Computer Vision Applications
    Fang, Zhou
    Lin, Jeng-Hau
    Srivastava, Mani B.
    Gupta, Rajesh K.
    ICDCN '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, 2019, : 21 - 30
  • [5] AN ADVANCED COMPUTER FOR REAL-TIME MEDICAL APPLICATIONS
    CROUSE, L
    WIEDERHOLD, G
    COMPUTERS AND BIOMEDICAL RESEARCH, 1969, 2 (06): : 582 - +
  • [6] A Real-time Energy-Efficient Superpixel Hardware Accelerator for Mobile Computer Vision Applications
    Hong, Injoon
    Clemons, Jason
    Venkatesan, Rangharajan
    Frosio, Iuri
    Khailany, Brucek
    Keckler, Stephen W.
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [7] Real-Time Computer Vision with OpenCV
    Pulli, Kari
    Baksheev, Anatoly
    Kornyakov, Kirill
    Eruhimov, Victor
    COMMUNICATIONS OF THE ACM, 2012, 55 (06) : 61 - 69
  • [8] CV-SDF - A model for real-time computer vision applications
    Stichling, D
    Kleinjohann, B
    SIXTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2002, : 325 - 329
  • [9] Energy-efficient Real-time Computer Vision Applications in Practice
    Kramer, Mark A. M.
    Roth, Peter M.
    REAL-TIME PROCESSING OF IMAGE, DEPTH, AND VIDEO INFORMATION 2024, 2024, 13000
  • [10] A real-time high performance edge detector for computer vision applications
    Alzahrani, F
    Chen, T
    PROCEEDINGS OF THE ASP-DAC '97 - ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE 1997, 1996, : 671 - 672