VisionAGILE: A Versatile Domain-Specific Accelerator for Computer Vision Tasks

被引:0
|
作者
Zhang, Bingyi [1 ]
Kannan, Rajgopal [2 ]
Busart, Carl [2 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
[2] DEVCOM Army Res Off, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
Computational modeling; Runtime; Data models; Hardware acceleration; Computer vision; Computer architecture; Program processors; Compiler; computer architecture; computer vision; domain-specific accelerator; runtime system;
D O I
10.1109/TPDS.2024.3466891
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The emergence of diverse machine learning (ML) models has led to groundbreaking revolutions in computer vision (CV). These ML models include convolutional neural networks (CNNs), graph neural networks (GNNs), and vision transformers (ViTs). However, existing hardware accelerators designed for CV lack the versatility to support various ML models, potentially limiting their applicability to real-world scenarios. To address this limitation, we introduce VisionAGILE, a domain-specific accelerator designed to be versatile and capable of accommodating a range of ML models, including CNNs, GNNs, and ViTs. VisionAGILE comprises a compiler, a runtime system, and a hardware accelerator. For the hardware accelerator, we develop a novel unified architecture with a flexible data path and memory organization to support the computation primitives in various ML models. Regarding the compiler design, we develop a unified compilation workflow that maps various ML models to the proposed hardware accelerator. The runtime system executes dynamic sparsity exploitation to reduce inference latency and dynamic task scheduling for workload balance. The compiler, the runtime system, and the hardware accelerator work synergistically to support a variety of ML models in CV, enabling low-latency inference. We deploy the hardware accelerator on a state-of-the-art data center FPGA (Xilinx Alveo U250). We evaluate VisionAGILE on diverse ML models for CV, including CNNs, GNNs, hybrid models (comprising both CNN and GNN), and ViTs. The experimental results indicate that, compared with state-of-the-art CPU (GPU) implementations, VisionAGILE achieves a speedup of 81.7 x (4.8x) in terms of latency. Evaluated on standalone CNNs, GNNs, and ViTs, VisionAGILE demonstrates comparable or higher performance with state-of-the-art CNN accelerators, GNN accelerators, and ViT accelerators, respectively.
引用
收藏
页码:2405 / 2422
页数:18
相关论文
共 50 条
  • [1] COMPUTER TASKS FOR MEMORY-IMPAIRED PATIENTS - THE DOMAIN-SPECIFIC LEARNING APPROACH
    GLISKY, EL
    JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 1990, 12 (01) : 65 - 66
  • [2] Domain-Specific Computing Using FPGA Accelerator
    Watanabe, Yasuhiro
    Fujisawa, Hisanori
    Ozawa, Toshihiro
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2017, 53 (05): : 20 - 25
  • [3] DOMAIN-SPECIFIC KNOWLEDGE IN SIMPLE CATEGORIZATION TASKS
    KELEMEN, D
    BLOOM, P
    PSYCHONOMIC BULLETIN & REVIEW, 1994, 1 (03) : 390 - 395
  • [4] Design and Application Space Exploration of a Domain-Specific Accelerator System
    Feng, Fan
    Li, Li
    Wang, Kun
    Fu, Yuxiang
    He, Guoqiang
    Pan, Hongbing
    ELECTRONICS, 2018, 7 (04)
  • [5] Architecture Support for Domain-Specific Accelerator-Rich CMPs
    Ghodrat, Mohammad Ali (ghodrat@cs.ucla.edu.), 1600, Association for Computing Machinery (13):
  • [6] Architecture Support for Domain-Specific Accelerator-Rich CMPs
    Cong, Jason
    Ghodrat, Mohammad Ali
    Gill, Michael
    Grigorian, Beayna
    Reinman, Glenn
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
  • [7] A survey on unsupervised domain adaptation in computer vision tasks
    Sun Q.
    Zhao C.
    Tang Y.
    Qian F.
    Zhongguo Kexue Jishu Kexue/Scientia Sinica Technologica, 2022, 52 (01): : 26 - 54
  • [8] Effects of domain-specific linguistic factors on the difficulty of mathematics tasks
    Bednorz, David
    Kleine, Michael
    vom Hofe, Rudolf
    JOURNAL OF MATHEMATICAL BEHAVIOR, 2024, 75
  • [9] DOLOMITES: Domain-Specific Long-Form Methodical Tasks
    Malaviya, Chaitanya
    Agrawal, Priyanka
    Ganchev, Kuzman
    Srinivasan, Pranesh
    Huot, Fantine
    Berant, Jonathan
    Yatskar, Mark
    Das, Dipanjan
    Lapata, Mirella
    Alberti, Chris
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2025, 13 : 1 - 29
  • [10] A Domain-Specific Accelerator for Ultralow Latency Market Data Distribution System
    Jia, Hao
    Huan, Yuxiang
    Ding, Chen
    Yan, Yulong
    Cui, Jianjun
    Wang, Jiachen
    Cai, Chuhuang
    Xu, Lida
    Zou, Zhuo
    Zheng, Lirong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5465 - 5475