A High-Performance and Energy-Efficient Photonic Architecture for Multi-DNN Acceleration

被引:0
|
作者
Li, Yuan [1 ]
Louri, Ahmed [1 ]
Karanth, Avinash [2 ]
机构
[1] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20052 USA
[2] Ohio Univ, Sch Elect Engn & Comp Sci, Athens, OH 45701 USA
基金
美国国家科学基金会;
关键词
Accelerator; dataflow; deep neural network; silicon photonics;
D O I
10.1109/TPDS.2023.3327535
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale deep neural network (DNN) accelerators are poised to facilitate the concurrent processing of diverse DNNs, imposing demanding challenges on the interconnection fabric. These challenges encompass overcoming performance degradation and energy increase associated with system scaling while also necessitating flexibility to support dynamic partitioning and adaptable organization of compute resources. Nevertheless, conventional metallic-based interconnects frequently confront inherent limitations in scalability and flexibility. In this paper, we leverage silicon photonic interconnects and adopt an algorithm-architecture co-design approach to develop MDA, a DNN accelerator meticulously crafted to empower high-performance and energy-efficient concurrent processing of diverse DNNs. Specifically, MDA consists of three novel components: 1) a resource allocation algorithm that assigns compute resources to concurrent DNNs based on their computational demands and priorities; 2) a dataflow selection algorithm that determines off-chip and on-chip dataflows for each DNN, with the objectives of minimizing off-chip and on-chip memory accesses, respectively; 3) a flexible silicon photonic network that can be dynamically segmented into sub-networks, each interconnecting the assigned compute resources of a certain DNN while adapting to the communication patterns dictated by the selected on-chip dataflow. Simulation results show that the proposed MDA accelerator outperforms other state-of-the-art multi-DNN accelerators, including PREMA, AI-MT, Planaria, and HDA. MDA accelerator achieves a speedup of 3.6, accompanied by substantial improvements of 7.3x, 12.7x, and 9.2x in energy efficiency, service-level agreement (SLA) satisfaction rate, and fairness, respectively.
引用
收藏
页码:46 / 58
页数:13
相关论文
共 50 条
  • [21] High-Performance Energy-Efficient Multicore Embedded Computing
    Munir, Arslan
    Ranka, Sanjay
    Gordon-Ross, Ann
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (04) : 684 - 700
  • [22] Energy-efficient high-performance parallel and distributed computing
    Samee Ullah Khan
    Pascal Bouvry
    Thomas Engel
    The Journal of Supercomputing, 2012, 60 : 163 - 164
  • [23] A High-Performance and Energy-Efficient CT Reconstruction Algorithm For Multi-Terabyte Datasets
    Jimenez, Edward S.
    Orr, Laurel J.
    Thompson, Kyle R.
    Park, Ryeojin
    2013 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2013,
  • [24] Sectored DRAM: A Practical Energy-Efficient and High-Performance Fine-Grained DRAM Architecture
    Olgun, Ataberk
    Bostanci, F. Nisa
    de Oliveira Junior, Geraldo Francisco
    Tugrul, Yahya Can
    Ul Bera, Rah
    Yaglikci, Abdullah Giray
    Hassan, Hasan
    Ergin, Oguz
    Mutlu, Onur
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 21 (03)
  • [25] Efficient Single- and Multi-DNN Inference Using TensorRT Framework
    Zhdanovskiy, Vyacheslav
    Teplyakov, Lev
    Belyaev, Philipp
    SIXTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2023, 2024, 13072
  • [26] Tools and Methodologies for Designing Energy-Efficient Photonic Networks-on-Chip for High-Performance Chip Multiprocessors
    Chan, Johnnie
    Hendry, Gilbert
    Biberman, Aleksandr
    Bergman, Keren
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 3605 - 3608
  • [27] Serving Multi-DNN Workloads on FPGAs: A Coordinated Architecture, Scheduling, and Mapping Perspective
    Zeng, Shulin
    Dai, Guohao
    Zhang, Niansong
    Yang, Xinhao
    Zhang, Haoyu
    Zhu, Zhenhua
    Yang, Huazhong
    Wang, Yu
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (05) : 1314 - 1328
  • [28] A High-Performance and Energy-Efficient Ternary Multiplier Using CNTFETs
    Abbasian, Erfan
    Sofimowloodi, Sobhan
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (11) : 14365 - 14379
  • [29] Specialized dynamic optimizations for high-performance energy-efficient microarchitecture
    Almog, Y
    Rosner, R
    Schwartz, N
    Schmorak, A
    CGO 2004: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2004, : 137 - 148
  • [30] Energy-Efficient Pipeline Templates for High-Performance Asynchronous Circuits
    Sheikh, Basit Riaz
    Manohar, Rajit
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2011, 7 (04)