MiniMalloc: A Lightweight Memory Allocator for Hardware-Accelerated Machine Learning

被引:1
|
作者
Moffitt, Michael D. [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
关键词
memory allocation; hardware acceleration; machine learning; ARCHITECTURE; PACKING;
D O I
10.1145/3623278.3624752
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a new approach to static memory allocation, a key problem that arises in the compilation of machine learning models onto the resources of a specialized hardware accelerator. Our methodology involves a recursive depth-first search that limits exploration to a special class of canonical solutions, dramatically reducing the size of the search space. We also develop a spatial inference technique that exploits this special structure by pruning unpromising partial assignments and backtracking more effectively than otherwise possible. Finally, we introduce a new mechanism capable of detecting and eliminating dominated solutions from consideration. Empirical results demonstrate orders of magnitude improvement in performance as compared to the previous state-of-the-art on many benchmarks, as well as a substantial reduction in library size.
引用
收藏
页码:238 / 252
页数:15
相关论文
共 50 条
  • [41] Recent advances in hardware-accelerated volume rendering
    Ma, KL
    Lum, EB
    Muraki, S
    COMPUTERS & GRAPHICS-UK, 2003, 27 (05): : 725 - 734
  • [42] SIMPLE HARDWARE BUDDY SYSTEM MEMORY ALLOCATOR
    PUTTKAMER, EV
    IEEE TRANSACTIONS ON COMPUTERS, 1975, 24 (10) : 953 - 957
  • [43] Robust Partitioning for Hardware-Accelerated Functional Verification
    Moffitt, Michael D.
    Sustik, Matyas A.
    Villarrubia, Paul G.
    PROCEEDINGS OF THE 48TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2011, : 854 - 859
  • [44] Memory-based hardware-accelerated system for high-speed human detection
    Wicaksono, Indra Bagus
    An, Fengwei
    Mattausch, Hans Juergen
    ADVANCED ROBOTICS, 2014, 28 (05) : 317 - 327
  • [45] Hardware-Accelerated Dual-Split Trees
    Lin, Daqi
    Vasiou, Elena
    Yuksel, Cem
    Kopta, Daniel
    Brunvand, Erik
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2020, 3 (02)
  • [46] Physically Accurate Learning-based Performance Prediction of Hardware-accelerated ML Algorithms
    Esmaeilzadeh, Hadi
    Ghodrati, Soroush
    Kahng, Andrew B.
    Kim, Joon Kyung
    Kinzer, Sean
    Kundu, Sayak
    Mahapatra, Rohan
    Manasi, Susmita Dey
    Sapatnekar, Sachin S.
    Wang, Zhiang
    Zeng, Ziqing
    MLCAD '22: PROCEEDINGS OF THE 2022 ACM/IEEE 4TH WORKSHOP ON MACHINE LEARNING FOR CAD (MLCAD), 2022, : 119 - 126
  • [47] Automated Composition and Execution of Hardware-accelerated Operator Graphs
    Werner, Stefan
    Heinrich, Dennis
    Piper, Jannik
    Groppe, Sven
    Backasch, Rico
    Blochwitz, Christopher
    Pionteck, Thilo
    2015 10TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2015,
  • [48] Hardware-accelerated texture advection for unsteady flow visualization
    Jobard, B
    Erlebacher, G
    Hussaini, MY
    VISUALIZATION 2000, PROCEEDINGS, 2000, : 155 - 162
  • [49] A Hardware-Accelerated Software Platform for Adaptive Radiation Therapy
    Park, Seyoun
    Plishker, William
    Robinson, Adam
    Zaki, George
    Shekhar, Raj
    McNutt, Todd
    Quon, Harry
    Wong, John
    Lee, Junghoon
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, 2015, VOLS 1 AND 2, 2015, 51 : 509 - 512
  • [50] Hardware-accelerated rendering of antialiased shadows with shadow maps
    Brabec, S
    Seidel, HP
    COMPUTER GRAPHICS INTERNATIONAL 2001, PROCEEDINGS, 2001, : 209 - 214