ReForm: Static and Dynamic Resource-Aware DNN Reconfiguration Framework for Mobile Device

被引:0
|
作者
Xu, Zirui [1 ]
Yu, Fuxun [1 ]
Liu, Chenchen [2 ]
Chen, Xiang [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
[2] Clarkson Univ, Potsdam, NY USA
来源
PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC) | 2019年
关键词
D O I
10.1145/3316781.3324696
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Although the Deep Neural Network (DNN) technique has been widely applied in various applications, the DNN-based applications are still too computationally intensive for the resource-constrained mobile devices. Many works have been proposed to optimize the DNN computation performance, but most of them are limited in an algorithmic perspective, ignoring certain computing issues in practical deployment. To achieve the comprehensive DNN performance enhancement in practice, the expected DNN optimization works should closely cooperate with specific hardware and system constraints (i.e. computation capacity, energy cost, memory occupancy, and inference latency). Therefore, in this work, we propose ReForm - a resource-aware DNN optimization framework. Through thorough mobile DNN computing analysis and innovative model reconfiguration schemes (i.e. ADMM based static model fine-tuning, dynamically selective computing), ReForm can efficiently and effectively reconfigure a pre-trained DNN model for practical mobile deployment with regards to various static and dynamic computation resource constraints. Experiments show that ReForm has similar to 3.5xfaster optimization speed than state-of-the-art resource-aware optimization method. Also, ReForm can effective reconfigure a DNN model to different mobile devices with distinct resource constraints. Moreover, ReForm achieves satisfying computation cost reduction with ignorable accuracy drop in both static and dynamic computing scenarios (at most 18% workload, 16.23% latency, 48.63% memory, and 21.5% energy enhancement).
引用
收藏
页数:6
相关论文
共 50 条
  • [21] An OCCI-compliant Framework for Fine-grained Resource-aware Management in Mobile Cloud Networking
    Edmonds, Andy
    Carella, Giuseppe
    Yousaf, Faqir Zarrar
    Goncalves, Carlos
    Bohnert, Thomas Michael
    Metsch, Thijs
    Bellavista, Paolo
    Foschini, Luca
    2016 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC), 2016, : 1306 - 1313
  • [22] A Framework for Resource-aware Online Traffic Classification Using CNN
    Zhang, Wanqian
    Wang, Junxiao
    Chen, Sheng
    Qi, Heng
    Li, Keqiu
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET TECHNOLOGIES (CFI'19), 2019,
  • [23] A resource-aware framework for resource-constrained service-oriented systems
    Newman, Peter
    Kotonya, Gerald
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 47 : 161 - 175
  • [24] RAM: A Resource-Aware DDoS Attack Mitigation Framework in Clouds
    Xing, Fangyuan
    Tong, Fei
    Yang, Jialong
    Cheng, Guang
    He, Shibo
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2024, 12 (04) : 1387 - 1400
  • [25] Dynamically Reconfigurable Resource-Aware Component Framework: Architecture and Concepts
    Orlic, Bojan
    David, Ionut
    Mak, Rudolf H.
    Lukkien, Johan J.
    SOFTWARE ARCHITECTURE, 2011, 6903 : 212 - 215
  • [26] Distributed Online Visual Sensor Network Reconfiguration for Resource-aware Coverage and Task Assignment
    Dieber, Bernhard
    Rinner, Bernhard
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 286 - 291
  • [27] Extending SLURM for Dynamic Resource-Aware Adaptive Batch Scheduling
    Chadha, Mohak
    John, Jophin
    Gerndt, Michael
    2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 223 - 232
  • [28] A Dynamic Resource-Aware Routing Protocol in Resource-Constrained Opportunistic Networks
    Ali, Aref Hassan Kurd
    Lenando, Halikul
    Chaoui, Slim
    Alrfaay, Mohamad
    Tawfeek, Medhat A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 4147 - 4167
  • [29] Resource-aware distributed stream management using dynamic overlays
    Kumar, V
    Cooper, BF
    Cai, ZT
    Eisenhauer, G
    Schwan, K
    25TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2005, : 783 - 792
  • [30] A framework for Resource-Aware Data Accumulation in sparse wireless sensor networks
    Shah, Kunal
    Di Francesco, Mario
    Anastasi, Giuseppe
    Kumar, Mohan
    COMPUTER COMMUNICATIONS, 2011, 34 (17) : 2094 - 2103