TAS: Ternarized Neural Architecture Search for Resource-Constrained Edge Devices

Cited by: 0
Authors
Loni, Mohammad [1]
Mousavi, Hamid [1]
Riazati, Mohammad [1]
Daneshtalab, Masoud [1]
Sjodin, Mikael [1]
Affiliations
[1] Malardalen Univ, Sch Innovat Design & Engn, Vasteras, Sweden
Keywords
Quantization; Ternary Neural Network; Neural Architecture Search; Embedded Systems
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Ternary Neural Networks (TNNs) compress network weights and activation functions into a 2-bit representation, yielding substantial network compression and energy efficiency. However, a significant accuracy gap remains between TNNs and their full-precision counterparts. Recent advances in Neural Architecture Search (NAS) promise automated optimization for various deep learning tasks, yet NAS remains unexplored for optimizing TNNs. This paper proposes TAS, a framework that drastically reduces the accuracy gap between TNNs and their full-precision counterparts by integrating quantization into the network design. We observed that directly applying NAS to the ternary domain degrades accuracy, because the search settings are customized for full-precision networks. To address this problem, we propose (i) a new cell template for ternary networks with maximum gradient propagation; and (ii) a novel learnable quantizer that adaptively relaxes the ternarization mechanism based on the distribution of the weights and activation functions. Experimental results show that TAS delivers 2.64% higher accuracy and approximately 2.8x memory savings over competing methods with the same bit-width resolution on the CIFAR-10 dataset. These results suggest that TAS is an effective method that paves the way for the efficient design of the next generation of quantized neural networks.
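For readers unfamiliar with ternarization, the following is a minimal sketch of a fixed-threshold ternary quantizer. It is not the learnable quantizer proposed in TAS, which the abstract does not specify. The function names, the sample weights, and the threshold heuristic (0.7 times the mean absolute weight, a common choice in earlier ternary-weight work) are all assumptions made for illustration.

```python
def ternarize(weights, scale=0.7):
    """Map each weight to -1, 0, or +1 using a magnitude threshold.

    The threshold rule (scale * mean absolute weight) is an assumed
    heuristic, not the adaptive quantizer described in the abstract.
    """
    if not weights:
        return [], 0.0
    delta = scale * sum(abs(w) for w in weights) / len(weights)
    codes = [0 if abs(w) <= delta else (1 if w > 0 else -1) for w in weights]
    return codes, delta


def packed_size_bytes(num_weights, bits_per_weight=2):
    """Bytes needed to store the weights at the given bit width, rounded up."""
    return (num_weights * bits_per_weight + 7) // 8


# Hypothetical sample weights, chosen only to exercise all three codes.
weights = [0.9, -0.05, 0.4, -0.8, 0.02, -0.3]
codes, delta = ternarize(weights)
print(codes)                                   # ternary codes per weight
print(packed_size_bytes(len(weights)))         # storage at 2 bits per weight
print(packed_size_bytes(len(weights), 32))     # storage at 32 bits per weight
```

Small weights collapse to 0 and the rest keep only their sign, which is why each value fits in 2 bits instead of the 32 bits of a single-precision float.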
Pages: 1115-1118
Number of pages: 4