TAS: Ternarized Neural Architecture Search for Resource-Constrained Edge Devices

被引:0
|
作者
Loni, Mohammad [1 ]
Mousavi, Hamid [1 ]
Riazati, Mohammad [1 ]
Daneshtalab, Masoud [1 ]
Sjodin, Mikael [1 ]
机构
[1] Malardalen Univ, Sch Innovat Design & Engn, Vasteras, Sweden
关键词
Quantization; Ternary Neural Network; Neural Architecture Search; Embedded Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ternary Neural Networks (TNNs) compress network weights and activation functions into 2-bit representation resulting in remarkable network compression and energy efficiency. However, there remains a significant gap in accuracy between TNNs and full-precision counterparts. Recent advances in Neural Architectures Search (NAS) promise opportunities in automated optimization for various deep learning tasks. Unfortunately, this area is unexplored for optimizing TNNs. This paper proposes TAS, a framework that drastically reduces the accuracy gap between TNNs and their full-precision counterparts by integrating quantization into the network design. We experienced that directly applying NAS to the ternary domain provides accuracy degradation as the search settings are customized for full-precision networks. To address this problem, we propose (i) a new cell template for ternary networks with maximum gradient propagation; and (ii) a novel learnable quantizer that adaptively relaxes the ternarization mechanism from the distribution of the weights and activation functions. Experimental results reveal that TAS delivers 2.64% higher accuracy and approximate to 2.8x memory saving over competing methods with the same bit-width resolution on the CIFAR-10 dataset. These results suggest that TAS is an effective method that paves the way for the efficient design of the next generation of quantized neural networks.
引用
收藏
页码:1115 / 1118
页数:4
相关论文
共 50 条
  • [1] Resource-Constrained Neural Architecture Search on Edge Devices
    Lyu, Bo
    Yuan, Hang
    Lu, Longfei
    Zhang, Yunye
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01): : 134 - 142
  • [2] Neural Architecture Search for Resource-Constrained Internet of Things Devices
    Cardoso-Pereira, Isadora
    Lobo-Pappa, Gisele
    Ramos, Heitor S.
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,
  • [3] One-Shot Sparse Neural Architecture Search for Resource-Constrained Devices
    Song, Shenghui
    Zaech, Jan-Nico
    Heo, Seonyeong
    2024 IEEE 30TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, RTCSA 2024, 2024, : 132 - 133
  • [4] Designing resource-constrained neural networks using neural architecture search targeting embedded devices
    Cassimon, Amber
    Vanneste, Simon
    Bosmans, Stig
    Mercelis, Siegfried
    Hellinckx, Peter
    INTERNET OF THINGS, 2020, 12
  • [5] A holistic approach for resource-constrained neural network architecture search
    Lupion, M.
    Cruz, N. C.
    Ortigosa, E. M.
    Ortigosa, P. M.
    APPLIED SOFT COMPUTING, 2025, 172
  • [6] Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices
    Ghebriout, Mohamed Imed Eddine
    Bouzidi, Halima
    Niar, Smail
    Ouarnoughi, Hamza
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [7] Neural architecture search for resource constrained hardware devices: A survey
    Yang, Yongjia
    Zhan, Jinyu
    Jiang, Wei
    Jiang, Yucheng
    Yu, Antai
    IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2023, 8 (03) : 149 - 159
  • [8] ReCNAS: Resource-Constrained Neural Architecture Search Based on Differentiable Annealing and Dynamic Pruning
    Peng, Cheng
    Li, Yangyang
    Shang, Ronghua
    Jiao, Licheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2805 - 2819
  • [9] Seismic Waveform Inversion Capability on Resource-Constrained Edge Devices
    Manu, Daniel
    Tshakwanda, Petro Mushidi
    Lin, Youzuo
    Jiang, Weiwen
    Yang, Lei
    JOURNAL OF IMAGING, 2022, 8 (12)
  • [10] An Effective Approach for Resource-Constrained Edge Devices in Federated Learning
    Wen, Jun
    Li, Xiusheng
    Chen, Yupeng
    Li, Xiaoli
    Mao, Hang
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024