TAS: Ternarized Neural Architecture Search for Resource-Constrained Edge Devices

被引:0
|
作者
Loni, Mohammad [1 ]
Mousavi, Hamid [1 ]
Riazati, Mohammad [1 ]
Daneshtalab, Masoud [1 ]
Sjodin, Mikael [1 ]
机构
[1] Malardalen Univ, Sch Innovat Design & Engn, Vasteras, Sweden
关键词
Quantization; Ternary Neural Network; Neural Architecture Search; Embedded Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ternary Neural Networks (TNNs) compress network weights and activation functions into 2-bit representation resulting in remarkable network compression and energy efficiency. However, there remains a significant gap in accuracy between TNNs and full-precision counterparts. Recent advances in Neural Architectures Search (NAS) promise opportunities in automated optimization for various deep learning tasks. Unfortunately, this area is unexplored for optimizing TNNs. This paper proposes TAS, a framework that drastically reduces the accuracy gap between TNNs and their full-precision counterparts by integrating quantization into the network design. We experienced that directly applying NAS to the ternary domain provides accuracy degradation as the search settings are customized for full-precision networks. To address this problem, we propose (i) a new cell template for ternary networks with maximum gradient propagation; and (ii) a novel learnable quantizer that adaptively relaxes the ternarization mechanism from the distribution of the weights and activation functions. Experimental results reveal that TAS delivers 2.64% higher accuracy and approximate to 2.8x memory saving over competing methods with the same bit-width resolution on the CIFAR-10 dataset. These results suggest that TAS is an effective method that paves the way for the efficient design of the next generation of quantized neural networks.
引用
收藏
页码:1115 / 1118
页数:4
相关论文
共 50 条
  • [31] Lightweight KPABE Architecture Enabled in Mesh Networked Resource-Constrained IoT Devices
    Hijawi, Ula
    Unal, Devrim
    Hamila, Ridha
    Gastli, Adel
    Ellabban, Omar
    IEEE ACCESS, 2021, 9 : 5640 - 5650
  • [32] PCANN: Distributed ANN Architecture for Image Recognition in Resource-Constrained IoT Devices
    Bi, Tianyu
    Liu, Qingzhi
    Ozcelebi, Tanir
    Jarnikov, Dmitri
    Sekulovski, Dragan
    2019 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS (IE 2019), 2019, : 1 - 8
  • [33] You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms
    Luo, Xiangzhong
    Liu, Di
    Kong, Hao
    Huai, Slum
    Chen, Hui
    Liu, Weichen
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 475 - 480
  • [34] Spatially Invariant Convolutional Spiking Neural Network For Resource-Constrained IoT Devices
    Yadav, Chetali
    Reniwal, Bhupendra Singh
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025, : 3005 - 3026
  • [35] Soft Error Reliability Assessment of Neural Networks on Resource-constrained IoT Devices
    Abich, Geancarlo
    Gaya, Jonas
    Reis, Ricardo
    Ost, Luciano
    2020 27TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2020,
  • [36] LiteNet: Lightweight Neural Network for Detecting Arrhythmias at Resource-Constrained Mobile Devices
    He, Ziyang
    Zhang, Xiaoqing
    Cao, Yangjie
    Liu, Zhi
    Zhang, Bo
    Wang, Xiaoyan
    SENSORS, 2018, 18 (04)
  • [37] Achieving High Efficiency: Resource sharing techniques in artificial neural networks for resource-constrained devices
    Gorbounov, Y.
    Chen, H.
    1ST WORKSHOP ON SOLITON THEORY, NONLINEAR DYNAMICS AND MACHINE LEARNING, 2024, 2719
  • [38] Efficient knowledge management for heterogeneous federated continual learning on resource-constrained edge devices
    Yang, Zhao
    Zhang, Shengbing
    Li, Chuxi
    Wang, Miao
    Wang, Haoyang
    Zhang, Meng
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 156 : 16 - 29
  • [39] Encoding semantic awareness in resource-constrained devices
    Preuveneers, Davy
    Berbers, Yolande
    IEEE INTELLIGENT SYSTEMS, 2008, 23 (02) : 26 - 33
  • [40] TreeNet: A Hierarchical Deep Learning Model to Facilitate Edge Intelligence for Resource-Constrained Devices
    Lu, Dong
    Zhai, Yanlong
    Wu, Jianqing
    Shen, Jun
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 525 - 534