Tensor shape search for efficient compression of tensorized data and neural networks

被引:0
|
作者
Solgi, Ryan [1 ,2 ]
He, Zichang [1 ]
Liang, William Jiahua [3 ]
Zhang, Zheng [1 ]
Loaiciga, Hugo A. [2 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
[2] Univ Calif Santa Barbara, Dept Geog, Santa Barbara, CA USA
[3] Univ Penn, Sch Engn Appl Sci, Philadelphia, PA USA
关键词
Data compression; Tensor train decomposition; Tensor compression; Genetic algorithm; Tensorized neural networks; DIMENSIONAL UNCERTAINTY QUANTIFICATION; DECOMPOSITIONS; RANK;
D O I
10.1016/j.asoc.2023.110987
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compressing big data and model parameters via tensor decomposition such as the tensor train (TT) format has gained great success in recent years. The application of tensor compression methods requires the data be high dimensional. However, not all the real-world data primarily are high-dimensional, and sometimes reshaping is necessary before the application of tensor compression methods. Meantime, reordering and reshaping data may affect the efficiency of the compression. This work utilizes tensor reshaping to improve the efficiency of tensor compression using the TT format. An optimization model is proposed that maximizes the space-saving of tensor compression with respect to the shape of a given tensor while the compression error is bounded. The study is narrowed down to the TT decomposition and the TT-SVD algorithm is linked with a genetic algorithm (GA) to find an optimal tensor shape. The proposed method is applied to compress RGB images and a neural network to exemplify its capability. The results of the proposed tensor shape search using the GA are also compared with a purely random search. The results demonstrate that the proposed tensor shape search method significantly improves the space-saving and compression ratio of the data compression and enhances the efficiency of tensorized neural networks using the TT decomposition.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Leveraging Tensor Methods in Neural Architecture Search for the automatic development of lightweight Convolutional Neural Networks
    Dhanaraj, Mayur
    Do, Huyen
    Nair, Dinesh
    Xu, Cong
    BIG DATA IV: LEARNING, ANALYTICS, AND APPLICATIONS, 2022, 12097
  • [32] Energy Efficient Data Compression in Wireless Sensor Networks
    Vidhyapriya, Ranganathan
    Vanathi, Ponnusamy Thangapandian
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2009, 6 (03) : 297 - 303
  • [33] Adaptive Weight Compression for Memory-Efficient Neural Networks
    Ko, Jong Hwan
    Kim, Duckhwan
    Na, Taesik
    Kung, Jaeha
    Mukhopadhyay, Saibal
    PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 199 - 204
  • [34] On the efficient classification of data structures by neural networks
    Frasconi, P
    Gori, M
    Sperduti, A
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 1066 - 1071
  • [35] Designing Real-Time Neural Networks by Efficient Neural Architecture Search
    Bo, Zitong
    Li, Yilin
    Qiao, Ying
    Leng, Chang
    Wang, Hongan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14865 : 62 - 73
  • [36] Tensor based stacked fuzzy neural network for efficient data regression
    Li, Jie
    Hu, Jiale
    Zhao, Guoliang
    Huang, Sharina
    Liu, Yang
    SOFT COMPUTING, 2023, 27 (15) : 11059 - 11059
  • [37] Data-efficient Neural Text Compression with Interactive Learning
    Avinesh, P. V. S.
    Meyer, Christian M.
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2543 - 2554
  • [38] Large-scale and energy-efficient tensorized optical neural networks on III-V-on-silicon MOSCAP platform
    Xiao, Xian
    On, Mehmet Berkay
    Van Vaerenbergh, Thomas
    Liang, Di
    Beausoleil, Raymond G.
    Yoo, S. J. Ben
    APL PHOTONICS, 2021, 6 (12)
  • [39] RankSearch: An Automatic Rank Search towards Optimal Tensor Compression for Video LSTM Networks on Edge
    Man, Changhai
    Chang, Cheng
    Ding, Chenchen
    Shen, Ao
    Ren, Hongwei
    Guan, Ziyi
    Cheng, Yuan
    Luo, Shaobo
    Zhang, Rumin
    Wong, Ngai
    Yu, Hao
    2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [40] Efficient adaptive data compression using Fano Binary Search Trees
    Rueda, L
    Oommen, BJ
    COMPUTER AND INFORMATION SCIENCES - ISCIS 2005, PROCEEDINGS, 2005, 3733 : 768 - 779