Deep tensor networks with matrix product operators

被引:0
|
作者
Bojan Žunkovič
机构
[1] University of Ljubljana,Faculty of Computer and Information Science
来源
关键词
Matrix product operators; Time-dependent variational principle; Deep tensor networks; Linear dot-attention;
D O I
暂无
中图分类号
学科分类号
摘要
We introduce deep tensor networks, which are exponentially wide neural networks based on the tensor network representation of the weight matrices. We evaluate the proposed method on the image classification (MNIST, FashionMNIST) and sequence prediction (cellular automata) tasks. In the image classification case, deep tensor networks improve our matrix product state baselines and achieve 0.49% error rate on MNIST and 8.3% error rate on FashionMNIST. In the sequence prediction case, we demonstrate an exponential improvement in the number of parameters compared to the one-layer tensor network methods. In both cases, we discuss the non-uniform and the uniform tensor network models and show that the latter generalises well to different input sizes.
引用
收藏
相关论文
共 50 条
  • [1] Deep tensor networks with matrix product operators
    Zunkovic, Bojan
    QUANTUM MACHINE INTELLIGENCE, 2022, 4 (02)
  • [2] Compressing deep neural networks by matrix product operators
    Gao, Ze-Feng
    Cheng, Song
    He, Rong-Qiang
    Xie, Z. Y.
    Zhao, Hui-Hai
    Lu, Zhong-Yi
    Xiang, Tao
    PHYSICAL REVIEW RESEARCH, 2020, 2 (02):
  • [4] REMARKS ON TENSOR PRODUCT OF OPERATORS
    FAIRES, B
    NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1976, 23 (01): : A148 - A148
  • [7] COMMUTATIVITY OF COONS AND TENSOR PRODUCT OPERATORS
    FARIN, G
    ROCKY MOUNTAIN JOURNAL OF MATHEMATICS, 1992, 22 (02) : 541 - 546
  • [8] Tractability of tensor product linear operators
    Novak, E
    Sloan, IH
    Wozniakowski, H
    JOURNAL OF COMPLEXITY, 1997, 13 (04) : 387 - 418
  • [9] Tensor product of left polaroid operators
    Boasso, Enrico
    Duggal, Bhagwati P.
    ACTA SCIENTIARUM MATHEMATICARUM, 2012, 78 (1-2): : 251 - 264
  • [10] Tensor product of left polaroid operators
    Enrico Boasso
    Bhagwati P. Duggal
    Acta Scientiarum Mathematicarum, 2012, 78 (1-2): : 251 - 264