Reduced-Order Modeling of Deep Neural Networks

被引:2
|
作者
Gusak, J. [1 ]
Daulbaev, T. [1 ]
Ponomarev, E. [1 ]
Cichocki, A. [1 ]
Oseledets, I [1 ]
机构
[1] Skolkovo Inst Sci & Technol, Moscow, Russia
关键词
acceleration of neural networks; MaxVol; machine learning; component analysis;
D O I
10.1134/S0965542521050109
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We introduce a new method for speeding up the inference of deep neural networks. It is somewhat inspired by the reduced-order modeling techniques for dynamical systems. The cornerstone of the proposed method is the maximum volume algorithm. We demonstrate efficiency on neural networks pre-trained on different datasets. We show that in many practical cases it is possible to replace convolutional layers with much smaller fully-connected layers with a relatively small drop in accuracy.
引用
收藏
页码:774 / 785
页数:12
相关论文
共 50 条