Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing

被引:0
|
作者
Alman, Josh [1 ]
Liang, Jiehao [2 ]
Song, Zhao [3 ]
Zhang, Ruizhe [4 ]
Zhuo, Danyang [5 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Adobe Res, San Jose, CA USA
[4] Simons Inst Theory Comp, Berkeley, CA USA
[5] Duke Univ, Durham, NC 27706 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last decade, deep neural networks have transformed our society, and they are already widely applied in various machine learning applications. State-of-the-art deep neural networks are becoming larger in size every year to deliver increasing model accuracy, and as a result, model training consumes substantial computing resources and will only consume more in the future. Using current training methods, in each iteration, to process a data point x is an element of R-d in a layer, we need to spend Theta(md) time to evaluate all the m neurons in the layer. This means processing the entire layer takes Theta(nmd) time for n data points. Recent work [Song, Yang and Zhang, NeurIPS 2021] reduces this time per iteration to o(nmd) but requires exponential time to preprocess either the data or the neural network weights, making it unlikely to have practical usage. In this work, we present a new preprocessing method that simply stores the weight-data correlation in a tree data structure in order to quickly, and dynamically detect which neurons fire at each iteration. Our method requires only O(nmd) time in preprocessing and still achieves o(nmd) time per iteration. We complement our new algorithm with a lower bound, proving that assuming a popular conjecture from complexity theory, one could not substantially speed up our algorithm for dynamic detection of firing neurons.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Neural Network for Data Preprocessing in Computed Tomography
    Yamaev, A., V
    Chukalina, M., V
    Nikolaev, D. P.
    Sheshkus, A., V
    Chulichkov, A., I
    AUTOMATION AND REMOTE CONTROL, 2021, 82 (10) : 1752 - 1762
  • [2] Neural Network for Data Preprocessing in Computed Tomography
    A. V. Yamaev
    M. V. Chukalina
    D. P. Nikolaev
    A. V. Sheshkus
    A. I. Chulichkov
    Automation and Remote Control, 2021, 82 : 1752 - 1762
  • [3] Nonstationarity and data preprocessing for neural network predictions of an economic time series
    Virili, F
    Freisleben, B
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL V, 2000, : 129 - 134
  • [4] Post Training Quantization Preprocessing Method of Convolutional Neural Network via Outlier Removal
    Xu P.
    Cao J.
    Chen W.
    Liu S.
    Wang Y.
    Zhang X.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (05): : 808 - 812
  • [5] Possibilities of Seismic Data Preprocessing for Deep Neural Network Analysis
    Kislov, K. V.
    Gravirov, V. V.
    Vinberg, F. E.
    IZVESTIYA-PHYSICS OF THE SOLID EARTH, 2020, 56 (01) : 133 - 144
  • [6] DATA PREPROCESSING METHODS FOR INTERVAL BASED NEURAL NETWORK PREDICTION
    Zorins, Aleksejs
    ENVIRONMENT, TECHNOLOGY, RESOURCES, PROCEEDINGS, 2007, : 211 - 218
  • [7] Validation of Text Data Preprocessing Using a Neural Network Model
    Woo, HoSung
    Kim, JaMee
    Lee, WonGyu
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [8] Possibilities of Seismic Data Preprocessing for Deep Neural Network Analysis
    K. V. Kislov
    V. V. Gravirov
    F. E. Vinberg
    Izvestiya, Physics of the Solid Earth, 2020, 56 : 133 - 144
  • [9] The Next Failure Time Prediction of Escalators via Deep Neural Network with Dynamic Time Warping Preprocessing
    Zhou, Zitong
    Zi, Yanyang
    Xie, Jingsong
    Chen, Jinglong
    An, Tong
    APPLIED SCIENCES-BASEL, 2020, 10 (16):
  • [10] Preprocessing of matrix QCM sensors data for the classification by means of neural network
    Reznik, AM
    Galinskaya, AA
    Dekhtyarenko, OK
    Nowicki, DW
    SENSORS AND ACTUATORS B-CHEMICAL, 2005, 106 (01) : 158 - 163