Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing

被引:0
|
作者
Alman, Josh [1 ]
Liang, Jiehao [2 ]
Song, Zhao [3 ]
Zhang, Ruizhe [4 ]
Zhuo, Danyang [5 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Adobe Res, San Jose, CA USA
[4] Simons Inst Theory Comp, Berkeley, CA USA
[5] Duke Univ, Durham, NC 27706 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last decade, deep neural networks have transformed our society, and they are already widely applied in various machine learning applications. State-of-the-art deep neural networks are becoming larger in size every year to deliver increasing model accuracy, and as a result, model training consumes substantial computing resources and will only consume more in the future. Using current training methods, in each iteration, to process a data point x is an element of R-d in a layer, we need to spend Theta(md) time to evaluate all the m neurons in the layer. This means processing the entire layer takes Theta(nmd) time for n data points. Recent work [Song, Yang and Zhang, NeurIPS 2021] reduces this time per iteration to o(nmd) but requires exponential time to preprocess either the data or the neural network weights, making it unlikely to have practical usage. In this work, we present a new preprocessing method that simply stores the weight-data correlation in a tree data structure in order to quickly, and dynamically detect which neurons fire at each iteration. Our method requires only O(nmd) time in preprocessing and still achieves o(nmd) time per iteration. We complement our new algorithm with a lower bound, proving that assuming a popular conjecture from complexity theory, one could not substantially speed up our algorithm for dynamic detection of firing neurons.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] Improved Surface Defect Classification from a Simple Convolutional Neural Network by Image Preprocessing and Data Augmentation
    Lopez de la Rosa, Francisco
    Moreno-Salvador, Lucia
    Gomez-Sirvent, Jose L.
    Morales, Rafael
    Sanchez-Reolid, Roberto
    Fernandez-Caballero, Antonio
    BIOINSPIRED SYSTEMS FOR TRANSLATIONAL APPLICATIONS: FROM ROBOTICS TO SOCIAL ENGINEERING, PT II, IWINAC 2024, 2024, 14675 : 23 - 32
  • [42] Integration of Artificial Neural Network Modeling and Hyperspectral Data Preprocessing for Discrimination of Colla Corii Asini Adulteration
    Wang, Huihui
    Wang, Kunlun
    Wang, Biyao
    Lv, Yan
    Tao, Xueheng
    Zhang, Xu
    Tan, Mingqian
    JOURNAL OF FOOD QUALITY, 2018,
  • [43] REAL-TIME, IN-SITU ELLIPSOMETRY SOLUTIONS USING ARTIFICIAL NEURAL-NETWORK PREPROCESSING
    URBAN, FK
    TABET, MF
    THIN SOLID FILMS, 1994, 245 (1-2) : 167 - 173
  • [44] Retraction Note: Simulation of UML graph classification model by using data preprocessing and convolutional neural network
    Fangli Wang
    Optical and Quantum Electronics, 56 (12)
  • [45] Soft Sensor Model Based on Improved Elman Neural Network with Variable Data Preprocessing and Its Application
    Zhu, Hai-bo
    Zhang, Yong
    JOURNAL OF SENSORS, 2018, 2018
  • [46] Preprocessing technique to signal strength data of wireless sensor network for real-time distance estimation
    Cabrera-Mora, Flavio
    Xiao, Jizhong
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1537 - +
  • [47] A Subspace-Aware Kelly's Detector using Reduced Secondary Data with Fast and Slow Time Preprocessing
    Sarac, Ugur Berkay
    Guvensen, Gokhan M.
    2019 IEEE RADAR CONFERENCE (RADARCONF), 2019,
  • [48] A wavelet neural network informed by time-domain signal preprocessing for bearing remaining useful life prediction
    Zhou, Kai
    Tang, Jiong
    APPLIED MATHEMATICAL MODELLING, 2023, 122 : 220 - 241
  • [49] Design of radial basis function neural network classifier realized with the aid of data preprocessing techniques: design and analysis
    Oh, Sung-Kwun
    Kim, Wook-Dong
    Pedrycz, Witold
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2016, 45 (04) : 434 - 454
  • [50] Speech Emotion Recognition method using time-stretching in the Preprocessing Phase and Artificial Neural Network Classifiers
    Govoreanu, Valentin Catalin
    Neghina, Mihai
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 69 - 74