Unsupervised Pre-training on Improving the Performance of Neural Network in Regression

Cited by: 0
Authors
Saikia, Pallabi [1]
Vij, Prateek [1 ]
Baruah, Rashmi Dutta [1 ]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Comp Sci & Engn, Gauhati 781039, Assam, India
Keywords
DOI
Not available
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The paper empirically analyses how a pre-training mechanism affects the predictive performance of an Artificial Neural Network (ANN) in regression. The pre-training used here is the same as the training of a Deep Belief Network, where the network is formed by successively stacking Restricted Boltzmann Machines. A set of experiments is performed to understand in which scenarios a pre-trained ANN outperforms a randomly initialised ANN. The results show that the pre-trained model performs better than the randomly initialised ANN in terms of generalisation error and the number of computational units required and, most importantly, is more robust to changes in hyperparameters such as the learning rate and the model architecture. The only cost is the additional time spent in the pre-training phase. Further, the knowledge learned during pre-training, which is stored as the weights of the ANN, is analysed using Hinton diagrams. The analysis gives a clear picture of how pre-training captures some of the hidden characteristics of the data.
Pages: 6
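The abstract describes a DBN-style pipeline: Restricted Boltzmann Machines are trained greedily one layer at a time, and their weights initialise a feed-forward network that is then fine-tuned for regression and compared against a randomly initialised network. Below is a minimal sketch of that pipeline; the layer sizes, learning rates, CD-1 loop and toy data are illustrative assumptions, not values or code from the paper.

# Minimal sketch (not the authors' code): greedy layer-wise RBM pre-training
# of a small feed-forward network, followed by supervised fine-tuning for
# regression. All hyperparameters and the toy data are assumptions.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_rbm(data, n_hidden, lr=0.05, epochs=20):
    """Train one Bernoulli RBM with CD-1; return its weights and hidden bias."""
    n_visible = data.shape[1]
    W = 0.01 * rng.standard_normal((n_visible, n_hidden))
    b_h = np.zeros(n_hidden)
    b_v = np.zeros(n_visible)
    for _ in range(epochs):
        # Positive phase: hidden probabilities and samples given the data.
        p_h = sigmoid(data @ W + b_h)
        h = (rng.random(p_h.shape) < p_h).astype(float)
        # Negative phase: one Gibbs step (CD-1 reconstruction).
        p_v = sigmoid(h @ W.T + b_v)
        p_h_recon = sigmoid(p_v @ W + b_h)
        # Contrastive-divergence updates.
        W += lr * (data.T @ p_h - p_v.T @ p_h_recon) / len(data)
        b_h += lr * (p_h - p_h_recon).mean(axis=0)
        b_v += lr * (data - p_v).mean(axis=0)
    return W, b_h

def pretrain_stack(data, layer_sizes):
    """Stack RBMs DBN-style: each layer's activations feed the next RBM."""
    weights, x = [], data
    for n_hidden in layer_sizes:
        W, b = train_rbm(x, n_hidden)
        weights.append((W, b))
        x = sigmoid(x @ W + b)  # deterministic up-pass to the next layer
    return weights

# Toy regression data (assumed): y = sum of inputs + noise.
X = rng.random((200, 8))
y = X.sum(axis=1, keepdims=True) + 0.1 * rng.standard_normal((200, 1))

# Unsupervised pre-training of two hidden layers.
(W1, b1), (W2, b2) = pretrain_stack(X, layer_sizes=[16, 8])

# Supervised fine-tuning: start the MLP from the RBM weights, add a linear
# output layer, and run plain gradient descent on the squared error.
W3 = 0.01 * rng.standard_normal((8, 1)); b3 = np.zeros(1)
lr = 0.05
for _ in range(500):
    h1 = sigmoid(X @ W1 + b1)
    h2 = sigmoid(h1 @ W2 + b2)
    pred = h2 @ W3 + b3
    err = pred - y                          # gradient of 0.5 * MSE w.r.t. pred
    # Backpropagation through the two sigmoid layers.
    d2 = (err @ W3.T) * h2 * (1 - h2)
    d1 = (d2 @ W2.T) * h1 * (1 - h1)
    W3 -= lr * (h2.T @ err) / len(X); b3 -= lr * err.mean(axis=0)
    W2 -= lr * (h1.T @ d2) / len(X);  b2 -= lr * d2.mean(axis=0)
    W1 -= lr * (X.T @ d1) / len(X);   b1 -= lr * d1.mean(axis=0)

print("final training MSE:", float(np.mean((pred - y) ** 2)))

The fine-tuning loop is identical whether W1 and W2 come from the RBM stack or from random draws; swapping the initialisation is exactly the comparison the abstract reports (generalisation error, required units, and robustness to learning rate and architecture).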
Related papers (50 in total)
  • [31] Effect of pre-training to build a regression model using shallow neural network for semiconductor plasma etch process equipment
    Kwon, Ohyung
    Lee, Nayeon
    Kim, Kangil
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2903 - 2906
  • [32] A Study of Speech Recognition for Kazakh Based on Unsupervised Pre-Training
    Meng, Weijing
    Yolwas, Nurmemet
    SENSORS, 2023, 23 (02)
  • [33] The Reduction of Fully Connected Neural Network Parameters Using the Pre-training Technique
    Kroshchanka, Aliaksandr
    Golovko, Vladimir
    PROCEEDINGS OF THE 11TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS'2021), VOL 2, 2021, : 937 - 941
  • [34] Research on Effectiveness of the Pre-Training Model in Improving the Performance of Spectral Feature Extraction
    Ren, Ju-Xiang
    Liu, Zhong-Bao
    Guang Pu Xue Yu Guang Pu Fen Xi/Spectroscopy and Spectral Analysis, 2024, 44 (12): : 3480 - 3484
  • [35] Unsupervised pre-training of graph transformers on patient population graphs
    Pellegrini, Chantal
    Navab, Nassir
    Kazi, Anees
    MEDICAL IMAGE ANALYSIS, 2023, 89
  • [36] Pre-training Methods for Neural Machine Translation
    Wang, Mingxuan
    Li, Lei
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: TUTORIAL ABSTRACTS, 2021, : 21 - 25
  • [37] Pre-training on dynamic graph neural networks
    Chen, Ke-Jia
    Zhang, Jiajun
    Jiang, Linpu
    Wang, Yunyun
    Dai, Yuxuan
    NEUROCOMPUTING, 2022, 500 : 679 - 687
  • [38] Why Does Unsupervised Pre-training Help Deep Learning?
    Erhan, Dumitru
    Bengio, Yoshua
    Courville, Aaron
    Manzagol, Pierre-Antoine
    Vincent, Pascal
    Bengio, Samy
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 625 - 660
  • [39] Unsupervised Point Cloud Pre-training via Occlusion Completion
    Wang, Hanchen
    Liu, Qi
    Yue, Xiangyu
    Lasenby, Joan
    Kusner, Matt J.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9762 - 9772
  • [40] TRANSFORMER BASED UNSUPERVISED PRE-TRAINING FOR ACOUSTIC REPRESENTATION LEARNING
    Zhang, Ruixiong
    Wu, Haiwei
    Li, Wubo
    Jiang, Dongwei
    Zou, Wei
    Li, Xiangang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6933 - 6937