Brief Announcement: Gradual Learning of Deep Recurrent Neural Network

Cited by: 2
Authors
Aharoni, Ziv [1]
Rattner, Gal [1]
Permuter, Haim [1]
Affiliation
[1] Ben Gurion Univ Negev, IL-8410501 Beer Sheva, Israel
Keywords
Data-processing-inequality; Machine-learning; Recurrent-neural-networks; Regularization; Training-methods;
DOI
10.1007/978-3-319-94147-9_21
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep Recurrent Neural Networks (RNNs) achieve state-of-the-art results in many sequence-to-sequence modeling tasks. However, deep RNNs are difficult to train and tend to suffer from overfitting. Motivated by the Data Processing Inequality (DPI), we formulate the multi-layered network as a Markov chain and introduce a training method that comprises training the network gradually and using layer-wise gradient clipping. We found that combining our methods with previously introduced regularization and optimization methods improved the results of state-of-the-art architectures on language modeling tasks.
Pages: 274-277
Page count: 4
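
The layer-wise gradient clipping mentioned in the abstract can be illustrated with a minimal sketch. The record does not say whether the paper clips by norm or by value, nor what threshold it uses, so clipping each layer by its own L2 norm is an assumption here. The function name `clip_layerwise`, the layer names, and the `max_norm` value are hypothetical and chosen only for illustration.

```python
import math


def clip_layerwise(grads_by_layer, max_norm):
    """Rescale each layer's gradients so their L2 norm is at most max_norm.

    grads_by_layer: dict mapping a layer name to a flat list of gradient values.
    Returns a new dict; the input is not modified.
    Assumption: norm-based clipping applied per layer, not the paper's exact rule.
    """
    clipped = {}
    for layer, grads in grads_by_layer.items():
        norm = math.sqrt(sum(g * g for g in grads))
        # Scale only when this layer's own norm exceeds the threshold.
        scale = max_norm / norm if norm > max_norm else 1.0
        clipped[layer] = [g * scale for g in grads]
    return clipped


# Hypothetical gradients for a two-layer recurrent network.
grads = {"lstm_1": [3.0, 4.0], "lstm_2": [0.3, 0.4]}
clipped = clip_layerwise(grads, max_norm=1.0)
```

In this sketch the first layer's gradients (norm 5) are scaled down to norm 1, while the second layer's gradients (norm 0.5) pass through unchanged. Global clipping would instead scale both layers by the same factor, which is the difference a per-layer rule is meant to avoid.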