Semi-supervised Sequence Learning

Authors
Dai, Andrew M. [1]
Le, Quoc V. [1]
Affiliation
[1] Google Inc, Mountain View, CA 94043 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present two approaches that use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a language model in NLP. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and then predicts the input sequence again. Either algorithm can serve as a "pretraining" step for a later supervised sequence learning algorithm. In other words, the parameters obtained from the pretraining step can be used as a starting point for other supervised training models. In our experiments, we find that long short-term memory recurrent networks, after being pretrained with the two approaches, become more stable to train and generalize better. With pretraining, we were able to achieve strong performance in many classification tasks, such as text classification with IMDB and DBpedia, or image recognition with CIFAR-10.
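The two pretraining objectives in the abstract differ mainly in how training targets are derived from an unlabeled sequence. The following is a minimal sketch of that idea only, not the authors' implementation: the function names, the `<eos>` marker, and the parameter-dictionary layout are all hypothetical and chosen for illustration.

```python
from typing import Dict, List, Sequence, Tuple

EOS = "<eos>"  # hypothetical end-of-sequence marker, not taken from the paper


def language_model_example(tokens: Sequence[str]) -> Tuple[List[str], List[str]]:
    """Next-step prediction: the target at each position is the following token."""
    inputs = list(tokens)
    targets = list(tokens[1:]) + [EOS]
    return inputs, targets


def sequence_autoencoder_example(
    tokens: Sequence[str],
) -> Tuple[List[str], List[str], List[str]]:
    """Autoencoding: the encoder reads the sequence, the decoder must reproduce it."""
    encoder_inputs = list(tokens)
    decoder_inputs = [EOS] + list(tokens)   # decoder starts from the marker
    decoder_targets = list(tokens) + [EOS]  # and must emit the original input
    return encoder_inputs, decoder_inputs, decoder_targets


def init_from_pretrained(
    pretrained: Dict[str, list], supervised: Dict[str, list]
) -> Dict[str, list]:
    """Use pretrained parameters as the starting point for a supervised model.

    Parameters whose names match are copied over; parameters that exist only
    in the supervised model (for example a classifier layer) keep their
    initial values. The name-keyed dictionary is an assumption of this sketch.
    """
    merged = dict(supervised)
    for name, value in pretrained.items():
        if name in merged:
            merged[name] = value
    return merged


if __name__ == "__main__":
    print(language_model_example(["the", "movie", "was", "great"]))
    print(sequence_autoencoder_example(["the", "movie", "was", "great"]))
```

Neither objective needs labels, which is why both can be run on unlabeled text before the supervised classifier is trained from the copied parameters.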
Pages: 9