Semi-supervised Sequence Learning

Authors
Dai, Andrew M. [1]
Le, Quoc V. [1]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present two approaches that use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a language model in NLP. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and then predicts the input sequence again. Either algorithm can serve as a "pretraining" step for a later supervised sequence learning algorithm: the parameters obtained from pretraining are used as the starting point for supervised training. In our experiments, we find that long short-term memory (LSTM) recurrent networks pretrained with either approach are more stable to train and generalize better. With pretraining, we achieve strong performance on many classification tasks, such as text classification on IMDB and DBpedia and image recognition on CIFAR-10.
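As a rough illustration of the two pretraining objectives and the weight hand-off described in the abstract, the sketch below builds the input/target pairs each objective would train on and copies matching parameters into a supervised model. It is a minimal sketch, not the authors' implementation: the `EOS` marker, the function names, and the dictionary-of-lists parameter format are all assumptions made for the example, and no actual recurrent network is trained.

```python
from typing import Dict, List, Tuple

EOS = "<eos>"  # hypothetical end-of-sequence marker, not taken from the paper


def language_model_pairs(tokens: List[str]) -> Tuple[List[str], List[str]]:
    """Next-step prediction: the target at each position is the following token."""
    return tokens[:-1], tokens[1:]


def autoencoder_pairs(tokens: List[str]) -> Tuple[List[str], List[str], List[str]]:
    """Sequence autoencoder: read the whole input, then predict it again.

    Returns (encoder_input, decoder_input, decoder_target). The decoder input
    is the target shifted right by one step, starting from EOS (an assumption).
    """
    encoder_input = list(tokens)
    decoder_input = [EOS] + list(tokens)
    decoder_target = list(tokens) + [EOS]
    return encoder_input, decoder_input, decoder_target


def init_from_pretrained(
    pretrained: Dict[str, List[float]],
    fresh: Dict[str, List[float]],
) -> Dict[str, List[float]]:
    """Use pretrained values where parameter names match.

    Parameters without a pretrained counterpart (for example a classifier
    head) keep their fresh initial values.
    """
    return {name: list(pretrained.get(name, value)) for name, value in fresh.items()}


if __name__ == "__main__":
    sentence = ["the", "movie", "was", "great"]
    print(language_model_pairs(sentence))
    print(autoencoder_pairs(sentence))
    print(init_from_pretrained({"lstm": [1.0]}, {"lstm": [0.0], "head": [0.5]}))
```

Both objectives need only the raw token sequence, which is why unlabeled text suffices for pretraining; labels enter only in the later supervised stage that starts from the copied parameters.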
Pages: 9