Semi-supervised Sequence Learning

被引：0

作者：

Dai, Andrew M. ^{[1
]}

Le, Quoc V. ^{[1
]}

机构：

[1] Google Inc, Mountain View, CA 94043 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015) | 2015年 / 28卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present two approaches to use unlabeled data to improve Sequence Learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a language model in NLP. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and predicts the input sequence again. These two algorithms can be used as a "pretraining" algorithm for a later supervised sequence learning algorithm. In other words, the parameters obtained from the pretraining step can then be used as a starting point for other supervised training models. In our experiments, we find that long short term memory recurrent networks after pretrained with the two approaches become more stable to train and generalize better. With pretraining, we were able to achieve strong performance in many classification tasks, such as text classification with IMDB, DBpedia or image recognition in CIFAR-10.

引用

页数：9

共 50 条

[1] Semi-supervised Multitask Learning for Sequence Labeling
Rei, Marek
[J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 2121 - 2130
[2] Semi-supervised learning for classification of protein sequence data
King, Brian R.
Guda, Chittibabu
[J]. SCIENTIFIC PROGRAMMING, 2008, 16 (01) : 5 - 29
[3] Semi-supervised Learning
Adams, Niall
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 : 530 - 530
[4] On semi-supervised learning
Cholaquidis, A.
Fraiman, R.
Sued, M.
[J]. TEST, 2020, 29 (04) : 914 - 937
[5] On semi-supervised learning
A. Cholaquidis
R. Fraiman
M. Sued
[J]. TEST, 2020, 29 : 914 - 937
[6] DeepHeart: Semi-Supervised Sequence Learning for Cardiovascular Risk Prediction
Ballinger, Brandon
Hsieh, Johnson
Singh, Avesh
Sohoni, Nimit
Wang, Jack
Tison, Geoffrey H.
Marcus, Gregory M.
Sanchez, Jose M.
Maguire, Carol
Olgin, Jeffrey E.
Pletcher, Mark J.
[J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2079 - 2086
[7] Learning Semi-Supervised Representation Towards a Unified Optimization Framework for Semi-Supervised Learning
Li, Chun-Guang
Lin, Zhouchen
Zhang, Honggang
Guo, Jun
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2767 - 2775
[8] Semi-supervised learning by disagreement
Zhou, Zhi-Hua
Li, Ming
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (03) : 415 - 439
[9] A survey on semi-supervised learning
Jesper E. van Engelen
Holger H. Hoos
[J]. Machine Learning, 2020, 109 : 373 - 440
[10] Semi-Supervised Incremental Learning
Bouchachia, Abdelhamid
Prossegger, Markus
Duman, Hakan
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,

← 1 2 3 4 5 →