Locally Embedding Autoencoders: A Semi-Supervised Manifold Learning Approach of Document Representation

被引:13
|
作者
Wei, Chao [1 ]
Luo, Senlin [1 ]
Ma, Xincheng [1 ]
Ren, Hao [1 ]
Zhang, Ji [1 ]
Pan, Limin [1 ]
机构
[1] Beijing Inst Technol, Beijing 10081, Peoples R China
来源
PLOS ONE | 2016年 / 11卷 / 01期
关键词
NETWORK;
D O I
10.1371/journal.pone.0146672
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Topic models and neural networks can discover meaningful low-dimensional latent representations of text corpora; as such, they have become a key technology of document representation. However, such models presume all documents are non-discriminatory, resulting in latent representation dependent upon all other documents and an inability to provide discriminative document representation. To address this problem, we propose a semi-supervised manifold-inspired autoencoder to extract meaningful latent representations of documents, taking the local perspective that the latent representation of nearby documents should be correlative. We first determine the discriminative neighbors set with Euclidean distance in observation spaces. Then, the autoencoder is trained by joint minimization of the Bernoulli cross-entropy error between input and output and the sum of the square error between neighbors of input and output. The results of two widely used corpora show that our method yields at least a 15% improvement in document clustering and a nearly 7% improvement in classification tasks compared to comparative methods. The evidence demonstrates that our method can readily capture more discriminative latent representation of new documents. Moreover, some meaningful combinations of words can be efficiently discovered by activating features that promote the comprehensibility of latent representation.
引用
下载
收藏
页数:20
相关论文
共 50 条
  • [1] Nonnegative low-rank representation based manifold embedding for semi-supervised learning
    Liu, Zhonghua
    Wang, Xiaohong
    Pu, Jiexin
    Wang, Lin
    Zhang, Lin
    KNOWLEDGE-BASED SYSTEMS, 2017, 136 : 121 - 129
  • [2] Semi-supervised elastic manifold embedding with deep learning architecture
    Zhu, R.
    Dornaika, F.
    Ruichek, Y.
    PATTERN RECOGNITION, 2020, 107
  • [3] Accelerating Flexible Manifold Embedding for Scalable Semi-Supervised Learning
    Qiu, Suo
    Nie, Feiping
    Xu, Xiangmin
    Qing, Chunmei
    Xu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (09) : 2786 - 2795
  • [4] Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning
    Valem, Lucas Pascotti
    Pedronette, Daniel Carlos Guimaraes
    Latecki, Longin Jan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2811 - 2826
  • [5] Semi-Supervised Learning by Locally Linear Embedding in Kernel Space
    Liu, Rujie
    Wang, Yuehong
    Baba, Takayuki
    Masumoto, Daiki
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2970 - 2973
  • [6] Semi-supervised representation learning via dual autoencoders for domain adaptation
    Yang, Shuai
    Wang, Hao
    Zhang, Yuhong
    Li, Peipei
    Zhu, Yi
    Hu, Xuegang
    KNOWLEDGE-BASED SYSTEMS, 2020, 190 (190)
  • [7] Co-embedding: a semi-supervised multi-view representation learning approach
    Xiaodong Jia
    Xiao-Yuan Jing
    Xiaoke Zhu
    Ziyun Cai
    Chang-Hui Hu
    Neural Computing and Applications, 2022, 34 : 4437 - 4457
  • [8] Co-embedding: a semi-supervised multi-view representation learning approach
    Jia, Xiaodong
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    Cai, Ziyun
    Hu, Chang-Hui
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (06): : 4437 - 4457
  • [9] Semi-Supervised Manifold Alignment Using Parallel Deep Autoencoders
    Aziz, Fayeem
    Wong, Aaron S. W.
    Chalup, Stephan
    ALGORITHMS, 2019, 12 (09)
  • [10] A semi-supervised SVM for manifold learning
    Wu, Zhili
    Li, Chun-hung
    Zhu, Ji
    Huang, Jian
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 490 - +