Mixture Variational Autoencoder of Boltzmann Machines for Text Processing

被引:0
|
作者
Guilherme Gomes, Bruno [1 ]
Murai, Fabricio [1 ]
Goussevskaia, Olga [1 ]
Couto Da Silva, Ana Paula [1 ]
机构
[1] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
关键词
D O I
10.1007/978-3-030-80599-9_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variational autoencoders (VAEs) have been successfully used to learn good representations in unsupervised settings, especially for image data. More recently, mixture variational autoencoders (MVAEs) have been proposed to enhance the representation capabilities of VAEs by assuming that data can come from a mixture distribution. In this work, we adapt MVAEs for text processing by modeling each component's joint distribution of latent variables and document's bag-of-words as a graphical model known as the Boltzmann Machine, popular in natural language processing for performing well in a number of tasks. The proposed model, MVAE-BM, can learn text representations from unlabeled data without requiring pre-trained word embeddings. We evaluate the representations obtained by MVAE-BM on six corpora w.r.t. the perplexity metric and accuracy on binary and multi-class text classification. Despite its simplicity, our results show that MVAE-BM's performance is on par with or superior to that of modern deep learning techniques such as BERT and RoBERTa. Last, we show that the mapping to mixture components learned by the model lends itself naturally to document clustering.
引用
收藏
页码:46 / 56
页数:11
相关论文
共 50 条
  • [11] Variational Autoencoder for Semi-Supervised Text Classification
    Xu, Weidi
    Sun, Haoze
    Deng, Chao
    Tan, Ying
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3358 - 3364
  • [12] Text feature extraction based on stacked variational autoencoder
    Che, Lei
    Yang, Xiaoping
    Wang, Liang
    MICROPROCESSORS AND MICROSYSTEMS, 2020, 76 (76)
  • [13] Variational restricted Boltzmann machines to automated anomaly detection
    Konstantinos Demertzis
    Lazaros Iliadis
    Elias Pimenidis
    Panagiotis Kikiras
    Neural Computing and Applications, 2022, 34 : 15207 - 15220
  • [14] Variational restricted Boltzmann machines to automated anomaly detection
    Demertzis, Konstantinos
    Iliadis, Lazaros
    Pimenidis, Elias
    Kikiras, Panagiotis
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15207 - 15220
  • [15] Training quantum Boltzmann machines with the β-variational quantum eigensolver
    Huijgen, Onno
    Coopmans, Luuk
    Najafi, Peyman
    Benedetti, Marcello
    Kappen, Hilbert J.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (02):
  • [16] Differentially Private Recurrent Variational Autoencoder For Text Privacy Preservation
    Wang, Yuyang
    Meng, Xianjia
    Liu, Ximeng
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (05): : 1565 - 1580
  • [17] Variational Autoencoder with Truncated Mixture of Gaussians for Functional Connectivity Analysis
    Zhao, Qingyu
    Honnorat, Nicolas
    Adeli, Ehsan
    Pfefferbaum, Adolf
    Sullivan, Edith V.
    Pohl, Kilian M.
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2019, 2019, 11492 : 867 - 879
  • [18] Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling
    Wang, Prince Zizhuang
    Wang, William Yang
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 284 - 294
  • [19] A Mixture Variational Autoencoder Regression Model for Soft Sensor Application
    Cui L.-L.
    Shen B.-B.
    Ge Z.-Q.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (02): : 398 - 407
  • [20] Text feature extraction based on sparse balanced variational autoencoder
    Che L.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (01): : 169 - 178