Mixture Variational Autoencoder of Boltzmann Machines for Text Processing

被引:0
|
作者
Guilherme Gomes, Bruno [1 ]
Murai, Fabricio [1 ]
Goussevskaia, Olga [1 ]
Couto Da Silva, Ana Paula [1 ]
机构
[1] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
关键词
D O I
10.1007/978-3-030-80599-9_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variational autoencoders (VAEs) have been successfully used to learn good representations in unsupervised settings, especially for image data. More recently, mixture variational autoencoders (MVAEs) have been proposed to enhance the representation capabilities of VAEs by assuming that data can come from a mixture distribution. In this work, we adapt MVAEs for text processing by modeling each component's joint distribution of latent variables and document's bag-of-words as a graphical model known as the Boltzmann Machine, popular in natural language processing for performing well in a number of tasks. The proposed model, MVAE-BM, can learn text representations from unlabeled data without requiring pre-trained word embeddings. We evaluate the representations obtained by MVAE-BM on six corpora w.r.t. the perplexity metric and accuracy on binary and multi-class text classification. Despite its simplicity, our results show that MVAE-BM's performance is on par with or superior to that of modern deep learning techniques such as BERT and RoBERTa. Last, we show that the mapping to mixture components learned by the model lends itself naturally to document clustering.
引用
收藏
页码:46 / 56
页数:11
相关论文
共 50 条
  • [21] Decoupled variational autoencoder with interactive attention for affective text generation
    Chen, Ruijun
    Wang, Jin
    Yu, Liang-Chih
    Zhang, Xuejie
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [22] Neural Variational Inference for Text Processing
    Miao, Yishu
    Yu, Lei
    Blunsom, Phil
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [23] Gaussian Mixture Variational Autoencoder for Semi-Supervised Topic Modeling
    Zhou, Cangqi
    Ban, Hao
    Zhang, Jing
    Li, Qianmu
    Zhang, Yinghua
    IEEE ACCESS, 2020, 8 : 106843 - 106854
  • [24] Sparse Boltzmann Machines with Structure Learning as Applied to Text Analysis
    Chen, Zhourong
    Zhang, Nevin L.
    Yeung, Dit-Yan
    Chen, Peixian
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1805 - 1811
  • [25] Fault Diagnosis of Machines Using Deep Convolutional Beta-Variational Autoencoder
    Dewangan G.
    Maurya S.
    IEEE Transactions on Artificial Intelligence, 2022, 3 (02): : 287 - 296
  • [26] SEMI-SUPERVISED GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER FOR PULSE SHAPE DISCRIMINATION
    Abdulaziz, Abdullah
    Zhou, Jianxin
    Di Fulvio, Angela
    Altmann, Yoann
    McLaughlin, Stephen
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3538 - 3542
  • [27] Gaussian Mixture Variational Autoencoder with Contrastive Learning for Multi-Label Classification
    Bai, Junwen
    Kong, Shufeng
    Gomes, Carla
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [28] Multimodal Weibull Variational Autoencoder for Jointly Modeling Image-Text Data
    Wang, Chaojie
    Chen, Bo
    Xiao, Sucheng
    Wang, Zhengjue
    Zhang, Hao
    Wang, Penghui
    Han, Ning
    Zhou, Mingyuan
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) : 11156 - 11171
  • [29] A Unified Unsupervised Gaussian Mixture Variational Autoencoder for High Dimensional Outlier Detection
    Liao, Weixian
    Guo, Yifan
    Chen, Xuhui
    Li, Pan
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1208 - 1217
  • [30] An Improved Semi-supervised Variational Autoencoder with Gate Mechanism for Text Classification
    Ye, Haiming
    Zhang, Weiwen
    Nie, Mengna
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (10)