A Comparative Study of Fuzzy Topic Models and LDA in terms of Interpretability

被引:9
|
作者
Rijcken, Emil [1 ]
Scheepers, Floortje [2 ]
Mosteiro, Pablo [3 ]
Zervanou, Kalliopi [4 ]
Spruit, Marco [5 ]
Kaymak, Uzay [1 ]
机构
[1] Eindhoven Univ Technol, Jheronimus Acad Data Sci, Eindhoven, Netherlands
[2] Univ Med Ctr Utrecht, Psychiat, Utrecht, Netherlands
[3] Univ Med Ctr Utrecht, Informat & Comp Sci, Utrecht, Netherlands
[4] Eindhoven Univ Technol, Ind Engn & Informat Sci, Eindhoven, Netherlands
[5] Leiden Univ, Med Ctr, Publ Hlth & Primary Care, Leiden, Netherlands
关键词
Topic Models; Text Classification; Fuzzy Modelling; Explainable AI; NLP; CLASSIFICATION; TEXT;
D O I
10.1109/SSCI50451.2021.9660139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many domains that employ machine learning models, both high performing and interpretable models are needed. A typical machine learning task is text classification, where models are hardly interpretable. Topic models, used as topic embeddings, carry the potential to better understand the decisions made by text classification algorithms. With this goal in mind, we propose two new fuzzy topic models; FLSA-W and FLSA-V. Both models are derived from the topic model Fuzzy Latent Semantic Analysis (FLSA). After training each model ten times, we use the mean coherence score to compare the different models with the benchmark models Latent Dirichlet Allocation (LDA) and FLSA. Our proposed models generally lead to higher coherence scores and lower standard deviations than the benchmark models. These proposed models are specifically useful as topic embeddings in text classification, since the coherence scores do not drop for a high number of topics, as opposed to the decay that occurs with LDA and FLSA.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] A Comparative Study on Text Representation Models for Topic Detection in Arabic
    Koulali, Rim
    Meziane, Abdelouafi
    COMPUTACION Y SISTEMAS, 2019, 23 (03): : 683 - 691
  • [22] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Jelodar, Hamed
    Wang, Yongli
    Yuan, Chi
    Feng, Xia
    Jiang, Xiahui
    Li, Yanchao
    Zhao, Liang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (11) : 15169 - 15211
  • [23] On-Line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking
    AlSumait, Loulwah
    Barbara, Daniel
    Domeniconi, Carlotta
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 3 - 12
  • [24] A Comparative Study of PCA, LDA and Kernel LDA for Image Classification
    Ye, Fei
    Shi, Zhiping
    Shi, Zhongzhi
    2009 INTERNATIONAL SYMPOSIUM ON UBIQUITOUS VIRTUAL REALITY (ISUVR 2009), 2009, : 51 - 54
  • [25] Interpretability in HealthCare: A Comparative Study of Local Machine Learning Interpretability Techniques
    El Shawi, Radwa
    Sherif, Youssef
    Al-Mallah, Mouaz
    Sakr, Sherif
    2019 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2019, : 275 - 280
  • [26] GOW-LDA: Applying Term Co-occurrence Graph Representation in LDA Topic Models Improvement
    Phu Pham
    Phuc Do
    Ta, Chien D. C.
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 420 - 431
  • [27] The Impact of Variable Selection and Transformation on the Interpretability and Accuracy of Fuzzy Models
    Fuchs, Caro
    Spolaor, Simone
    Kaymak, Uzay
    Nobile, Marco S.
    2022 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (IEEE CIBCB 2022), 2022, : 155 - 162
  • [28] Interpretability in healthcare: A comparative study of local machine learning interpretability techniques
    ElShawi, Radwa
    Sherif, Youssef
    Al-Mallah, Mouaz
    Sakr, Sherif
    COMPUTATIONAL INTELLIGENCE, 2021, 37 (04) : 1633 - 1650
  • [29] The performance-interpretability trade-off: a comparative study of machine learning models
    André Assis
    Jamilson Dantas
    Ermeson Andrade
    Journal of Reliable Intelligent Environments, 2025, 11 (1)
  • [30] Analyzing LDA and NMF Topic Models for Urdu Tweets via Automatic Labeling
    Zoya
    Latif, Seemab
    Shafait, Faisal
    Latif, Rabia
    IEEE ACCESS, 2021, 9 : 127531 - 127547