A Survey of Topic Modeling in Text Mining

Cited by: 7
Authors
Alghamdi, Rubayyi [1]
Alfalqi, Khalid [1]
Affiliations
[1] Concordia Univ, Informat Syst Secur CIISE, Montreal, PQ, Canada
Keywords
Topic Modeling; Methods of Topic Modeling; Latent semantic analysis (LSA); Probabilistic latent semantic analysis (PLSA); Latent Dirichlet allocation (LDA); Correlated topic model (CTM); Topic Evolution Modeling
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Topic models provide a convenient way to analyze large volumes of unclassified text. A topic is a cluster of words that frequently occur together. Topic modeling can connect words with similar meanings and distinguish between uses of words with multiple meanings. This paper surveys two categories of work within the field of topic modeling. The first covers methods of topic modeling, of which four are considered: latent semantic analysis (LSA), probabilistic latent semantic analysis (PLSA), latent Dirichlet allocation (LDA), and the correlated topic model (CTM). The second covers topic evolution models, which model topics while taking an important factor, time, into account. Within this second category, several models are discussed, including topics over time (TOT), dynamic topic models (DTM), multiscale topic tomography, dynamic topic correlation detection, and detecting topic evolution in scientific literature.
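To make the notion of "words that frequently occur together" concrete, the following minimal Python sketch counts document-level word co-occurrences in a toy corpus. It is not one of the surveyed models (LSA, PLSA, LDA, CTM); it only illustrates the raw co-occurrence signal that such models build on. The function name `cooccurrence_counts`, the toy documents, and the threshold of 2 are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(documents):
    """Count, for each unordered word pair, how many documents contain both words."""
    counts = Counter()
    for doc in documents:
        # Use the set of distinct words so each pair counts once per document.
        vocab = sorted(set(doc.lower().split()))
        for pair in combinations(vocab, 2):
            counts[pair] += 1
    return counts


# Hypothetical toy corpus with two loose themes (finance and biology).
docs = [
    "stock market price",
    "market price trade",
    "gene protein cell",
    "protein cell dna",
]

counts = cooccurrence_counts(docs)

# Word pairs that appear together in at least two documents.
top_pairs = sorted(pair for pair, n in counts.items() if n >= 2)
print(top_pairs)
```

Pairs that recur across documents stay within one theme, while cross-theme pairs never co-occur; topic models turn this kind of signal into a probabilistic or low-rank representation rather than raw counts.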
Pages: 147-153
Number of pages: 7