A hierarchical topic modelling approach for short text clustering

被引:0
|
作者
Pradhan R. [1 ]
Sharma D.K. [1 ]
机构
[1] GLA University, UP, Mathura
关键词
Dirichlet multinomial mixture; DMM; short text clustering; STT; topic modelling; Twitter topic modelling;
D O I
10.1504/IJICT.2022.123161
中图分类号
学科分类号
摘要
Social networking websites such as Twitter and WeChat provide services for microblogging to its users; they post millions of short messages on it every day. Creating a dataset of these messages helps in solving many non-trivial tasks in the domain of computer science, natural language processing, opinion mining, and many more. Topic modelling is critical in understanding the tweets and segregate then into manageable sets. We are bringing the topic modelling approaches to cluster the tweets or short text messages to groups as conventional approaches fail to properly deal with noisy, high volume, dimensionality, and short text sparseness. The method we have proposed can deal with the issue of data sparsity of short text. Our method involves a hierarchical two-stage clustering method. We have analysed the results on standard datasets, and we find that our method had better results as compared to other methods. Copyright © 2022 Inderscience Enterprises Ltd.
引用
收藏
页码:463 / 481
页数:18
相关论文
共 50 条
  • [41] Classifying spam emails using agglomerative hierarchical clustering and a topic-based approach
    Janez-Martino, Francisco
    Alaiz-Rodriguez, Rocio
    Gonzalez-Castro, Victor
    Fidalgo, Eduardo
    Alegre, Enrique
    APPLIED SOFT COMPUTING, 2023, 139
  • [42] Comparing text corpora via topic modelling
    Krasnov, Fedor
    Shvartsman, Mikhail
    Dimentov, Alexander
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (03) : 203 - 216
  • [43] An effective short-text topic modelling with neighbourhood assistance-driven NMF in Twitter
    Shalani Athukorala
    Wathsala Mohotti
    Social Network Analysis and Mining, 2022, 12
  • [44] Short text topic modelling using local and global word-context semantic correlation
    Kinariwala, Supriya
    Deshmukh, Sachin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26411 - 26433
  • [45] Short text topic modelling using local and global word-context semantic correlation
    Supriya Kinariwala
    Sachin Deshmukh
    Multimedia Tools and Applications, 2023, 82 : 26411 - 26433
  • [46] An effective short-text topic modelling with neighbourhood assistance-driven NMF in Twitter
    Athukorala, Shalani
    Mohotti, Wathsala
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [47] Topic Detection from Microblog Based on Text Clustering and Topic Model Analysis
    Huang, Siqi
    Yang, Yitao
    Li, Huakang
    Sun, Guozi
    2014 ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC), 2014, : 88 - 92
  • [48] Causality Model for Text Data with a Hierarchical Topic Structure
    Ogawa, Takuro
    Shimadzu, Hideyasu
    Saga, Ryosuke
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 205 - 210
  • [49] Joint Sentiment Topic Model for objective text clustering
    Sanchez, Octavio
    Sierra, Gerardo
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (04) : 3119 - 3128
  • [50] Clustering Based Topic Events Detection on Text Stream
    Li, Chunshan
    Ye, Yunming
    Zhang, Xiaofeng
    Chu, Dianhui
    Deng, Shengchun
    Xu, Xiaofei
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT 1, 2014, 8397 : 42 - 52