A hierarchical topic modelling approach for short text clustering

Cited by: 0
Authors
Pradhan R. [1 ]
Sharma D.K. [1 ]
Affiliation
[1] GLA University, UP, Mathura
Keywords
Dirichlet multinomial mixture; DMM; short text clustering; STT; topic modelling; Twitter topic modelling
DOI
10.1504/IJICT.2022.123161
Abstract
Social networking websites such as Twitter and WeChat provide microblogging services to their users, who post millions of short messages every day. Creating a dataset of these messages helps in solving many non-trivial tasks in computer science, natural language processing, opinion mining, and other domains. Topic modelling is critical for understanding tweets and segregating them into manageable sets. We apply topic modelling approaches to cluster tweets or short text messages into groups, since conventional approaches fail to deal properly with noise, high volume, high dimensionality, and the sparseness of short text. The proposed method addresses the data sparsity of short text by using a hierarchical two-stage clustering procedure. We analysed the results on standard datasets and found that our method performed better than other methods. Copyright © 2022 Inderscience Enterprises Ltd.
Pages: 463-481
Page count: 18
Related papers
50 in total
  • [21] Improving Hierarchical Short Text Clustering through Dominant Feature Learning
    Akritidis, Leonidas
    Alamaniotis, Miltiadis
    Fevgas, Athanasios
    Tsompanopoulou, Panagiota
    Bozanis, Panayiotis
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (05)
  • [22] A Self-Training Approach for Short Text Clustering
    Hadifar, Amir
    Sterckx, Lucas
    Demeester, Thomas
    Develder, Chris
    4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 194 - 199
  • [23] A semi-supervised approach of short text topic modeling using embedded fuzzy clustering for Twitter hashtag recommendation
    Pradipta Kumar Pattanayak
    Rudra Mohan Tripathy
    Sudarsan Padhy
    Discover Sustainability, 5
  • [24] A semi-supervised approach of short text topic modeling using embedded fuzzy clustering for Twitter hashtag recommendation
    Pattanayak, Pradipta Kumar
    Tripathy, Rudra Mohan
    Padhy, Sudarsan
    DISCOVER SUSTAINABILITY, 2024, 5 (01):
  • [25] Hierarchical clustering of text documents
    Lomakina, L. S.
    Rodionov, V. B.
    Surkova, A. S.
    AUTOMATION AND REMOTE CONTROL, 2014, 75 (07) : 1309 - 1315
  • [26] Hierarchical clustering of text documents
    L. S. Lomakina
    V. B. Rodionov
    A. S. Surkova
    Automation and Remote Control, 2014, 75 : 1309 - 1315
  • [27] Effective Seed-Guided Topic Labeling for Dataless Hierarchical Short Text Classification
    Yang, Yi
    Wang, Hongan
    Zhu, Jiaqi
    Shi, Wandong
    Guo, Wenli
    Zhang, Jiawen
    WEB ENGINEERING, ICWE 2021, 2021, 12706 : 271 - 285
  • [28] Topic Detection from Short Text: A Term-based Consensus Clustering Method
    Lin, Hao
    Sun, Bo
    Wu, Junjie
    Xiong, Haitao
    2016 13TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, 2016,
  • [29] Gibbs-BERTopic: A Hybrid Approach for Short Text Topic Modeling
    Zhu, Yan
    Liu, Yueying
    IEEE ACCESS, 2025, 13 : 49162 - 49173
  • [30] Hierarchical Topic Modelling for Knowledge Graphs
    Zhang, Yujia
    Pietrasik, Marcin
    Xu, Wenjie
    Reformat, Marek
    SEMANTIC WEB, ESWC 2022, 2022, 13261 : 270 - 286