DAC: Descendant-aware clustering algorithm for network-based topic emergence prediction

被引:4
|
作者
Jung, Sukhwan [1 ]
Segev, Aviv [1 ]
机构
[1] Univ S Alabama, Dept Comp Sci, 150 Student Serv Dr, Mobile, AL 36688 USA
关键词
Topic evolution; Topic prediction; Clustering; Topic emergence prediction; Scientometrics; EVOLUTION;
D O I
10.1016/j.joi.2022.101320
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Topic emergence detection aids in pinpointing prominent topics within a given domain, providing practical insights into all interested parties on where to focus the limited resources. This paper employs the network-based topic evolution approach to overcome limitations in text-based topic evolution, providing prospective topic emergence prediction capabilities by representing emer-gent topics by their ancestors. A descendant-aware clustering algorithm is proposed to generate non-exhaustive and overlapping clusters, utilizing the pace of collaborations and structural sim-ilarities between topics with iterative edge removal and addition processes. Over 100 datasets specific to a research topic were extracted from the Microsoft Academic Graph dataset for the experiments, where the proposed algorithm consistently outperformed existing clustering algo-rithms in generating clusters with a higher likelihood of being ancestors to an emergent topic up to three years in the future. Regression-based cluster filtering using five structural cluster features and topic cluster qualities showed that the prediction performance can be enhanced by automat-ically classifying undesirable clusters from previously known data. The results showed that the proposed algorithm can enhance topic emergence predictions on a wide range of research do-mains regardless of their maturities, popularities, and magnitudes without having access to the data in the predicted year, paving a road to prospective predictions on emergent topics.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Analyzing the generalizability of the network-based topic emergence identification method
    Jung, Sukhwan
    Segev, Aviv
    SEMANTIC WEB, 2022, 13 (03) : 423 - 439
  • [2] A novel topic clustering algorithm based on graph neural network for question topic diversity
    Wu, Yongliang
    Wang, Xuejun
    Zhao, Wenbin
    Lv, Xiaofeng
    INFORMATION SCIENCES, 2023, 629 : 685 - 702
  • [3] SNCStream: A Social Network-based Data Stream Clustering Algorithm
    Barddal, Jean Paul
    Gomes, Heitor Murilo
    Enembreck, Fabricio
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 935 - 940
  • [4] A Complex Network-Based Anytime Data Stream Clustering Algorithm
    Barddal, Jean Paul
    Gomes, Heitor Murilo
    Enembreck, Fabricio
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 615 - 622
  • [5] A novel algorithm for network-based prediction of cancer recurrence
    Ruan, Jianhua
    Jahid, Md Jamiul
    Gu, Fei
    Lei, Chengwei
    Huang, Yi-Wen
    Hsu, Ya-Ting
    Mutch, David G.
    Chen, Chun-Liang
    Kirma, Nameer B.
    Huang, Tim H-M
    GENOMICS, 2019, 111 (01) : 17 - 23
  • [6] Network-based topic structure visualization
    Jeon, Yeseul
    Park, Jina
    Jin, Ick Hoon
    Chung, Dongjun
    JOURNAL OF APPLIED STATISTICS, 2025, 52 (02) : 509 - 523
  • [7] A Network Decomposition-based Text Clustering Algorithm for Topic Detection
    Meng, Zuqiang
    Shen, Shimo
    Chen, Qiulian
    MEASUREMENT TECHNOLOGY AND ITS APPLICATION, PTS 1 AND 2, 2013, 239-240 : 1318 - 1323
  • [8] Network-based semisupervised clustering
    Frigau, Luca
    Contu, Giulia
    Mola, Francesco
    Conversano, Claudio
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2021, 37 (02) : 182 - 202
  • [9] FACT: A new neural network-based clustering algorithm for group technology
    Kamal, S
    Burke, LI
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1996, 34 (04) : 919 - 946
  • [10] A Sensor Network-Based Data Stream Clustering Algorithm for Pervasive Computing
    Ye Ning
    Wang Ruchuan
    CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (02): : 255 - 258