Analyzing the generalizability of the network-based topic emergence identification method

被引:6
|
作者
Jung, Sukhwan [1 ]
Segev, Aviv [1 ]
机构
[1] Univ S Alabama, Dept Comp Sci, 150 Student Serv Dr, Mobile, AL 36608 USA
关键词
Topic evolution; topic prediction; network-based topic modeling; scientometrics; TRENDS;
D O I
10.3233/SW-212951
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic evolution helps the understanding of current research topics and their histories by automatically modeling and detecting the set of shared research fields in academic publications as topics. This paper provides a generalized analysis of the topic evolution method for predicting the emergence of new topics, which can operate on any dataset where the topics are defined as the relationships of their neighborhoods in the past by extrapolating to the future topics. Twenty sample topic networks were built with various fields-of-study keywords as seeds, covering domains such as business, materials, diseases, and computer science from the Microsoft Academic Graph dataset. The binary classifier was trained for each topic network using 15 structural features of emerging and existing topics and consistently resulted in accuracy and F1 over 0.91 for all twenty datasets over the periods of 2000 to 2019. Feature selection showed that the models retained most of the performance with only one-third of the tested features. Incremental learning was tested within the same topic over time and between different topics, which resulted in slight performance improvements in both cases. This indicates there is an underlying pattern to the neighbors of new topics common to research domains, likely beyond the sample topics used in the experiment. The result showed that network-based new topic prediction can be applied to various research domains with different research patterns.
引用
收藏
页码:423 / 439
页数:17
相关论文
共 50 条
  • [1] Generalizability of Neural Network-based Identification of PV in Aerial Images
    Ranalli, Joseph
    Zech, Matthias
    [J]. 2023 IEEE 50TH PHOTOVOLTAIC SPECIALISTS CONFERENCE, PVSC, 2023,
  • [2] DAC: Descendant-aware clustering algorithm for network-based topic emergence prediction
    Jung, Sukhwan
    Segev, Aviv
    [J]. JOURNAL OF INFORMETRICS, 2022, 16 (03)
  • [3] Network-based topic structure visualization
    Jeon, Yeseul
    Park, Jina
    Jin, Ick Hoon
    Chung, Dongjun
    [J]. JOURNAL OF APPLIED STATISTICS, 2024,
  • [4] A neural network-based method for analyzing diffracted wave velocity
    Tao, Junhong
    Zhao, Jingtao
    Sheng, Tongjie
    [J]. Meitiandizhi Yu Kantan/Coal Geology and Exploration, 2024, 52 (09): : 166 - 175
  • [5] A network-based method for visual identification of systemic risks
    Soramaki, Kimmo
    Cook, Samantha
    Laubsch, Alan
    [J]. JOURNAL OF NETWORK THEORY IN FINANCE, 2016, 2 (01): : 67 - 101
  • [6] Neural Network-Based Method for Peptide Identification in Proteomics
    Raczynski, Lech
    Rubel, Tymon
    Zaremba, Krzysztof
    [J]. INFORMATION TECHNOLOGIES IN BIOMEDICINE, ITIB 2012, 2012, 7339 : 437 - 444
  • [7] Robustness and Sensitivity of Network-Based Topic Detection
    Galluccio, Carla
    Magnani, Matteo
    Vega, Davide
    Ragozini, Giancarlo
    Petrucci, Alessandra
    [J]. COMPLEX NETWORKS AND THEIR APPLICATIONS XI, COMPLEX NETWORKS 2022, VOL 2, 2023, 1078 : 259 - 270
  • [8] Siamese Network-Based Supervised Topic Modeling
    Huang, Minghui
    Rao, Yanghui
    Liu, Yuwei
    Xie, Haoran
    Wang, Fu Lee
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4652 - 4662
  • [9] Network-based logistic regression integration method for biomarker identification
    Zhang, Ke
    Geng, Wei
    Zhang, Shuqin
    [J]. BMC SYSTEMS BIOLOGY, 2018, 12
  • [10] A network-based method for the identification of putative genes related to infertility
    Wang, ShaoPeng
    Huang, GuoHua
    Hu, Qinghua
    Zou, Quan
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (11): : 2716 - 2724