Dynamic Hybrid Clustering of Bioinformatics by Incorporating Text Mining and Citation Analysis

Authors
Janssens, Frizo [1]
Glanzel, Wolfgang [2]
De Moor, Bart [1]
Affiliations
[1] Katholieke Univ Leuven, Elect Engn ESAT, Kasteelpk Arenberg 10, B-3001 Leuven, Belgium
[2] Katholieke Univ Leuven, Steunpunt O&O Indicatoren, B-3000 Louvain, Belgium
Keywords
Fisher's inverse chi-square method; cluster chains
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To unravel the concept structure and dynamics of the bioinformatics field, we analyze a set of 7401 publications from the Web of Science and MEDLINE databases, publication years 1981-2004. For delineating this complex, interdisciplinary field, a novel bibliometric retrieval strategy is used. Given that the performance of unsupervised clustering and classification of scientific publications is significantly improved by deeply merging textual contents with the structure of the citation graph, we proceed with a hybrid clustering method based on Fisher's inverse chi-square. The optimal number of clusters is determined by a compound semiautomatic strategy comprising a combination of distance-based and stability-based methods. We also investigate the relationship between number of Latent Semantic Indexing factors, number of clusters, and clustering performance. The HITS and PageRank algorithms are used to determine representative publications in each cluster. Next, we develop a methodology for dynamic hybrid clustering of evolving bibliographic data sets. The same clustering methodology is applied to consecutive periods defined by time windows on the set, and in a subsequent phase chains are formed by matching and tracking clusters through time. Term networks for the eleven resulting cluster chains present the cognitive structure of the field. Finally, we provide a view on how much attention the bioinformatics community has devoted to the different subfields through time.
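The abstract names Fisher's inverse chi-square method as the basis of the hybrid clustering but gives no formula. The sketch below shows only the standard form of that method: k independent p-values are combined through the statistic -2 Σ ln p_i, which follows a chi-square distribution with 2k degrees of freedom under the null hypothesis. How the authors derive p-values from text-based and citation-based distances is not stated in this record, so the inputs `text_p` and `citation_p` and the function name `fisher_combine` are hypothetical and used purely for illustration.

```python
import math


def fisher_combine(p_values):
    """Combine independent p-values with Fisher's inverse chi-square method.

    Returns (statistic, combined_p), where statistic = -2 * sum(ln p_i) and
    combined_p is the chi-square survival function with 2k degrees of freedom.
    """
    if not p_values or any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(p_values)
    statistic = -2.0 * sum(math.log(p) for p in p_values)

    # For an even number of degrees of freedom (2k), the chi-square survival
    # function has the closed form exp(-x/2) * sum_{i<k} (x/2)^i / i!.
    half = statistic / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return statistic, math.exp(-half) * total


if __name__ == "__main__":
    # Hypothetical p-values for one document pair: one from the textual
    # distance, one from the citation-based distance (assumed inputs).
    text_p, citation_p = 0.04, 0.20
    stat, combined = fisher_combine([text_p, citation_p])
    print(f"statistic={stat:.3f}, combined p={combined:.4f}")
```

With a single p-value the function returns that p-value unchanged, and with two it reduces to p1·p2·(1 − ln(p1·p2)), which is a quick sanity check on the closed form. A smaller combined p-value would indicate that the two information sources jointly suggest the pair is closer than expected by chance.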
Pages: 360 / +
Page count: 2