Weighted Hybrid Clustering by Combining Text Mining and Bibliometrics on a Large-Scale Journal Database

被引:54
|
作者
Liu, Xinhai [1 ,2 ]
Yu, Shi [1 ]
Janssens, Frizo [1 ]
Glanzel, Wolfgang [3 ,4 ]
Moreau, Yves [1 ]
De Moor, Bart [1 ]
机构
[1] Katholieke Univ Leuven, ESAT SCD, B-3001 Leuven, Belgium
[2] Wuhan Univ Sci & Technol, Coll Informat Sci & Engn, Wuhan 430081, Hubei, Peoples R China
[3] Katholieke Univ Leuven, Ctr R&D Monitoring, Dept Managerial Econ Strategy & Innovat, B-3000 Leuven, Belgium
[4] Hungarian Acad Sci, IRPS, Budapest, Hungary
关键词
COMBINED COCITATION; WORD ANALYSIS; SCIENCE; INFORMATION; CONSENSUS;
D O I
10.1002/asi.21312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new hybrid clustering framework to incorporate text mining with bibliometrics in journal set analysis. The framework integrates two different approaches: clustering ensemble and kernel-fusion clustering. To improve the flexibility and the efficiency of processing large-scale data, we propose an information-based weighting scheme to leverage the effect of multiple data sources in hybrid clustering. Three different algorithms are extended by the proposed weighting scheme and they are employed on a large journal set retrieved from the Web of Science (WoS) database. The clustering performance of the proposed algorithms is systematically evaluated using multiple evaluation methods, and they were cross-compared with alternative methods. Experimental results demonstrate that the proposed weighted hybrid clustering strategy is superior to other methods in clustering performance and efficiency. The proposed approach also provides a more refined structural mapping of journal sets, which is useful for monitoring and detecting new trends in different scientific fields.
引用
收藏
页码:1105 / 1119
页数:15
相关论文
共 50 条
  • [1] Interleaved Text/Image Deep Mining on a Large-Scale Radiology Database
    Shin, Hoo-Chang
    Lu, Le
    Kim, Lauren
    Seff, Ari
    Yao, Jianhua
    Summers, Ronald M.
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1090 - 1099
  • [2] Graph Clustering for Large-Scale Text-Mining of Brain Imaging Studies
    Chawla, Manisha
    Mesa, Mounika
    Miyapuram, Krishna P.
    [J]. PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 163 - 168
  • [3] Large-Scale Text Mining of Biomedical Literature
    Ginter, Filip
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2013, (116): : 43 - 44
  • [4] Text mining using database tomography and bibliometrics: A review
    Kostoff, RN
    Toothman, DR
    Eberhart, HJ
    Humenik, JA
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2001, 68 (03) : 223 - 253
  • [5] Fractals text mining using bibliometrics and database tomography
    Kostoff, RN
    Shlesinger, MF
    Malpohl, G
    [J]. FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2004, 12 (01) : 1 - 16
  • [6] Interleaved text/image deep mining on a large-scale radiology database for automated image interpretation
    Shin, Hoo-Chang
    Lu, Le
    Kim, Lauren
    Seff, Ari
    Yao, Jianhua
    Summers, Ronald M.
    [J]. Journal of Machine Learning Research, 2016, 17 : 1 - 31
  • [7] Interleaved Text/Image Deep Mining on a Large-Scale Radiology Database for Automated Image Interpretation
    Shin, Hoo-Chang
    Lu, Le
    Kim, Lauren
    Seff, Ari
    Yao, Jainhua
    Summers, Ronald M.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [8] Nonlinear dynamics text mining using bibliometrics and Database Tomography
    Kostoff, RN
    Shlesinger, MF
    Tshiteya, R
    [J]. INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2004, 14 (01): : 61 - 92
  • [9] Electrochemical power text mining using bibliometrics and database tomography
    Kostoff, RN
    Tshiteya, R
    Pfeil, KM
    Humenik, JA
    [J]. JOURNAL OF POWER SOURCES, 2002, 110 (01) : 163 - 176
  • [10] Hybrid Clustering by Integrating Text and Citation based Graphs in Journal Database Analysis
    Liu, Xinhai
    Yu, Shi
    Moreau, Yves
    Janssens, Frizo
    De Moor, Bart
    Glaenzel, Wolfgang
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 521 - +