Using Full-text of Academic Articles to Find Software Clusters

被引:0
|
作者
Zhang, Heng [1 ]
Ma, Shutian [1 ]
Zhang, Chengzhi [1 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Informat Management, Nanjing 210094, Peoples R China
关键词
Scientific Software; Software Clustering; Distributed Representation;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Scientific software is making contributions to modern science. To meet huge academic demands such as data analysis, modelling, visualization and so on, various software has been developed to help different steps in scientific work. In order to reveal the connections between scientific software, we conduct cluster analysis among scientific software based on the full-text data of 23,120 articles published in PLOS ONE. Firstly, we select some popular software whose mention times are over 50 to be our candidate software list for clustering analysis. Secondly, Word2Vec is applied to learn distributed representation for each software. Then, we apply Affinity Propagation to cluster software and tune different parameters to obtain better results. Silhouette coefficient is computed here to evaluate clustering performance under each parameter setting. According to our optimal results, software clusters with specific functions can be found. And software which have strong linkage between each other are mainly have functions in common.
引用
收藏
页码:2776 / 2777
页数:2
相关论文
共 50 条
  • [1] Using Full-Text of Research Articles to Analyze Academic Impact of Algorithms
    Wang, Yuzhuo
    Zhang, Chengzhi
    [J]. TRANSFORMING DIGITAL WORLDS, ICONFERENCE 2018, 2018, 10766 : 395 - 401
  • [2] Full-text journal articles on the Internet
    Prakash, CS
    [J]. AUSTRALASIAN BIOTECHNOLOGY, 1998, 8 (05) : 308 - 309
  • [3] Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China
    Zhang, Heng
    Zhang, Chengzhi
    [J]. KNOWLEDGE ORGANIZATION, 2021, 48 (02): : 126 - 139
  • [4] Using Full-text Content of Academic Articles to Classify Research Methods in Library and Information Science
    Wang, Ruping
    Tian, Liang
    Zhang, Chengzhi
    [J]. 18TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS (ISSI2021), 2021, : 1557 - 1558
  • [5] Using R to develop a corpus of full-text journal articles
    Anderson, Billie
    Bani-Yaghoub, Majid
    Kantheti, Vagmi
    Curtis, Scott
    [J]. JOURNAL OF INFORMATION SCIENCE, 2023,
  • [6] Extracting and quantifying eponyms in full-text articles
    Cabanac, Guillaume
    [J]. SCIENTOMETRICS, 2014, 98 (03) : 1631 - 1645
  • [7] Extracting and quantifying eponyms in full-text articles
    Guillaume Cabanac
    [J]. Scientometrics, 2014, 98 : 1631 - 1645
  • [8] Investigating Citation of Algorithm in Full-text of Academic Articles in NLP domain: A Preliminary Study
    Ding, Ruiyi
    Wang, Yuzhuo
    Zhang, Chengzhi
    [J]. 17TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS (ISSI2019), VOL II, 2019, : 2726 - 2727
  • [9] Which structure of academic articles do referees pay more attention to?: perspective of peer review and full-text of academic articles
    Qin, Chenglei
    Zhang, Chengzhi
    [J]. ASLIB JOURNAL OF INFORMATION MANAGEMENT, 2023, 75 (05) : 884 - 916
  • [10] Using Full-text to Evaluate Impact of Different Software Groups
    Ma, Shutian
    Zhang, Chengzhi
    [J]. 16TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS (ISSI 2017), 2017, : 1666 - 1667