A review of cluster analysis techniques and their uses in library and information science research: k-means and k-medoids clustering

被引:27
|
作者
Lund, Brady [1 ]
Ma, Jinxuan [1 ]
机构
[1] Emporia State Univ, Emporia, KS 66801 USA
关键词
Clustering; Library and information science; Research methods; Cluster analysis; Data analysis; K-means;
D O I
10.1108/PMM-05-2021-0026
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Purpose - This literature review explores the definitions and characteristics of cluster analysis, a machine-learning technique that is frequently implemented to identify groupings in big datasets and its applicability to library and information science (LIS) research. This overview is intended for researchers who are interested in expanding their data analysis repertory to include cluster analysis, rather than for existing experts in this area. Design/methodology/approach - A review of LIS articles included in the Library and Information Source (EBSCO) database that employ cluster analysis is performed. An overview of cluster analysis in general (how it works from a statistical standpoint, and how it can be performed by researchers), the most popular cluster analysis techniques and the uses of cluster analysis in LIS is presented. Findings - The number of LIS studies that employ a cluster analytic approach has grown from about 5 per year in the early 2000s to an average of 35 studies per year in the mid- and late-2010s. The journal Scientometrics has the most articles published within LIS that use cluster analysis (102 studies). Scientometrics is the most common subject area to employ a cluster analytic approach (152 studies). The findings of this review indicate that cluster analysis could make LIS research more accessible by providing an innovative and insightful process of knowledge discovery. Originality/value - This review is the first to present cluster analysis as an accessible data analysis approach, specifically from an LIS perspective.
引用
收藏
页码:161 / 173
页数:13
相关论文
共 50 条
  • [1] A Review of Cluster Analysis Techniques and Their Uses in Library and Information Science Research: K-Means and K-Medoids Clustering
    Lund, Brady D.
    Ma, Jinxuan
    SSRN, 2023,
  • [2] Comparative Analysis between K-Means and K-Medoids for Statistical Clustering
    Arbin, Norazam
    Suhaimi, Nur Suhailayani
    Mokhtar, Nurul Zafirah
    Othman, Zalinda
    2015 THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2015), 2015, : 117 - 121
  • [3] Comparison between K-Means and K-Medoids Clustering Algorithms
    Madhulatha, Tagaram Soni
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, 2011, 198 : 472 - 481
  • [4] K-Medoids for K-Means Seeding
    Newling, James
    Fleuret, Francois
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [5] K-Medoids and K-Means Clustering in High School Teacher Distribution
    Widiyaningtyas, Triyanna
    Pujianto, Utomo
    Prabowo, Martin Indra Wisnu
    2019 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND INFORMATION ENGINEERING (ICEEIE), 2019, : 330 - 335
  • [6] Analysis of K-Means and K-Medoids Algorithm For Big Data
    Arora, Preeti
    Deepali
    Varshney, Shipra
    1ST INTERNATIONAL CONFERENCE ON INFORMATION SECURITY & PRIVACY 2015, 2016, 78 : 507 - 512
  • [7] k-MM: A Hybrid Clustering Algorithm Based on k-Means and k-Medoids
    Drias, Habiba
    Cherif, Nadjib Fodil
    Kechid, Amine
    ADVANCES IN NATURE AND BIOLOGICALLY INSPIRED COMPUTING, 2016, 419 : 37 - 48
  • [8] K-Means and K-Medoids: Cluster Analysis on Birth Data Collected in City Muzaffarabad, Kashmir
    Abbas, Syed Ali
    Aslam, Adil
    Rehman, Aqeel Ur
    Abbasi, Wajid Arshad
    Arif, Saeed
    Kazmi, Syed Zaki Hassan
    IEEE ACCESS, 2020, 8 : 151847 - 151855
  • [9] K-Means and K-Medoids for Indonesian Text Summarization
    Purnamasari, K. K.
    2ND INTERNATIONAL CONFERENCE ON INFORMATICS, ENGINEERING, SCIENCE, AND TECHNOLOGY (INCITEST 2019), 2019, 662
  • [10] Operational analysis of k-medoids and k-means algorithms on noisy data
    Manjoro, Wellington Simbarashe
    Dhakar, Mradul
    Chaurasia, Brijesh Kumar
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 1500 - 1505