A review of cluster analysis techniques and their uses in library and information science research: k-means and k-medoids clustering

Cited by: 27
Authors
Lund, Brady [1]
Ma, Jinxuan [1]
Affiliations
[1] Emporia State Univ, Emporia, KS 66801 USA
Keywords
Clustering; Library and information science; Research methods; Cluster analysis; Data analysis; K-means
DOI
10.1108/PMM-05-2021-0026
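Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract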
Purpose - This literature review explores the definitions and characteristics of cluster analysis, a machine-learning technique frequently used to identify groupings in large datasets, and its applicability to library and information science (LIS) research. This overview is intended for researchers who are interested in expanding their data analysis repertory to include cluster analysis, rather than for existing experts in this area. Design/methodology/approach - The review covers LIS articles in the Library and Information Source (EBSCO) database that employ cluster analysis. It presents an overview of cluster analysis in general (how it works from a statistical standpoint, and how researchers can perform it), the most popular cluster analysis techniques and the uses of cluster analysis in LIS. Findings - The number of LIS studies that employ a cluster analytic approach has grown from about 5 per year in the early 2000s to an average of 35 studies per year in the mid- and late-2010s. The journal Scientometrics has published the most LIS articles that use cluster analysis (102 studies). Scientometrics is also the most common subject area to employ a cluster analytic approach (152 studies). The findings of this review indicate that cluster analysis could make LIS research more accessible by providing an innovative and insightful process of knowledge discovery. Originality/value - This review is the first to present cluster analysis as an accessible data analysis approach, specifically from an LIS perspective.
Pages: 161 - 173
Page count: 13
Related papers
50 in total
  • [31] Fuzzy K-means clustering with reconstructed information
    Huang, Honglan
    Shi, Wei
    Yang, Fangjie
    Feng, Yanghe
    Zhang, Longfei
    Liang, Xingxing
    Shi, Jun
    Cheng, Guangquan
    Huang, Jincai
    Liu, Zhong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (01) : 43 - 53
  • [32] Stability analysis in K-means clustering
    Steinley, Douglas
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2008, 61 : 255 - 273
  • [33] Relocating Local Outliers Produced by K-means and K-medoids Using Local Outlier Rectifier V.2.0
    Badiang, Rogelio O., Jr.
    Gerardo, Bobby D.
    Medina, Ruji P.
    2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2019), 2019, : 89 - 93
  • [34] Cluster structure of K-means clustering via principal component analysis
    Ding, C
    He, XF
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 414 - 418
  • [35] Using K-Means Clustering to Cluster Provinces in Indonesia
    Ahmar, Ansari Saleh
    Napitupulu, Darmawan
    Rahim, Robbi
    Hidayat, Rahmat
    Sonatha, Yance
    Azmi, Meri
    2ND INTERNATIONAL CONFERENCE ON STATISTICS, MATHEMATICS, TEACHING, AND RESEARCH 2017, 2018, 1028
  • [36] Privacy Preservation in k-Means Clustering by Cluster Rotation
    Dhiraj, S. S. Shivaji
    Khan, Ameer M. Asif
    Khan, Wajhiulla
    Challagalla, Ajay
    TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 1437 - 1443
  • [37] Research and Improvement on K-Means Clustering Algorithm
    Wang, Xue-mei
    Wang, Jin-bo
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 1138 - 1141
  • [38] Research on Improved K-means Clustering Algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1977 - 1980
  • [39] Research on improved K-means clustering algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    Advanced Materials Research, 2012, 403-408 : 1977 - 1980
  • [40] Cluster center initialization algorithm for K-means clustering
    Khan, SS
    Ahmad, A
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302