A comparison of cluster algorithms as applied to unsupervised surveys

被引:0
|
作者
Garwood K.C. [1 ]
Dhobale A.A. [2 ]
机构
[1] Saint Joseph's University, 5600 City Ave, Philadelphia, 19131, PA
[2] Indian Institute of Technology, Near Doul Gobinda Road, Amingaon, North Guwahati, Guwahati, 781039, Assam
关键词
Cluster analysis; Decision support system; Fuzzy logic; Hierarchical clustering; K-means; K-modes; Survey analysis; Unsupervised learning;
D O I
10.1504/IJBIDM.2021.114471
中图分类号
学科分类号
摘要
When considering answering important questions with data, unsupervised data offers extensive insight opportunity and unique challenges. This study considers student survey data with a specific goal of clustering students into like groups with underlying concept of identifying different poverty levels. Fuzzy logic is considered during the data cleaning and organising phase helping to create a logical dependent variable for analysis comparison. Using multiple data reduction techniques, the survey was reduced and cleaned. Finally, multiple clustering techniques (k-means, k-modes and hierarchical clustering) are applied and compared. Though each method has strengths, the goal was to identify which was most viable when applied to survey data and specifically when trying to identify the most impoverished students. © 2021 Inderscience Enterprises Ltd.
引用
收藏
页码:332 / 363
页数:31
相关论文
共 50 条
  • [21] Performance comparison of parallel sorting algorithms on the cluster of workstations
    Kyi, Lai Lai Win
    Tun, Nay Min
    World Academy of Science, Engineering and Technology, 2011, 75 : 344 - 348
  • [22] Comparison of cluster expansion fitting algorithms for interactions at surfaces
    Herder, Laura M.
    Bray, Jason M.
    Schneider, William F.
    SURFACE SCIENCE, 2015, 640 : 104 - 111
  • [23] Performance Comparison of Supervised and Unsupervised Equalization Algorithms for Frequency Selective Channels
    Fazal-E-Asim
    Bashir, Sajid
    Abrar, Shafayat
    Shah, Syed Ismail
    2016 13TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2016, : 677 - 681
  • [24] A comparison of algorithms to compute the positive matrix factorization and their application to unsupervised unmixing
    Masalmah, Yahya M.
    Velez-Reyes, Miguel
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XII PTS 1 AND 2, 2006, 6233
  • [25] Comparison of Bioinspired Algorithms Applied to the Timetabling Problem in Sport
    Silva, Jesus
    Cabrera, Danelys
    Maco, Jose
    Villon, Martin
    Garcia Guliany, Jesus
    Roncallo, Alberto
    Hernandez Palma, Hugo
    11TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 3RD INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2020, 170 : 965 - 970
  • [26] A comparison of clustering algorithms applied to color image quantization
    Scheunders, P
    PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) : 1379 - 1384
  • [27] ON UNSUPERVISED ESTIMATION ALGORITHMS
    PATRICK, EA
    COSTELLO, JP
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1970, 16 (05) : 556 - +
  • [28] Comparison of two cluster sampling methods for health surveys in developing countries
    Milligan, P
    Njie, A
    Bennett, S
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2004, 33 (03) : 469 - 476
  • [29] ON UNSUPERVISED ESTIMATION ALGORITHMS
    PATRICK, EA
    COSTELLO, JP
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1970, 16 (01) : 123 - +
  • [30] Cosmology with cluster surveys
    Subhabrata Majumdar
    Pramana, 2004, 63 : 871 - 875