Silhouette Analysis for Performance Evaluation in Machine Learning with Applications to Clustering

被引:108
|
作者
Shutaywi, Meshal [1 ]
Kachouie, Nezamoddin N. [2 ]
机构
[1] King Abdulaziz Univ, Dept Math, Rabigh 21911, Saudi Arabia
[2] Florida Inst Technol, Dept Math Sci, Melbourne, FL 32901 USA
关键词
k-means; kernel k-means; machine learning; nonlinear clustering; silhouette index; weighted clustering;
D O I
10.3390/e23060759
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Grouping the objects based on their similarities is an important common task in machine learning applications. Many clustering methods have been developed, among them k-means based clustering methods have been broadly used and several extensions have been developed to improve the original k-means clustering method such as k-means ++ and kernel k-means. K-means is a linear clustering method; that is, it divides the objects into linearly separable groups, while kernel k-means is a non-linear technique. Kernel k-means projects the elements to a higher dimensional feature space using a kernel function, and then groups them. Different kernel functions may not perform similarly in clustering of a data set and, in turn, choosing the right kernel for an application could be challenging. In our previous work, we introduced a weighted majority voting method for clustering based on normalized mutual information (NMI). NMI is a supervised method where the true labels for a training set are required to calculate NMI. In this study, we extend our previous work of aggregating the clustering results to develop an unsupervised weighting function where a training set is not available. The proposed weighting function here is based on Silhouette index, as an unsupervised criterion. As a result, a training set is not required to calculate Silhouette index. This makes our new method more sensible in terms of clustering concept.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Singularity for Machine Learning Applications - Analysis of Performance Impact
    Jordan, Bruce R., Jr.
    Barrett, David
    Burke, David
    Jardin, Patrick
    Littrell, Amelia
    Monticciolo, Paul
    Newey, Michael
    Piou, Jean
    Warner, Kara
    2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2019,
  • [2] Performance Evaluation of Serverless Edge Computing for Machine Learning Applications
    Trieu, Quoc Lap
    Javadi, Bahman
    Basilakis, Jim
    Toosi, Adel N.
    2022 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC, 2022, : 139 - 144
  • [3] Performance assessment of load profiles clustering methods based on silhouette analysis
    Bosisio, Alessandro
    Berizzi, Alberto
    Morotti, Andrea
    Greco, Bartolomeo
    Iarmarelli, Gaetano
    Moscatiello, Cristina
    Boccaletti, Chiara
    Noriega, Holguer
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [4] Performance Evaluation of the Silhouette Index
    Starczewski, Artur
    Krzyzak, Adam
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II (ICAISC 2015), 2015, 9120 : 49 - 58
  • [5] Analysis and evaluation of machine learning applications in materials design and discovery
    Golmohammadi, Mahsa
    Aryanpour, Masoud
    MATERIALS TODAY COMMUNICATIONS, 2023, 35
  • [6] A Machine Learning Framework for Performance Coverage Analysis of Proxy Applications
    Islam, Tanzima Z.
    Thiagarajan, Jayaraman J.
    Bhatele, Abhinav
    Schulz, Martin
    Gamblin, Todd
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 538 - 549
  • [7] Performance Evaluation of Machine Learning and Deep Learning Techniques for Sentiment Analysis
    Mehta, Anushka
    Parekh, Yash
    Karamchandani, Sunil
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 463 - 471
  • [8] Performance Analysis of Machine Learning Algorithms With Clustering Protocol in Wireless Sensor Networks
    Gantassi, Rahma
    Masood, Zaki
    Lim, Sol
    Sias, Quota Alief
    Choi, Yonghoon
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 543 - 546
  • [9] Sentiment Analysis Using Machine Learning Classifiers: Evaluation of Performance
    Rai, Shamantha B.
    Shetty, Sweekriti M.
    Rai, Prakhyath
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 21 - 25
  • [10] Performance evaluation of machine learning models on large dataset of android applications reviews
    Qureshi, Ali Adil
    Ahmad, Maqsood
    Ullah, Saleem
    Yasir, Muhammad Naveed
    Rustam, Furqan
    Ashraf, Imran
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (24) : 37197 - 37219