ESTIMATION OF MISSING VALUES USING OPTIMISED HYBRID FUZZY C-MEANS AND MAJORITY VOTE FOR MICROARRAY DATA

被引:0
|
作者
Kumaran, Shamini Raja [1 ]
Othman, Mohd Shahizan [1 ]
Yusuf, Lizawati Mi [1 ]
机构
[1] Univ Teknol Malaysia, Sch Comp, Skudai, Johor, Malaysia
来源
JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA | 2020年 / 19卷 / 04期
关键词
Fuzzy C-means; majority vote; missing values; microarray data; data optimisation; IMPUTATION; ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Missing values are a huge constraint in microarray technologies towards improving and identifying disease-causing genes. Estimating missing values is an undeniable scenario faced by field experts. The imputation method is an effective way to impute the proper values to proceed with the next process in microarray technology. Missing value imputation methods may increase the classification accuracy. Although these methods might predict the values, classification accuracy rates prove the ability of the methods to identify the missing values in gene expression data. In this study, a novel method, Optimised Hybrid of Fuzzy C-Means and Majority Vote (opt-FCMMV), was proposed to identify the missing values in the data. Using the Majority Vote (MV) and optimisation through Particle Swann Optimisation (PSO), this study predicted missing values in the data to form more informative and solid data. In order to verify the effectiveness of opt-FCMMV, several experiments were carried out on two publicly available microarray datasets (i.e. Ovary and Lung Cancer) under three missing value mechanisms with five different percentage values in the biomedical domain using Support Vector Machine (SVM) classifier. The experimental results showed that the proposed method functioned efficiently by showcasing the highest accuracy rate as compared to the one without imputations, with imputation by Fuzzy C-Means (FCM), and imputation by Fuzzy C-Means with Majority Vote (FCMMV). For example, the accuracy rates for Ovary Cancer data with 5% missing values were 64.0% for no imputation, 81.8% (FCM), 90.0% (FCMMV), and 93.7% (opt-FCMMV). Such an outcome indicates that the opt-FCMMV may also be applied in different domains in order to prepare the dataset for various data mining tasks.
引用
收藏
页码:459 / 482
页数:24
相关论文
共 50 条
  • [41] Fuzzy c-means clustering for data with tolerance using kernel functions
    Kanzawa, Yuchi
    Endo, Yasunori
    Miyamoto, Sadaaki
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 744 - +
  • [42] A weighted fuzzy c-means clustering model for fuzzy data
    D'Urso, P
    Giordani, P
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (06) : 1496 - 1523
  • [43] Fuzzy Constrained Inversion of Magnetotelluric Data Using Guided Fuzzy C-Means Clustering
    Yang, Bo
    Xu, Kaijun
    Liu, Zhan
    SURVEYS IN GEOPHYSICS, 2021, 42 (02) : 399 - 425
  • [44] Fuzzy Constrained Inversion of Magnetotelluric Data Using Guided Fuzzy C-Means Clustering
    Bo Yang
    Kaijun Xu
    Zhan Liu
    Surveys in Geophysics, 2021, 42 : 399 - 425
  • [45] Biometric Hand Vein Estimation using Bloodstream Filtration and Fuzzy c-means
    Kolda, Lukas
    Krejcar, Ondrej
    2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [46] Hybrid Image Segmentation Using Fuzzy C-Means and Gravitational Search Algorithm
    Majd, Emadaldin Mozafari
    As'ari, M. A.
    Sheikh, U. U.
    Abu-Bakar, S. A. R.
    FOURTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2012), 2012, 8334
  • [47] Fuzzy c-means classifier with deterministic initialization and missing value imputation
    Ichihashi, Hidetomo
    Honda, Katsuhiro
    Notsu, Akira
    Yagi, Takafumi
    2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 214 - +
  • [48] Fuzzy Clustering Using C-Means Method
    Krastev, Georgi
    Georgiev, Tsvetozar
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2015, 4 (02): : 144 - 148
  • [49] Missing Values Imputation based on Fuzzy C-Means Algorithm for Classification of Chronic Obstructive Pulmonary Disease (COPD)
    Aristiawati, Kiki
    Siswantining, Titin
    Sarwinda, Devvi
    Soemartojo, Saskya Mary
    PROCEEDINGS OF THE 8TH SEAMS-UGM INTERNATIONAL CONFERENCE ON MATHEMATICS AND ITS APPLICATIONS 2019: DEEPENING MATHEMATICAL CONCEPTS FOR WIDER APPLICATION THROUGH MULTIDISCIPLINARY RESEARCH AND INDUSTRIES COLLABORATIONS, 2019, 2192
  • [50] A modified fuzzy C-means algorithm for bias field estimation and segmentation of MRI data
    Ahmed, MN
    Yamany, SM
    Mohamed, N
    Farag, AA
    Moriarty, T
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2002, 21 (03) : 193 - 199