ESTIMATION OF MISSING VALUES USING OPTIMISED HYBRID FUZZY C-MEANS AND MAJORITY VOTE FOR MICROARRAY DATA

Cited by: 0
Authors
Kumaran, Shamini Raja [1]
Othman, Mohd Shahizan [1]
Yusuf, Lizawati Mi [1]
Affiliations
[1] Univ Teknol Malaysia, Sch Comp, Skudai, Johor, Malaysia
Source
JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA | 2020, Vol. 19, No. 4
Keywords
Fuzzy C-means; majority vote; missing values; microarray data; data optimisation; IMPUTATION; ALGORITHM
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Missing values are a major constraint in microarray technologies, hindering the identification of disease-causing genes. Estimating missing values is an unavoidable task for field experts. Imputation is an effective way to fill in appropriate values so that the next steps of microarray analysis can proceed, and it may increase classification accuracy. Although these methods only predict the values, classification accuracy rates indicate how well a method estimates the missing values in gene expression data. In this study, a novel method, Optimised Hybrid of Fuzzy C-Means and Majority Vote (opt-FCMMV), was proposed to estimate the missing values in the data. Using the Majority Vote (MV) and optimisation through Particle Swarm Optimisation (PSO), this study predicted missing values in the data to form more informative and solid data. To verify the effectiveness of opt-FCMMV, several experiments were carried out on two publicly available microarray datasets from the biomedical domain (i.e. Ovary and Lung Cancer) under three missing value mechanisms with five different percentages of missing values, using a Support Vector Machine (SVM) classifier. The experimental results showed that the proposed method achieved the highest accuracy rate compared with no imputation, imputation by Fuzzy C-Means (FCM), and imputation by Fuzzy C-Means with Majority Vote (FCMMV). For example, the accuracy rates for Ovary Cancer data with 5% missing values were 64.0% for no imputation, 81.8% (FCM), 90.0% (FCMMV), and 93.7% (opt-FCMMV). Such an outcome indicates that opt-FCMMV may also be applied in other domains to prepare datasets for various data mining tasks.
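The abstract does not spell out how FCM-based imputation works, so the following is only a minimal sketch of the plain FCM baseline, not the authors' opt-FCMMV. It assumes the common partial-distance variant of fuzzy c-means: distances use only the observed entries of each row, and a missing entry is filled with the membership-weighted average of the cluster centres. The function name `fcm_impute`, its parameters, the deterministic initialisation, and the toy data are all hypothetical; the Majority Vote and PSO steps are not reproduced here.

```python
def fcm_impute(rows, n_clusters=2, m=2.0, n_iter=100):
    """Fill None entries with a fuzzy c-means estimate (illustrative sketch).

    Assumes every row and every column has at least one observed value
    and that the fuzzifier m is greater than 1.
    """
    n, d = len(rows), len(rows[0])
    observed = [[j for j in range(d) if r[j] is not None] for r in rows]
    col_means = [
        sum(r[j] for r in rows if r[j] is not None)
        / sum(1 for r in rows if r[j] is not None)
        for j in range(d)
    ]

    # Assumed deterministic start: take rows spread evenly across the
    # ordering by mean observed value, filling their gaps with column means.
    order = sorted(
        range(n),
        key=lambda i: sum(rows[i][j] for j in observed[i]) / len(observed[i]),
    )
    centres = []
    for k in range(n_clusters):
        src = rows[order[(n - 1) * k // max(n_clusters - 1, 1)]]
        centres.append(
            [col_means[j] if src[j] is None else src[j] for j in range(d)]
        )

    def memberships():
        u = []
        for i, r in enumerate(rows):
            # Partial squared distance over observed entries only,
            # rescaled by d / (number of observed entries).
            dist = [
                d / len(observed[i])
                * sum((r[j] - c[j]) ** 2 for j in observed[i])
                + 1e-12  # avoids division by zero when a row equals a centre
                for c in centres
            ]
            inv = [dk ** (-1.0 / (m - 1.0)) for dk in dist]
            total = sum(inv)
            u.append([v / total for v in inv])
        return u

    for _ in range(n_iter):
        u = memberships()
        # Update each centre coordinate from the rows that observe it.
        for k in range(n_clusters):
            for j in range(d):
                rows_j = [i for i in range(n) if rows[i][j] is not None]
                num = sum(u[i][k] ** m * rows[i][j] for i in rows_j)
                den = sum(u[i][k] ** m for i in rows_j)
                centres[k][j] = num / den

    u = memberships()
    return [
        [
            r[j]
            if r[j] is not None
            else sum(u[i][k] * centres[k][j] for k in range(n_clusters))
            for j in range(d)
        ]
        for i, r in enumerate(rows)
    ]


# Toy data (made up): two well-separated groups, one missing entry.
rows = [
    [1.0, 1.1], [0.9, 1.0], [1.1, 0.9],
    [9.0, 9.1], [9.1, 8.9], [8.9, None],
]
filled = fcm_impute(rows)
print(filled[5])
```

In this toy case the incomplete row sits next to the high-valued group, so its estimate is drawn almost entirely from that group's centre. Observed entries are returned unchanged.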
Pages: 459-482
Page count: 24