Graph clustering-based discretization approach to microarray data

被引:0
|
作者
Kittakorn Sriwanna
Tossapon Boongoen
Natthakan Iam-On
机构
[1] Chiang Rai Rajabhat University,School of Computer and Information Technology
[2] Mae Fah Luang University,IQ
来源
关键词
Multivariate discretization; Graph clustering; Microarray data; High-dimensional data; Data mining;
D O I
暂无
中图分类号
学科分类号
摘要
Several techniques in data mining require discrete data. In fact, learning with discrete domains often performs better than the case of continuous data. Multivariate discretization is the algorithm that transforms continuous data to discrete one by considering correlations among attributes. Given the benefit of this idea, many multivariate discretization algorithms have been proposed. However, there are a few discretization algorithms that directly apply to microarray or gene expression data, which is high-dimensional and unbalance data. Even so interesting, no multivariate method has been put forward for microarray data analysis. According to the recent published research, graph clustering-based discretization of splitting and merging methods (GraphS and GraphM) usually achieves superior results compared to many well-known discretization algorithms. In this paper, GraphS and GraphM are extended by adding the alpha parameter that is the ratio between the similarity of gene expressions (distance) and the similarity of the class label. Moreover, the extensions consider 3 similarity measures of cosine similarity, Euclidean distance, and Pearson correlation in order to determine the proper pairwise similarity measure. The evaluation against 20 real microarray datasets and 4 classifiers suggests that the results of three classification performances (ACC, AUC, Kappa) and running time of two proposed methods based on cosine similarity, GraphM(C) and GraphS(C) are better than 9 state-of-the-art discretization algorithms.
引用
收藏
页码:879 / 906
页数:27
相关论文
共 50 条
  • [1] Graph clustering-based discretization approach to microarray data
    Sriwanna, Kittakorn
    Boongoen, Tossapon
    Iam-On, Natthakan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 879 - 906
  • [2] An evolutionary cut points search for graph clustering-based discretization
    Sriwanna, Kittakorn
    Boongoen, Tossapon
    Iam-On, Natthakan
    [J]. 2016 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2016, : 514 - 519
  • [3] Graph clustering-based discretization of splitting and merging methods (GraphS and GraphM)
    Sriwanna, Kittakorn
    Boongoen, Tossapon
    Iam-On, Natthakan
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2017, 7
  • [4] Clustering-based hybrid feature selection approach for high dimensional microarray data
    Babu, Samson Anosh P.
    Annavarapu, Chandra Sekhara Rao
    Dara, Suresh
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213
  • [5] A clustering-based discretization for supervised learning
    Gupta, Ankit
    Mehrotra, Kishan G.
    Mohan, Chilukuri
    [J]. STATISTICS & PROBABILITY LETTERS, 2010, 80 (9-10) : 816 - 824
  • [6] Clustering-based approach for medical data classification
    Kodabagi, Mallikarjun M.
    Tikotikar, Ahelam
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (14):
  • [7] Improving Association Rule Mining Using Clustering-based Discretization of Numerical Data
    Tan, Swee Chuan
    [J]. 2018 INTERNATIONAL CONFERENCE ON INTELLIGENT AND INNOVATIVE COMPUTING APPLICATIONS (ICONIC), 2018, : 260 - 263
  • [8] A Genetics Clustering-based Approach for Weblog Data Cleaning
    Ganibardi, Amine
    Ali, Cherif Arab
    [J]. 2018 SIXTH INTERNATIONAL CONFERENCE ON ENTERPRISE SYSTEMS (ES 2018), 2018, : 75 - 81
  • [9] A clustering-based hybrid approach for dual data reduction
    Ratnoo, Saroj
    Rathee, Seema
    Ahuja, Jyoti
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2018, 6 (05) : 468 - 490
  • [10] Clustering-based Safety Grouping Strategy for Bipartite Graph Data Publishing
    Luo, Yongcheng
    Le, Jiajin
    Jiang, Yaqian
    Chen, Dehua
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (12A): : 5387 - 5394