Weighted K-means support vector machine for cancer prediction

被引:21
|
作者
Kim, SungHwan [1 ]
机构
[1] Korea Univ, Dept Stat, Seoul 136701, South Korea
来源
SPRINGERPLUS | 2016年 / 5卷
关键词
Support vector machine; K-means clustering; Weighted SVM; TCGA; BREAST-CANCER; RECURRENCE; TAMOXIFEN; RISK;
D O I
10.1186/s40064-016-2677-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To date, the support vector machine (SVM) has been widely applied to diverse biomedical fields to address disease subtype identification and pathogenicity of genetic variants. In this paper, I propose the weighted K-means support vector machine (wKM-SVM) and weighted support vector machine (wSVM), for which I allow the SVM to impose weights to the loss term. Besides, I demonstrate the numerical relations between the objective function of the SVM and weights. Motivated by general ensemble techniques, which are known to improve accuracy, I directly adopt the boosting algorithm to the newly proposed weighted KM-SVM (and wSVM). For predictive performance, a range of simulation studies demonstrate that the weighted KM-SVM (and wSVM) with boosting outperforms the standard KM-SVM (and SVM) including but not limited to many popular classification rules. I applied the proposed methods to simulated data and two large-scale real applications in the TCGA pan-cancer methylation data of breast and kidney cancer. In conclusion, the weighted KM-SVM (and wSVM) increases accuracy of the classification model, and will facilitate disease diagnosis and clinical treatment decisions to benefit patients. A software package (wSVM) is publicly available at the R-project webpage (https://www.r-project.org).
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Unlabeled data classiflcation via support vector machines and k-means clustering
    Li, MK
    Cheng, YS
    Zhao, HH
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION, PROCEEDINGS, 2004, : 183 - 186
  • [42] Support Vector Machines Based on Sectional Set Fuzzy K-Means Clustering
    Ma, Li-juan
    Ha, Ming-hu
    [J]. FUZZY INFORMATION AND ENGINEERING, VOL 1, 2009, 54 : 420 - 425
  • [43] A Support Vector and K-Means Based Hybrid Intelligent Data Clustering Algorithm
    Sun, Liang
    Yoshida, Shinichi
    Liang, Yanchun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (11) : 2234 - 2243
  • [44] Support vector machines and k-means to build implementation areas of bundling hubs
    El Ouadi, Jihane
    Malhene, Nicolas
    Benhadou, Siham
    Medromi, Hicham
    [J]. EUROPEAN TRANSPORT-TRASPORTI EUROPEI, 2021, (83):
  • [45] Multi-level hybrid support vector machine and extreme learning machine based on modified K-means for intrusion detection system
    Al-Yaseen, Wathiq Laftah
    Othman, Zulaiha Ali
    Nazri, Mohd Zakree Ahmad
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 67 : 296 - 303
  • [46] Splitting K-means Generated Neural Fuzzy System with Support Vector Regression
    Hsieh, Cheng-Da
    Juang, Chia-Feng
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1417 - 1421
  • [47] Prediction of Breast Cancer Using Support Vector Machine and K-Nearest Neighbors
    Islam, Md. Milon
    Iqbal, Hasib
    Haque, Md. Rezwanul
    Hasan, Md. Kamrul
    [J]. 2017 IEEE REGION 10 HUMANITARIAN TECHNOLOGY CONFERENCE (R10-HTC), 2017, : 226 - 229
  • [48] Entropy Weighted Power k-Means Clustering
    Chakraborty, Saptarshi
    Paul, Debolina
    Das, Swagatam
    Xu, Jason
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 691 - 700
  • [49] Constrained Clustering with Minkowski Weighted K-Means
    de Amorim, Renato Cordeiro
    [J]. 13TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI 2012), 2012, : 13 - 17
  • [50] Weighted adjacent matrix for K-means clustering
    Zhou, Jukai
    Liu, Tong
    Zhu, Jingting
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33415 - 33434