Weighted K-means support vector machine for cancer prediction

被引:21
|
作者
Kim, SungHwan [1 ]
机构
[1] Korea Univ, Dept Stat, Seoul 136701, South Korea
来源
SPRINGERPLUS | 2016年 / 5卷
关键词
Support vector machine; K-means clustering; Weighted SVM; TCGA; BREAST-CANCER; RECURRENCE; TAMOXIFEN; RISK;
D O I
10.1186/s40064-016-2677-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To date, the support vector machine (SVM) has been widely applied to diverse biomedical fields to address disease subtype identification and pathogenicity of genetic variants. In this paper, I propose the weighted K-means support vector machine (wKM-SVM) and weighted support vector machine (wSVM), for which I allow the SVM to impose weights to the loss term. Besides, I demonstrate the numerical relations between the objective function of the SVM and weights. Motivated by general ensemble techniques, which are known to improve accuracy, I directly adopt the boosting algorithm to the newly proposed weighted KM-SVM (and wSVM). For predictive performance, a range of simulation studies demonstrate that the weighted KM-SVM (and wSVM) with boosting outperforms the standard KM-SVM (and SVM) including but not limited to many popular classification rules. I applied the proposed methods to simulated data and two large-scale real applications in the TCGA pan-cancer methylation data of breast and kidney cancer. In conclusion, the weighted KM-SVM (and wSVM) increases accuracy of the classification model, and will facilitate disease diagnosis and clinical treatment decisions to benefit patients. A software package (wSVM) is publicly available at the R-project webpage (https://www.r-project.org).
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Vehicle Classification using Support Vector Machines and k-means Clustering
    Cho, Hsun-Jung
    Li, Rih-Jin
    Lee, Hsia
    Wu, Jennifer Yuh-Jen
    [J]. COMPUTATIONAL METHODS IN SCIENCE AND ENGINEERING, VOL 2: ADVANCES IN COMPUTATIONAL SCIENCE, 2009, 1148 : 449 - +
  • [32] The LINEX Weighted k-Means Clustering
    Ahmadzadehgoli, Narges
    Mohammadpour, Adel
    Behzadi, Mohammad Hassan
    [J]. JOURNAL OF STATISTICAL THEORY AND APPLICATIONS, 2019, 18 (02): : 147 - 154
  • [33] The LINEX Weighted k-Means Clustering
    Narges Ahmadzadehgoli
    Adel Mohammadpour
    Mohammad Hassan Behzadi
    [J]. Journal of Statistical Theory and Applications, 2019, 18 : 147 - 154
  • [34] RFID indoor localization based on support vector regression and k-means
    Berz, Everton Luis
    Tesch, Deivid Antunes
    Hessel, Fabiano Passuelo
    [J]. 2015 IEEE 24TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2015, : 1418 - 1423
  • [35] Support Vector Data Descriptions and k-Means Clustering: One Class?
    Goernitz, Nico
    Lima, Luiz Alberto
    Mueller, Klaus-Robert
    Kloft, Marius
    Nakajima, Shinichi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 3994 - 4006
  • [36] BLIND BANDWIDTH EXTENSION USING K-MEANS AND SUPPORT VECTOR REGRESSION
    Wu, Chih-Wei
    Vinton, Mark
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 721 - 725
  • [37] Fast support vector data description using K-means clustering
    Kim, Pyo Jae
    Chang, Hyung Jin
    Song, Dong Sung
    Choi, Jin Young
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 3, PROCEEDINGS, 2007, 4493 : 506 - +
  • [38] Learning Weighted Top-k Support Vector Machine
    Kato, Tsuyoshi
    Hirohashi, Yoshihiro
    [J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 774 - 789
  • [39] Feature selection and design of intrusion detection system based on k-means and triangle area support vector machine
    Tang, Pingjie
    Jiang, Rang-an
    Zhao, Mingwei
    [J]. SECOND INTERNATIONAL CONFERENCE ON FUTURE NETWORKS: ICFN 2010, 2010, : 144 - 148
  • [40] Prediction of loom machine status based on binary K-means theory
    Peng L.
    Tang Q.
    Dai N.
    Hu X.
    [J]. Fangzhi Xuebao/Journal of Textile Research, 2023, 44 (05): : 112 - 118