Identifying relevant features for a multi-factorial disorder with constraint-based subspace clustering

被引:2
|
作者
Hielscher, Tommy [1 ]
Spiliopoulou, Myra [1 ]
Voelzke, Henry [2 ]
Kuehn, Jens-Peter [2 ]
机构
[1] Otto Von Guericke Univ, Magdeburg, Germany
[2] Univ Med Greifswald, Greifswald, Germany
来源
2016 IEEE 29TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS) | 2016年
关键词
medical data mining; patient similarity; feature selection; classification; epidemiological studies; hepatic steatosis;
D O I
10.1109/CBMS.2016.42
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The identification of predictive features associated with distinct medical outcomes is a key requirement for meaningful clinical decision support. Usually, their discovery is based on sets of labeled examples and an analysis of the inherent information of the features w.r.t. the target variable. However, obtaining large sets of labeled examples may be not feasible and the sole label consideration could even dilute characteristics unique to distinct subgroups. In such cases, instead of considering the value of the target variable, expert knowledge on the similarity between examples could be utilized. In this work we propose a new algorithm for the "Discovery of Relevant Example-constrained Subspaces" (DRESS) which uses constraints on the similarity between examples to discover feature sets that describe a target concept. DRESS exploits the density of clusters and the distance-behavior between constrained examples to evaluate the quality of a feature set without requiring explicit information about the target variable. We evaluate DRESS against classical feature selection methods on cohort participants for the disorder "hepatic steatosis", and report on our findings on classifier performance and identified important features.
引用
收藏
页码:207 / 212
页数:6
相关论文
共 50 条
  • [1] Constraint-based clustering selection
    Van Craenendonck, Toon
    Blockeel, Hendrik
    MACHINE LEARNING, 2017, 106 (9-10) : 1497 - 1521
  • [2] Constraint-based clustering selection
    Toon Van Craenendonck
    Hendrik Blockeel
    Machine Learning, 2017, 106 : 1497 - 1521
  • [3] Constraint-based query clustering
    Ruiz, Carlos
    Menasalvas, Ernestina
    Spiliopoulou, Myra
    ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 304 - +
  • [4] Subspace clustering algorithm based on multi-rule constraint
    Li, Huiping
    International Journal of Multimedia and Ubiquitous Engineering, 2014, 9 (12): : 107 - 116
  • [5] Constraint-based clustering in large databases
    Tung, AKH
    Han, JW
    Lakshmanan, LVS
    Ng, RT
    DATABASE THEORY - ICDT 2001, PROCEEDINGS, 2001, 1973 : 405 - 419
  • [6] Graph constraint-based robust latent space low-rank and sparse subspace clustering
    Yunjun Xiao
    Jia Wei
    Jiabing Wang
    Qianli Ma
    Shandian Zhe
    Tolga Tasdizen
    Neural Computing and Applications, 2020, 32 : 8187 - 8204
  • [7] Graph constraint-based robust latent space low-rank and sparse subspace clustering
    Xiao, Yunjun
    Wei, Jia
    Wang, Jiabing
    Ma, Qianli
    Zhe, Shandian
    Tasdizen, Tolga
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 8187 - 8204
  • [8] Constraint-based Hierarchical Clustering for Time Sequences
    Kou, Yufeng
    Knackstedt, Chris
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2705 - 2711
  • [9] Constraint-based Clustering Algorithm for Multi-Density Data and Arbitrary Shapes
    Atwa, Walid
    Li, Kan
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, ICDM 2017, 2017, 10357 : 78 - 92
  • [10] Active Informative Pairwise Constraint Formulation Algorithm for Constraint-Based Clustering
    Zhong, Guoxiang
    Deng, Xiuqin
    Xu, Shengbing
    IEEE ACCESS, 2019, 7 : 81983 - 81993