Discovery of significant rules for classifying cancer diagnosis data

被引:37
|
作者
Li, Jinyan [1 ]
Liu, Huiqing [1 ]
Ng, See-Kiong [1 ]
Wong, Limsoon [1 ]
机构
[1] Inst Infocomm Res, Singapore 119613, Singapore
关键词
D O I
10.1093/bioinformatics/btg1066
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Methods and Results: We introduce a new method to discover many diversified and significant rules from high dimensional profiling data. We also propose to aggregate the discriminating power of these rules for reliable predictions. The discovered rules are found to contain low-ranked features; these features are found to be sometimes necessary for classifiers to achieve perfect accuracy. The use of low-ranked but essential features in our method is in constrast to the prevailing use of an adhoc number of only top-ranked features. On a wide range of data sets, our method displayed highly competitive accuracy compared to the best performance of other kinds of classification models. In addition to accuracy, our method also provides comprehensible rules to help elucidate the translation between raw data and useful knowledge.
引用
收藏
页码:II93 / II102
页数:10
相关论文
共 50 条
  • [1] On the discovery of significant temporal rules
    Blanchard, Julien
    Guillet, Fabrice
    Gras, Regis
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 3074 - 3081
  • [2] Simple Bayesian binary framework for discovering significant genes and classifying cancer diagnosis
    Yang, Tae Young
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (05) : 1743 - 1754
  • [3] Efficient discovery of statistically significant association rules
    Hamalainen, Wilhelmiina
    Nykanen, Matti
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 203 - +
  • [4] Discovery of Significant Classification Rules from Incrementally Inducted Decision Tree Ensemble for Diagnosis of Disease
    Piao, Minghao
    Lee, Jong Bum
    Saeed, Khalid E. K.
    Ryu, Keun Ho
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 587 - 594
  • [5] Obtaining Significant and Interpretable Rules for Subgroup Discovery Tasks
    Dominguez, J. L.
    Mata, J.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2017, 15 (10) : 2012 - 2016
  • [6] Ontologies for classifying and modeling drug discovery data
    Schuerer, Stephan
    Lin, Asiyah Yu
    Mehta, Saurabh
    McGinty, Hande Kucuk
    Cheng, Qiong
    Koleti, Amar
    Zadeh, Nooshin
    Vidovic, Dusica
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [7] Searching for the most significant rules: an evolutionary approach for subgroup discovery
    Victoria Pachón
    Jacinto Mata
    Juan Luis Domínguez
    [J]. Soft Computing, 2017, 21 : 2609 - 2618
  • [8] Searching for the most significant rules: an evolutionary approach for subgroup discovery
    Pachon, Victoria
    Mata, Jacinto
    Luis Dominguez, Juan
    [J]. SOFT COMPUTING, 2017, 21 (10) : 2609 - 2618
  • [9] Discovery of association rules in tabular data
    Richards, G
    Rayward-Smith, VJ
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 465 - 472
  • [10] Discovery of association rules in medical data
    Doddi, S
    Marathe, A
    Ravi, SS
    Torney, DC
    [J]. MEDICAL INFORMATICS AND THE INTERNET IN MEDICINE, 2001, 26 (01): : 25 - 33