DMET-Miner: Efficient discovery of association rules from pharmacogenomic data

Cited by: 34
Authors
Agapito, Giuseppe [1]
Guzzi, Pietro H. [1]
Cannataro, Mario [1,2]
Affiliations
[1] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, Catanzaro, Italy
[2] CNR, ICAR, I-00185 Rome, Italy
Keywords
Personalized medicine; Single nucleotide polymorphism; Frequent itemset mining; Association rules; COLORECTAL-CANCER PATIENTS; POLYMORPHISM;
DOI
10.1016/j.jbi.2015.06.005
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Microarray platforms enable the investigation of allelic variants that may be correlated with phenotypes. Among these, the Affymetrix DMET (Drug Metabolism Enzymes and Transporters) platform enables the simultaneous investigation of all the genes related to drug absorption, distribution, metabolism and excretion (ADME). Although recent studies have demonstrated the effectiveness of DMET data for studying drug response or toxicity in clinical studies, there is a lack of tools for the automatic analysis of DMET data. In previous work we developed DMET-Analyzer, a methodology and supporting platform that automates the statistical study of allelic variants and has been validated in several clinical studies. Although DMET-Analyzer can correlate a single variant for each probe (related to a portion of a gene) with a condition by means of the Fisher test, it cannot discover multiple associations among allelic variants, because its underlying statistical analysis strategy considers one variant at a time. To overcome this limitation, we propose a new analysis methodology for DMET data based on association rule mining, and an efficient implementation of this methodology, named DMET-Miner. DMET-Miner extends the DMET-Analyzer tool with data mining capabilities and uses association rules to correlate the presence of a set of allelic variants with the conditions of patients' samples. To cope with the high number of frequent itemsets generated in large clinical studies based on DMET data, DMET-Miner uses an efficient data structure and implements an optimized search strategy that reduces both the search space and the execution time. Preliminary experiments on synthetic DMET datasets show that DMET-Miner outperforms off-the-shelf data mining suites such as the FP-Growth algorithms available in Weka and RapidMiner.
To demonstrate the biological relevance of the extracted association rules and the effectiveness of the proposed approach from a medical point of view, some preliminary studies on a real clinical dataset are currently under medical investigation. (C) 2015 Elsevier Inc. All rights reserved.
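As a rough illustration of the idea described in the abstract (rules that link a set of allelic variants to a sample condition), the following is a minimal sketch in Python. It is not DMET-Miner's implementation: it uses brute-force enumeration rather than the optimized data structure and search strategy the abstract mentions, and the probe names, genotypes, class labels, and thresholds are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy data: each sample is a set of "probe=genotype" items
# plus one class label. None of these names come from the paper.
samples = [
    {"probeA=A/G", "probeB=C/C", "class=case"},
    {"probeA=A/G", "probeB=C/C", "class=case"},
    {"probeA=A/G", "probeB=C/T", "class=control"},
    {"probeA=G/G", "probeB=C/C", "class=control"},
    {"probeA=A/G", "probeB=C/C", "class=case"},
    {"probeA=G/G", "probeB=C/T", "class=control"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    return sum(itemset <= t for t in transactions) / len(transactions)


def mine_rules(transactions, min_support=0.4, min_confidence=0.8, max_len=2):
    """Brute-force search for rules of the form {variants} -> class label.

    Assumption: antecedents contain only variant items and the consequent
    is a single class label. Real miners prune the search space far more
    aggressively than this exhaustive loop does.
    """
    items = {i for t in transactions for i in t}
    variants = sorted(i for i in items if not i.startswith("class="))
    labels = sorted(i for i in items if i.startswith("class="))

    rules = []
    for k in range(1, max_len + 1):
        for antecedent in combinations(variants, k):
            a = frozenset(antecedent)
            sup_a = support(a, transactions)
            if sup_a < min_support:
                continue  # infrequent antecedent, skip it
            for label in labels:
                sup_rule = support(a | {label}, transactions)
                if sup_rule < min_support:
                    continue
                confidence = sup_rule / sup_a
                if confidence >= min_confidence:
                    rules.append((antecedent, label, sup_rule, confidence))
    return rules


rules = mine_rules(samples)
for antecedent, label, sup, conf in rules:
    print(f"{set(antecedent)} -> {label} (support={sup:.2f}, confidence={conf:.2f})")
```

On this toy data neither variant alone reaches the confidence threshold, while the pair of variants does, which is the kind of multi-variant association that a one-variant-at-a-time test cannot express.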
Pages: 273-283
Number of pages: 11
Related papers
50 in total
  • [1] DMET-Miner: Efficient Learning of Association Rules from Genotyping Data for Personalized Medicine
    Guzzi, Pietro Hiram
    Agapito, Giuseppe
    Di Martino, Maria Teresa
    Arbitrio, Mariamena
    Tassone, Pierfrancesco
    Tagliaferri, Pierosandro
    Cannataro, Mario
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014
  • [2] Efficient data-structures and parallel algorithms for association rules discovery
    Cérin, C
    Gay, JS
    Le Mahec, GL
    Koskas, M
    PROCEEDINGS OF THE FIFTH MEXICAN INTERNATIONAL CONFERENCE IN COMPUTER SCIENCE (ENC 2004), 2004: 399-406
  • [3] Discovery of association rules in tabular data
    Richards, G
    Rayward-Smith, VJ
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001: 465-472
  • [4] Discovery of association rules in medical data
    Doddi, S
    Marathe, A
    Ravi, SS
    Torney, DC
    MEDICAL INFORMATICS AND THE INTERNET IN MEDICINE, 2001, 26 (01): 25-33
  • [5] Discovery of Association Rules from Data including Missing Values
    Sakurai, Shigeaki
    Mori, Kouichirou
    Orihara, Ryohei
    CISIS: 2009 INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, VOLS 1 AND 2, 2009: 67-74
  • [6] Discovery of Spatial Association Rules from Fuzzy Spatial Data
    da Silva, Henrique P.
    Felix, Thiago D. R.
    de Venancio, Pedro V. A. B.
    Carniel, Anderson C.
    CONCEPTUAL MODELING (ER 2022), 2022, 13607: 179-193
  • [7] Efficient discovery of statistically significant association rules
    Hamalainen, Wilhelmiina
    Nykanen, Matti
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008: 203-+
  • [8] PMCR-Miner: parallel maximal confident association rules miner algorithm for microarray data set
    Zakaria, Wael
    Kotb, Yasser
    Ghaleb, Fayed F. M.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (03): 225-247
  • [9] Discovery of interesting association rules from Livelink web log data
    Huang, XG
    An, AJ
    Cercone, N
    Promhouse, G
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002: 763-766
  • [10] Extraction of primitive motion and discovery of association rules from motion data
    Mori, T
    Uehara, K
    ROBOT AND HUMAN COMMUNICATION, PROCEEDINGS, 2001: 200-206