Information Extraction from Microarray Data: A Survey of Data Mining Techniques

被引:4
|
作者
Fiori, Alessandro [1 ]
Grand, Alberto [1 ]
Bruno, Giulia [2 ]
Brundu, Francesco Gavino [3 ]
Schioppa, Domenico [4 ]
Bertotti, Andrea [4 ]
机构
[1] IRCC, Inst Canc Res & Tretment, Candiolo, Italy
[2] Politecn Torino, Dipartimento Ingn Gest & Prod, Turin, Italy
[3] Politecn Torino, Dipartimento Automat & Informat, Turin, Italy
[4] Univ Turin, Sch Med, Dept Oncol, Candiolo, Italy
关键词
Association Rules; Classification; Clustering; Data Analysis; Data Mining; Data Normalization; Feature Selection; Microarray; GENE-EXPRESSION DATA; NONNEGATIVE MATRIX FACTORIZATION; C-MEANS METHOD; FEATURE-SELECTION; CLUSTERING ANALYSIS; ASSOCIATION RULES; CLASSIFICATION METHODS; TUMOR SUBTYPES; MARKER GENES; CANCER;
D O I
10.4018/jdm.2014010102
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, a huge amount of high throughput molecular data are available for analysis and provide novel and useful insights into complex biological systems, through the acquisition of a high-resolution picture of their molecular status in defined experimental conditions. In this context, microarrays are a powerful tool to analyze thousands of gene expression values with a single experiment. A number of approaches have been developed to detecting genes highly correlated to diseases, selecting genes that exhibit a similar behavior under specific conditions, building models to predict disease outcome based on genetic profiles, and inferring regulatory networks. This paper discusses popular and recent data mining techniques (i.e., Feature Selection, Clustering, Classification, and Association Rule Mining) applied to microarray data. The main characteristics of microarray data and preprocessing procedures are presented to understand the critical issues introduced by gene expression values analysis. Each technique is analyzed, and relevant examples of pertinent literature are reported. Moreover, real use cases exploiting analytic pipelines that use these methods are also introduced. Finally, future directions of data mining research on microarray data are envisioned.
引用
收藏
页码:29 / 58
页数:30
相关论文
共 50 条
  • [1] Data mining techniques for microarray datasets
    Liu, L
    Yang, J
    Tung, AKH
    [J]. ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 1149 - 1149
  • [2] On the classification techniques in data mining for microarray data classification
    Aydadenta, Husna
    Adiwijaya
    [J]. INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE (ICODIS), 2018, 971
  • [3] EXTRACTION OF USEFUL INFORMATION OF A PATIENT USING DATA MINING TECHNIQUES
    Revathi, T.
    Rajesh, Sumathi
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 285 - 289
  • [4] A survey of data mining techniques
    Maojo, V
    Sanandrés, J
    [J]. MEDICAL DATA ANALYSIS, PROCEEDINGS, 2000, 1933 : 17 - 22
  • [5] Data Mining Techniques for IoT and Big Data -A Survey
    Shobanadevi, A.
    Maragatham, G.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2017), 2017, : 66 - 78
  • [6] A survey on data mining and pattern recognition techniques for soil data mining
    Ashok Kumar, D.
    Kannathasan, N.
    [J]. International Journal of Computer Science Issues, 2011, 8 (3 3-1): : 422 - 428
  • [7] Scholarly Big Data: Information Extraction and Data Mining
    Giles, C. Lee
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1 - 1
  • [8] Survey of Classification Techniques in Data Mining
    Phyu, Thair Nu
    [J]. IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 727 - 731
  • [9] Clustering techniques in data mining - A survey
    Pujari, AK
    Rajesh, K
    Reddy, DS
    [J]. IETE JOURNAL OF RESEARCH, 2001, 47 (1-2) : 19 - 28
  • [10] Big Data Analytics Using Data Mining Techniques: A Survey
    Mittal, Shweta
    Sangwan, Om Prakash
    [J]. ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 264 - 273