A 2-phased approach for detecting multiple loci associations with traits

Cited by: 1
Authors
Lee, Sunwon [1]
Kang, Jaewoo [1]
Oh, Junho [1]
Affiliations
[1] Korea Univ, Coll Informat & Commun, Seoul 136713, South Korea
Funding
National Research Foundation, Singapore;
Keywords
TF-IDF; term frequency - inverse document frequency; class association rule mining; GWAS; SNP; bioinformatics; apriori algorithm; data mining; GENOME-WIDE ASSOCIATION; INFERENCE; SNPS;
DOI
10.1504/IJDMB.2012.049318
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Recent advances in SNP genotyping have significantly reduced the cost of large-scale genotyping and have dramatically increased the size of SNP genotype data. This growth in data volume, however, poses a major obstacle to conventional analysis techniques, which are typically vulnerable to the high-dimensionality problem. To address the issue, we propose a method that exploits two well-tested models: the document-term model and the transaction analysis model. The proposed method consists of two phases. In the first phase, we reduce the dimensionality of the SNP genotype data by transforming the data according to the document-term model and extracting the significant SNPs. In the second phase, we apply transactional analysis to the reduced-dimension genotype data to discover association rules that capture the relations between the SNPs and the traits. We validated the discovered rules through a literature survey. Experiments carried out on the HGDP panel data provided by the Foundation Jean Dausset-CEPH demonstrate the validity of the method, both for dimension reduction and for identifying associations between multiple SNPs and traits. This paper is an extended version of our workshop paper presented at the 2010 International Workshop on Data Mining for High Throughput Data from Genome-Wide Association Studies.
Pages: 535-556
Number of pages: 22
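Illustrative sketch
The two phases described in the abstract can be illustrated with a small, self-contained Python sketch. It is not the authors' implementation. The toy genotype data and the names `SAMPLES`, `tfidf_scores`, `select_snps` and `mine_rules` are all hypothetical. The sketch assumes that each trait class is treated as one "document" and each (SNP, genotype) pair as a "term", which is only one possible reading of the document-term model. Phase 2 is a plain level-wise (apriori-style) search for class association rules with assumed support and confidence thresholds.

```python
import math
from collections import Counter
from itertools import combinations

# Toy data (made up for illustration): (genotype per SNP, trait label).
SAMPLES = [
    ({"rs1": "AA", "rs2": "CT", "rs3": "GG"}, "case"),
    ({"rs1": "AA", "rs2": "CC", "rs3": "GG"}, "case"),
    ({"rs1": "AA", "rs2": "CT", "rs3": "GT"}, "case"),
    ({"rs1": "AG", "rs2": "CT", "rs3": "GG"}, "control"),
    ({"rs1": "GG", "rs2": "CC", "rs3": "GG"}, "control"),
    ({"rs1": "AG", "rs2": "CT", "rs3": "GT"}, "control"),
]


def tfidf_scores(samples):
    """Phase 1: TF-IDF of (snp, genotype) terms, one 'document' per trait."""
    docs = {}
    for genotypes, trait in samples:
        docs.setdefault(trait, Counter()).update(genotypes.items())
    # Document frequency: in how many trait classes does each term occur?
    df = Counter(term for counts in docs.values() for term in counts)
    scores = {}
    for trait, counts in docs.items():
        total = sum(counts.values())
        for term, count in counts.items():
            idf = math.log(len(docs) / df[term])
            scores[(trait, term)] = (count / total) * idf
    return scores


def select_snps(samples, top_k):
    """Keep the top_k SNPs ranked by their best TF-IDF score in any class."""
    best = {}
    for (_, (snp, _)), score in tfidf_scores(samples).items():
        best[snp] = max(best.get(snp, 0.0), score)
    return sorted(best, key=lambda snp: (-best[snp], snp))[:top_k]


def mine_rules(samples, snps, min_support=0.3, min_confidence=0.8, max_len=2):
    """Phase 2: level-wise search for rules {snp=genotype, ...} -> trait."""
    n = len(samples)
    transactions = [
        (frozenset((s, g[s]) for s in snps), trait) for g, trait in samples
    ]
    items = sorted({item for t, _ in transactions for item in t})
    rules, frequent_prev = [], None
    for size in range(1, max_len + 1):
        frequent_now = set()
        for combo in combinations(items, size):
            itemset = frozenset(combo)
            # Apriori pruning: every (size-1)-subset must be frequent.
            if frequent_prev is not None and any(
                itemset - {i} not in frequent_prev for i in itemset
            ):
                continue
            covered = [trait for t, trait in transactions if itemset <= t]
            if len(covered) / n < min_support:
                continue
            frequent_now.add(itemset)
            for trait, count in Counter(covered).items():
                support, confidence = count / n, count / len(covered)
                if support >= min_support and confidence >= min_confidence:
                    rules.append(
                        (tuple(sorted(itemset)), trait, support, confidence)
                    )
        frequent_prev = frequent_now
    return rules


if __name__ == "__main__":
    kept = select_snps(SAMPLES, top_k=2)
    for antecedent, trait, support, confidence in mine_rules(SAMPLES, kept):
        print(antecedent, "->", trait, round(support, 2), round(confidence, 2))
```

Because phase 2 only sees the SNPs kept by phase 1, the number of candidate itemsets grows with the reduced dimension and not with the full genotype panel, which is the point of running the two phases in this order.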