Machine learning approaches for phenotype-genotype mapping:: predicting heterozygous mutations in the CYP21B gene from steroid profiles

被引:3
|
作者
Prank, K [1 ]
Schulze, E
Eckert, O
Nattkemper, TW
Bettendorf, M
Maser-Gluth, C
Sejnowski, TJ
Grote, A
Penner, E
von zur Mühlen, A
Brabant, G
机构
[1] Univ Bielefeld, Int NRW Grad Sch Bioinformat, D-33615 Bielefeld, Germany
[2] Univ Bielefeld, Genome Res Ctr Biotechnol, D-33615 Bielefeld, Germany
[3] Hannover Med Sch, D-30625 Hannover, Germany
[4] Mol Genet Lab Raue, D-69121 Heidelberg, Germany
[5] Hannover Med Sch, Dept Visceral & Transplantat Surg, D-30623 Hannover, Germany
[6] Hannover Med Sch, Dept Clin Endocrinol, D-30623 Hannover, Germany
[7] Univ Bielefeld, Fac Technol, Appl Neuroinformat Grp, D-33615 Bielefeld, Germany
[8] Univ Heidelberg, Dept Pediat, D-69120 Heidelberg, Germany
[9] Univ Heidelberg, Dept Pharmacol, D-69120 Heidelberg, Germany
[10] Salk Inst, Howard Hughes Med Inst, San Diego, CA 92186 USA
[11] Salk Inst, Computat Neurobiol Lab, San Diego, CA 92186 USA
关键词
D O I
10.1530/eje.1.01957
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: Non-linear relations between multiple biochemical parameters are the basis for the diagnosis of many diseases. Traditional linear analytical methods are not reliable predictors. Novel nonlinear techniques are increasingly used to improve the diagnostic accuracy of automated data interpretation. This has been exemplified in particular for the classification and diagnostic prediction of cancers based on expression profiling data. Our objective was to predict the genotype from complex biochemical data by comparing the performance of experienced clinicians to traditional linear analysis, and to novel non-linear analytical methods. Design and methods: As a model, we used a well-defined set of interconnected data consisting of unstimulated serum levels of steroid intermediates assessed in 54 subjects heterozygous for a mutation of the 21-hydroxylase gene (CYP21B) and in 43 healthy controls. Results: The genetic alteration was predicted from the pattern of steroid levels with an accuracy of 39% by clinicians and of 64% by linear analysis. In contrast, non-linear analysis, such as self-organizing artificial neural networks, support vector machines, and nearest neighbour classifiers, allowed for higher accuracy up to 83%. Conclusions: The successful application of these non-linear adaptive methods to capture specific biochemical problems may have generalized implications for biochemical testing in many areas. Nonlinear analytical techniques such as neural networks, support vector machines, and nearest neighbour classifiers may serve as an important adjunct to the decision process of a human investigator not ' trained ' in a specific complex clinical or laboratory setting and may aid them to classify the problem more directly.
引用
收藏
页码:301 / 305
页数:5
相关论文
共 4 条
  • [1] Mutations of CYP21B gene and genotype phenotype correlation in Czech patients with steroid 21-hydroxylase deficiency.
    Prusa, R
    Lisa, L
    Snajderova, M
    Kolouskova, S
    Boday, A
    CLINICAL CHEMISTRY, 1997, 43 : 763 - 763
  • [2] Mutations in CYP11B1 gene:: Phenotype-genotype correlations
    Zhu, YS
    Cordero, JJ
    Can, S
    Cai, LQ
    You, XK
    Herrera, C
    DeFillo-Ricart, M
    Shackleton, C
    Imperato-McGinley, J
    AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2003, 122A (03): : 193 - 200
  • [3] Nonisotopic detection of point mutations in CYP21B gene in steroid 21-hydroxylase deficiency
    Ezquieta, B
    Varela, JM
    Jariego, C
    Oliver, A
    Gracia, R
    CLINICAL CHEMISTRY, 1996, 42 (07) : 1108 - 1110
  • [4] Mutation screening of the CYP1B1 gene and phenotype-genotype correlation in Primary Congenital Glaucoma cases from Brazil.
    Stoilov, IR
    Costa, VP
    Vasconcellos, JPC
    Mello, MB
    Betinjane, AJ
    Carani, JCE
    Oltrogge, EV
    Sarfarazi, M
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2001, 42 (04) : S530 - S530