Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank

Cited by: 17
Authors
Pathak, Jyotishman [1]
Kiefer, Richard C. [2]
Bielinski, Suzette J. [3]
Chute, Christopher G. [1]
Affiliations
[1] Mayo Clin, Dept Hlth Sci Res, Div Biomed Stat & Informat, Rochester, MN 55905 USA
[2] Mayo Clin, Dept Informat Technol, Rochester, MN USA
[3] Mayo Clin, Div Epidemiol, Dept Hlth Sci Res, Rochester, MN USA
Source
Keywords
MEDICAL-RECORDS; CANCER RISK; SERUM TSH; GENOME; ASSOCIATION; GENE; VARIANTS; HYPOTHYROIDISM; SUSCEPTIBILITY; POPULATION;
DOI
10.1186/2041-1480-3-10
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: Genome-wide association studies (GWAS) have enabled new exploration of how genetic variation contributes to health and disease etiology. Historically, however, GWAS have been limited by inadequate sample sizes because of the cost of genotyping and phenotyping study subjects. This has prompted several academic medical centers to form "biobanks", where biospecimens linked to personal health information, typically in electronic health records (EHRs), are collected and stored for a large number of subjects. Such biobanks provide tremendous opportunities to discover novel genotype-phenotype associations and to foster hypothesis generation.
Results: In this work, we study how emerging Semantic Web technologies can be applied to clinical and genotype data stored at the Mayo Clinic Biobank to mine the phenotype data for genetic associations. In particular, we demonstrate the use of the Resource Description Framework (RDF) to represent EHR diagnosis and procedure data, and the use of federated querying via standardized Web protocols to identify subjects genotyped for Type 2 Diabetes and Hypothyroidism in order to discover gene-disease associations. Our study highlights the potential of Web-scale data federation techniques to execute complex queries.
Conclusions: This study demonstrates how Semantic Web technologies can be applied to clinical data stored in EHRs to accurately identify subjects with specific diseases and phenotypes, and to identify genotype-phenotype associations.
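As a rough illustration of the idea in the abstract (diagnoses expressed as RDF-style triples, then combined with genotype data held in a separate source), the following is a minimal Python sketch. It is not the authors' implementation: the two in-memory sets stand in for separate query endpoints, and every identifier (the `ex:` predicates, patient IDs, diagnosis codes, and variant labels) is a hypothetical placeholder chosen for this example.

```python
from typing import Dict, List, Optional, Set, Tuple

# A triple is (subject, predicate, object), mirroring the RDF data model.
Triple = Tuple[str, str, str]

# Hypothetical "clinical" source: EHR diagnoses as triples.
# The codes are illustrative strings, not data from the study.
CLINICAL: Set[Triple] = {
    ("patient:1", "ex:hasDiagnosis", "code:T2D"),
    ("patient:2", "ex:hasDiagnosis", "code:HYPOTHYROID"),
    ("patient:3", "ex:hasDiagnosis", "code:T2D"),
}

# Hypothetical "genotype" source, kept separate to mimic a second endpoint.
GENOTYPE: Set[Triple] = {
    ("patient:1", "ex:hasGenotype", "variantA:CT"),
    ("patient:2", "ex:hasGenotype", "variantA:CC"),
    ("patient:4", "ex:hasGenotype", "variantA:TT"),
}


def match(store: Set[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> List[Triple]:
    """Return all triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in store
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def genotyped_subjects_with(diagnosis: str) -> Dict[str, str]:
    """Join the two sources on subject ID, as a federated query would."""
    diagnosed = {s for s, _, _ in match(CLINICAL, p="ex:hasDiagnosis", o=diagnosis)}
    return {
        s: genotype
        for s, _, genotype in match(GENOTYPE, p="ex:hasGenotype")
        if s in diagnosed
    }


if __name__ == "__main__":
    print(genotyped_subjects_with("code:T2D"))
```

In a real deployment the join would more likely be expressed as a SPARQL query against remote endpoints. The sketch only shows the underlying pattern: match triples in each source, then intersect on the shared subject identifier.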
Pages: 17
Related papers
50 items in total
  • [31] Phenome-wide association study of genetically predicted B vitamins and homocysteine biomarkers with multiple health and disease outcomes: analysis of the UK Biobank
    Wang, Lijuan
    Li, Xue
    Montazeri, Azita
    MacFarlane, Amanda J.
    Momoli, Franco
    Duthie, Susan
    Senekal, Marjanne
    Eguiagaray, Ines Mesa
    Munger, Ron
    Bennett, Derrick
    Campbell, Harry
    Rubini, Michele
    McNulty, Helene
    Little, Julian
    Theodoratou, Evropi
    AMERICAN JOURNAL OF CLINICAL NUTRITION, 2023, 117 (03): : 564 - 575
  • [32] Leveraging electronic healthcare record standards and semantic web technologies for the identification of patient cohorts
    Tomas Fernandez-Breis, Jesualdo
    Alberto Maldonado, Jose
    Marcos, Mar
    del Carmen Legaz-Garcia, Maria
    Moner, David
    Torres-Sospedra, Joaquin
    Esteban-Gil, Angel
    Martinez-Salvador, Begona
    Robles, Montserrat
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) : E288 - E296
  • [33] Health behaviors and quality of life predictors for risk of hospitalization in an electronic health record-linked biobank
    Takahashi, Paul Y.
    Ryu, Euijung
    Olson, Janet E.
    Winkler, Erin M.
    Hathcock, Matthew A.
    Gupta, Ruchi
    Sloan, Jeff A.
    Pathak, Jyotishman
    Bielinski, Suzette J.
    Cerhan, James R.
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2015, 8 : 247 - 254
  • [34] Applying Semantic Web technologies to improve the retrieval, credibility and use of health-related web resources
    Mayer, Miguel A.
    Karampiperis, Pythagoras
    Kukurikos, Antonis
    Karkaletsis, Vangelis
    Stamatakis, Kostas
    Villarroel, Dagmar
    Leis, Angela
    HEALTH INFORMATICS JOURNAL, 2011, 17 (02) : 95 - 115
  • [35] Genetically Determined Chronic Low-Grade Inflammation and Hundreds of Health Outcomes in the UK Biobank and the FinnGen Population: A Phenome-Wide Mendelian Randomization Study
    Si, Shucheng
    Li, Jiqing
    Tewara, Marlvin Anemey
    Xue, Fuzhong
    FRONTIERS IN IMMUNOLOGY, 2021, 12
  • [36] The Geisinger MyCode community health initiative: an electronic health record-linked biobank for precision medicine research
    Carey, David J.
    Fetterolf, Samantha N.
    Davis, Daniel
    Faucett, William A.
    Kirchner, H. Lester
    Mirshahi, Uyenlinh
    Murray, Michael F.
    Smelser, Diane T.
    Gerhard, Glenn S.
    Ledbetter, David H.
    GENETICS IN MEDICINE, 2016, 18 (09) : 906 - 913
  • [37] Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
    Ruth Johnson
    Yi Ding
    Vidhya Venkateswaran
    Arjun Bhattacharya
    Kristin Boulier
    Alec Chiu
    Sergey Knyazev
    Tommer Schwarz
    Malika Freund
    Lingyu Zhan
    Kathryn S. Burch
    Christa Caggiano
    Brian Hill
    Nadav Rakocz
    Brunilda Balliu
    Christopher T. Denny
    Jae Hoon Sul
    Noah Zaitlen
    Valerie A. Arboleda
    Eran Halperin
    Sriram Sankararaman
    Manish J. Butte
    Clara Lajonchere
    Daniel H. Geschwind
    Bogdan Pasaniuc
    Genome Medicine, 14
  • [38] Identifying the potential causal role of insomnia symptoms on 11,409 health-related outcomes: a phenome-wide Mendelian randomisation analysis in UK Biobank
    Gibson, Mark J.
    Lawlor, Deborah A.
    Millard, Louise A. C.
    BMC MEDICINE, 2023, 21 (01)
  • [40] PHENOME-WIDE ASSOCIATION STUDY OF SMOKING AND CAFFEINE USE ON MENTAL HEALTH OUTCOMES USING DATA FROM THE ALSPAC PREGNANCY COHORT
    Haan, Elis
    Schellhas, Laura
    Sallis, Hannah
    Munafo, Marcus
    Zuccolo, Luisa
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2019, 29 : S17 - S17