Mining Textual Data from Primary Healthcare Records - Automatic Identification of Patient Phenotype Cohorts

被引:0
|
作者
Zhou, Shang-Ming [1 ]
Rahman, Muhammad A. [1 ]
Atkinson, Mark [1 ]
Brophy, Sinead [1 ]
机构
[1] Swansea Univ, Publ Hlth Informat Grp, Coll Med, Swansea SA2 8PP, W Glam, Wales
关键词
ANKYLOSING-SPONDYLITIS; VALIDITY; DIAGNOSES; OUTCOMES; DISEASE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to advances of the "omics" technologies, rich sources of clinical, biomedical, contextual, and environmental data about each patient have been available in medical and health sciences. However, an enormous amount of electronic health records is actually generated as textual data, such as descriptive terms/concepts. No doubt, efficiently harnessing these valuable textual data would allow doctors and nurses to identify the most appropriate treatments and the predicted outcomes for a given patient in real time. We used textual data to identify patient phenotypes from UK primary care records that were managed by Read codes (a clinical classification system). The fine granularity level of Read codes leads to a huge number of clinical terms to be handled. Unfortunately, traditional medical statistics methods have struggled to process this sort of data effectively. In this paper, we described how the problem of patient phenotype identification can be transformed into document classification task, a text mining scheme is addressed to integrate feature ranking methods and genetic algorithm to identify the most parsimonious subset of features that still holds the capacity of characterizing the distinction of patient phenotypes. The experimental results have demonstrated that compact feature sets with 2 or 3 important terms describing clinical events were effectively identified from 16852 Read codes while their classification accuracy remained high level of agreements with specialists from secondary care in classifying testing samples.
引用
收藏
页码:3621 / 3627
页数:7
相关论文
共 50 条
  • [1] Organizing standardized electronic healthcare records data for mining
    Batra, Shivani
    Sachdeva, Shelly
    [J]. HEALTH POLICY AND TECHNOLOGY, 2016, 5 (03) : 226 - 242
  • [2] A review of approaches to identifying patient phenotype cohorts using electronic health records
    Shivade, Chaitanya
    Raghavan, Preethi
    Fosler-Lussier, Eric
    Embi, Peter J.
    Elhadad, Noemie
    Johnson, Stephen B.
    Lai, Albert M.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (02) : 221 - 230
  • [3] Results from data mining in IMS disease analyzer patient records
    Bate, Andrew
    Noren, Niklas G.
    Star, Kristina
    Hopstadius, Johan
    Edwards, Ralph I.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2007, 16 : S256 - S256
  • [4] Mining causality knowledge from textual data
    Pechsiri, C
    Kawtrakul, A
    Piriyakul, R
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2006, : 85 - +
  • [5] Mining explanation knowledge from textual data
    Pechsiri, Chaveevan
    Kawtrakul, Asance
    Piriyakul, Rapepun
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND TECHNOLOGY, 2006, : 322 - +
  • [6] Data Mining for Automatic Communication Behaviors Identification
    Chen, Y. J.
    Wu, J. L.
    Yang, H. M.
    [J]. 6TH WORLD CONGRESS OF BIOMECHANICS (WCB 2010), PTS 1-3, 2010, 31 : 91 - +
  • [7] Textual analysis and visualization of research trends in data mining for electronic health records
    Chen, Jingfeng
    Wei, Wei
    Guo, Chonghui
    Tang, Lin
    Sun, Leilei
    [J]. HEALTH POLICY AND TECHNOLOGY, 2017, 6 (04) : 389 - 400
  • [8] Patient Knowledge and Data Privacy in Healthcare Records System
    Mireku, Kissi Kingsford
    Zhang Fengli
    Komlan, Gbongli
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS, COMPUTING AND IT APPLICATIONS (CSCITA), 2017, : 154 - 159
  • [9] DATA FROM PATIENT RECORDS
    DOYAL, L
    [J]. BRITISH DENTAL JOURNAL, 1995, 178 (12) : 448 - 448
  • [10] Applying data mining techniques to improve data quality of patient records
    Gorla, N
    Bennon, CYK
    [J]. ISSUES AND TRENDS OF INFORMATION TECHNOLOGY MANAGEMENT IN CONTEMPORARY ORGANIZATIONS, VOLS 1 AND 2, 2002, : 237 - 241