Using text mining to extract depressive symptoms and to validate the diagnosis of major depressive disorder from electronic health records

被引:74
|
作者
Wu, Chi-Shin [1 ,2 ]
Kuo, Chian-Jue [3 ,4 ]
Su, Chu-Hsien [1 ]
Wang, Shi-Heng [5 ,6 ]
Dai, Hong-Jie [7 ,8 ]
机构
[1] Natl Taiwan Univ Hosp, Dept Psychiat, Taipei, Taiwan
[2] Natl Taiwan Univ, Coll Med, Taipei, Taiwan
[3] Taipei City Hosp, Taipei City Psychiat Ctr, Taipei, Taiwan
[4] Taipei Med Univ, Coll Med, Sch Med, Dept Psychiat, Taipei, Taiwan
[5] China Med Univ, Dept Publ Hlth, Taichung, Taiwan
[6] China Med Univ, Dept Occupat Safety & Hlth, Taichung, Taiwan
[7] Natl Kaohsiung Univ Sci & Technol, Dept Elect Engn, 415 Jiangong Rd, Kaohsiung 80778, Taiwan
[8] Kaohsiung Med Univ, Coll Med, Sch Postbaccalaureate Med, Kaohsiung, Taiwan
关键词
Text mining; Information extraction; Major depressive disorder; PREVALENCE; PSYCHIATRY; TAIWAN;
D O I
10.1016/j.jad.2019.09.044
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Background: Many studies have used Taiwan's National Health Insurance Research database (NHIRD) to conduct psychiatric research. However, the accuracy of the diagnostic codes for psychiatric disorders in NHIRD is not validated, and the symptom profiles are not available either. This study aimed to evaluate the accuracy of diagnostic codes and use text mining to extract symptom profile and functional impairment from electronic health records (EHRs) to overcome the above research limitations. Methods: A total of 500 discharge notes were randomly selected from a medical center's database. Three annotators reviewed the notes to establish gold standards. The accuracy of diagnostic codes for major psychiatric illness was evaluated. Text mining approaches were applied to extract depressive symptoms and function profiles and to identify patients with major depressive disorder. Results: The accuracy of the diagnostic code for major depressive disorder, schizophrenia, and dementia was acceptable but that of bipolar disorder and minor depression was less satisfactory. The performance of text mining approach to recognize depressive symptoms is satisfactory; however, the recall for functional impairment is lower resulting in lower F-scores of 0.774-0.753. Using the text mining approach to identify major depressive disorder, the recall was 0.85 but precision was only 0.69. Conclusions: The accuracy of the diagnostic code for major depressive disorder in discharge notes was generally acceptable. This finding supports the utilization of psychiatric diagnoses in claims databases. The application of text mining to EHRs might help in overcoming current limitations in research using claims databases.
引用
收藏
页码:617 / 623
页数:7
相关论文
共 50 条
  • [1] Using Electronic Health Records Driven Phenotyping for Major Depressive Disorder
    Pathak, Jyotishman
    Hall-Flavin, Daniel K.
    Biernacka, Joanna M.
    Jenkins, Gregory D.
    Bruce, Kevin T.
    Murphy, Sean P.
    Sagen, Jessica A.
    Skime, Michelle K.
    Bobo, William V.
    Chute, Christopher G.
    Wang, Leiwei
    Weinshilboum, Richard M.
    [J]. BIOLOGICAL PSYCHIATRY, 2014, 75 (09) : 343S - 343S
  • [2] Developing an individualized treatment rule for Veterans with major depressive disorder using electronic health records
    Zainal, Nur Hani
    Bossarte, Robert M.
    Gildea, Sarah M.
    Hwang, Irving
    Kennedy, Chris J.
    Liu, Howard
    Luedtke, Alex
    Marx, Brian P.
    Petukhova, Maria V.
    Post, Edward P.
    Ross, Eric L.
    Sampson, Nancy A.
    Sverdrup, Erik
    Turner, Brett
    Wager, Stefan
    Kessler, Ronald C.
    [J]. MOLECULAR PSYCHIATRY, 2024,
  • [3] DEPRESSIVE SYMPTOMS AND FUNCTIONAL IMPAIRMENTS EXTRACTION FROM ELECTRONIC HEALTH RECORDS
    Zhang, You-Chen
    Chung, Wei-Che
    Li, Kuei-Han
    Huang, Cheng-Chieh
    Dai, Hong-Jie
    Wu, Chi-Shin
    Kuo, Chian-Jue
    Su, Chu-Hsien
    Yang, Horng-Chang
    Lee, Chung-Hong
    Liang, Tyng-Yeu
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 581 - 586
  • [4] Identifying data-driven subtypes of major depressive disorder with electronic health records
    Sharma, Abhishek
    Verhaak, Pilar F.
    McCoy, Thomas H.
    Perlis, Roy H.
    Doshi-Velez, Finale
    [J]. JOURNAL OF AFFECTIVE DISORDERS, 2024, 356 : 64 - 70
  • [5] Efficiently identifying individuals at high risk for treatment resistance in major depressive disorder using electronic health records
    Lage, Isaac
    McCoy, Thomas H. Jr Jr
    Perlis, Roy H.
    Doshi-Velez, Finale
    [J]. JOURNAL OF AFFECTIVE DISORDERS, 2022, 306 : 254 - 259
  • [6] Difficulties in the diagnosis and treatment of major depressive disorder with comorbid anxiety symptoms
    Akkaya, Cengiz
    [J]. TURK PSIKIYATRI DERGISI, 2006, 17 (02) : 139 - 146
  • [7] Typical somatic symptoms of pregnancy and their impact on a diagnosis of major depressive disorder
    Yonkers, Kimberly Ann
    Smith, Megan V.
    Gotman, Nathan
    Belanger, Kathleen
    [J]. GENERAL HOSPITAL PSYCHIATRY, 2009, 31 (04) : 327 - 333
  • [8] Quantifying the impact of chronic conditions on a diagnosis of major depressive disorder in adults: a cohort study using linked electronic medical records
    Euijung Ryu
    Alanna M. Chamberlain
    Richard S. Pendegraft
    Tanya M. Petterson
    William V. Bobo
    Jyotishman Pathak
    [J]. BMC Psychiatry, 16
  • [9] Quantifying the impact of chronic conditions on a diagnosis of major depressive disorder in adults: a cohort study using linked electronic medical records
    Ryu, Euijung
    Chamberlain, Alanna M.
    Pendegraft, Richard S.
    Petterson, Tanya M.
    Bobo, William V.
    Pathak, Jyotishman
    [J]. BMC PSYCHIATRY, 2016, 16
  • [10] Natural Language Processing for Automatic Identification of Major Depressive Disorders in Free-Text Electronic Health Records
    Nunez, Nicolas
    Biernacka, Joanna M.
    Gardea-Resendez, Manuel
    Kshatriya, Bhavani Singh Agnikula
    Ryu, Euijung
    Fu, Sunyang
    Singh, Balwinder
    Coombes, Brandon
    Frye, Mark
    Wang, Yanshan
    [J]. BIOLOGICAL PSYCHIATRY, 2021, 89 (09) : S155 - S155