Dynamic categorization of clinical research eligibility criteria by hierarchical clustering

被引:34
|
作者
Luo, Zhihui [1 ]
Yetisgen-Yildiz, Meliha [2 ]
Weng, Chunhua [1 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
[2] Univ Washington, Seattle, WA 98195 USA
关键词
Clinical research eligibility criteria; Classification; Hierarchical clustering; Knowledge representation; Unified Medical Language System (UMLS); Machine learning; Feature representation; CLASSIFICATION;
D O I
10.1016/j.jbi.2011.06.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: To semi-automatically induce semantic categories of eligibility criteria from text and to automatically classify eligibility criteria based on their semantic similarity. Design: The UMLS semantic types and a set of previously developed semantic preference rules were utilized to create an unambiguous semantic feature representation to induce eligibility criteria categories through hierarchical clustering and to train supervised classifiers. Measurements: We induced 27 categories and measured the prevalence of the categories in 27,278 eligibility criteria from 1578 clinical trials and compared the classification performance (i.e., precision, recall, and F1-score) between the UMLS-based feature representation and the "bag of words" feature representation among five common classifiers in Weka, including J48, Bayesian Network, Naive Bayesian, Nearest Neighbor, and instance-based learning classifier. Results: The UMLS semantic feature representation outperforms the "bag of words" feature representation in 89% of the criteria categories. Using the semantically induced categories, machine-learning classifiers required only 2000 instances to stabilize classification performance. The J48 classifier yielded the best F1-score and the Bayesian Network classifier achieved the best learning efficiency. Conclusion: The UMLS is an effective knowledge source and can enable an efficient feature representation for semi-automated semantic category induction and automatic categorization for clinical research eligibility criteria and possibly other clinical text. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:927 / 935
页数:9
相关论文
共 50 条
  • [41] ASTEC: Automatic selection of clinical trials based on eligibility criteria
    Cuggia, M.
    Dufour, J. -C.
    Zekri, O.
    Gibaud, I.
    Garde, C.
    Bohec, C.
    Duvauferrier, R.
    Fieschi, D.
    Besana, P.
    Charlois, L.
    Bourde, A.
    Garcelon, N.
    Laurent, J.
    Fieschi, M.
    Dameron, O.
    [J]. IRBM, 2012, 33 (02) : 150 - 164
  • [42] Eligibility criteria related to hormone therapy in acne clinical trials
    DeGrazia, Taryn
    Rolader, Robin
    Thiboutot, Diane
    Yeung, Howa
    [J]. JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2020, 83 (06) : AB121 - AB121
  • [43] Participatory Design of a Clinical Trial Eligibility Criteria Simplification Method
    Fang, Yilu
    Kim, Jae Hyun
    Idnay, Betina Ross
    Garcia, Rebeca Aragon
    Castillo, Carmen E.
    Sun, Yingcheng
    Liu, Hao
    Liu, Cong
    Yuan, Chi
    Weng, Chunhua
    [J]. PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 984 - 988
  • [44] Refining eligibility criteria for amyotrophic lateral sclerosis clinical trials
    van Eijk, Ruben P. A.
    Westeneng, Henk-Jan
    Nikolakopoulos, Stavros
    Verhagen, Iris E.
    van Es, Michael A.
    Eijkemans, Marinus J. C.
    van den Berg, Leonard H.
    [J]. NEUROLOGY, 2019, 92 (05) : E451 - E460
  • [45] Eligibility criteria for therapeutic hypothermia: From trials to clinical practice
    Mehta, Shailender
    Joshi, Anjali
    Bajuk, Barbara
    Badawi, Nadia
    McIntyre, Sarah
    Lui, Kei
    [J]. JOURNAL OF PAEDIATRICS AND CHILD HEALTH, 2017, 53 (03) : 295 - 300
  • [46] Chia, a large annotated corpus of clinical trial eligibility criteria
    Kury, Fabricio
    Butler, Alex
    Yuan, Chi
    Fu, Li-heng
    Sun, Yingcheng
    Liu, Hao
    Sim, Ida
    Carini, Simona
    Weng, Chunhua
    [J]. SCIENTIFIC DATA, 2020, 7 (01)
  • [47] Eligibility criteria in knee osteoarthritis clinical trials: systematic review
    Yun Hyung Koog
    Hyungsun Wi
    Won Young Jung
    [J]. Clinical Rheumatology, 2013, 32 : 1569 - 1574
  • [48] Data mining for text categorization with semi-supervised agglomerative hierarchical clustering
    Skarmeta, AG
    Bensaid, A
    Tazi, N
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2000, 15 (07) : 633 - 646
  • [49] Clinical Trial Eligibility Criteria: A Structural Barrier to Diversity in Clinical Trial Enrollment
    Snyder, Rebecca A.
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2022, 40 (20) : 2183 - +
  • [50] Finding structure in diversity: A hierarchical clustering-method for the categorization of allographs in handwriting
    Vuurpijl, L
    Schomaker, L
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 387 - 393