A scoping review of publicly available language tasks in clinical natural language processing

被引:12
|
作者
Gao, Yanjun [1 ]
Dligach, Dmitriy [2 ]
Christensen, Leslie [3 ]
Tesch, Samuel [3 ]
Laffin, Ryan [3 ]
Xu, Dongfang [4 ]
Miller, Timothy [4 ]
Uzuner, Ozlem [5 ]
Churpek, Matthew M. [1 ]
Afshar, Majid [1 ]
机构
[1] Univ Wisconsin, Dept Med, Sch Med & Publ Hlth, ICU Data Sci Lab, Madison, WI USA
[2] Loyola Univ, Dept Comp Sci, Chicago, IL 60611 USA
[3] Univ Wisconsin, Sch Med & Publ Hlth, Madison, WI USA
[4] Harvard Univ, Boston Childrens Hosp, Computat Hlth Informat Program, Boston, MA 02115 USA
[5] George Mason Univ, Dept Informat Sci & Technol, Fairfax, VA 22030 USA
关键词
natural language processing; clinical informatics; electronic health records; systematic review; clinical decision support; OF-THE-ART; SHARED TASKS; TEMPORAL RELATIONS; DE-IDENTIFICATION; RECORDS; TEXT;
D O I
10.1093/jamia/ocac127
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective To provide a scoping review of papers on clinical natural language processing (NLP) shared tasks that use publicly available electronic health record data from a cohort of patients. Materials and Methods We searched 6 databases, including biomedical research and computer science literature databases. A round of title/abstract screening and full-text screening were conducted by 2 reviewers. Our method followed the PRISMA-ScR guidelines. Results A total of 35 papers with 48 clinical NLP tasks met inclusion criteria between 2007 and 2021. We categorized the tasks by the type of NLP problems, including named entity recognition, summarization, and other NLP tasks. Some tasks were introduced as potential clinical decision support applications, such as substance abuse detection, and phenotyping. We summarized the tasks by publication venue and dataset type. Discussion The breadth of clinical NLP tasks continues to grow as the field of NLP evolves with advancements in language systems. However, gaps exist with divergent interests between the general domain NLP community and the clinical informatics community for task motivation and design, and in generalizability of the data sources. We also identified issues in data preparation. Conclusion The existing clinical NLP tasks cover a wide range of topics and the field is expected to grow and attract more attention from both general domain NLP and clinical informatics community. We encourage future work to incorporate multidisciplinary collaboration, reporting transparency, and standardization in data preparation. We provide a listing of all the shared task papers and datasets from this review in a GitLab repository.
引用
收藏
页码:1797 / 1806
页数:10
相关论文
共 50 条
  • [21] Robustness of GPT Large Language Models on Natural Language Processing Tasks
    Xuanting C.
    Junjie Y.
    Can Z.
    Nuo X.
    Tao G.
    Qi Z.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (05): : 1128 - 1142
  • [22] Natural Language Processing Applied to Clinical Documentation in Post-acute Care Settings: A Scoping Review
    Scharp, Danielle
    Hobensack, Mollie
    Davoudi, Anahita
    Topaz, Maxim
    JOURNAL OF THE AMERICAN MEDICAL DIRECTORS ASSOCIATION, 2024, 25 (01) : 69 - 83
  • [23] A Scoping Review of Ethics Considerations in Clinical Natural Language Processing (vol 5, pg 1, 2022)
    Walk, Oliver J. Bear Don't
    Nieva, Harry Reyes
    Lee, Sandra Soo-Jin
    Elhadad, Noemie
    JAMIA OPEN, 2022, 5 (03)
  • [24] Fear of falling: Scoping review and topic analysis using natural language processing
    Kolpashnikova, Kamila
    Harris, Laurence R.
    Desai, Shital
    PLOS ONE, 2023, 18 (10):
  • [25] A Scoping Literature Review of Natural Language Processing Application to Safety Occurrence Reports
    Ricketts, Jon
    Barry, David
    Guo, Weisi
    Pelham, Jonathan
    SAFETY, 2023, 9 (02)
  • [26] Evaluation of mCODE Coverage in EHR: a Scoping Review of Cancer Natural Language Processing
    Wang, Liwei
    Fu, Sunyang
    Wen, Andrew
    Ruan, Xiaoyang
    He, Huan
    Liu, Sijia
    Moon, Sungrim
    Mai, Michelle
    Riaz, Irbaz
    Wang, Nan
    Yang, Ping
    Xu, Hua
    Warner, Jeremy L.
    Liu, Hongfang
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 517 - 518
  • [27] Unlocking the Potential: A Comprehensive Systematic Review of ChatGPT in Natural Language Processing Tasks
    Alomari, Ebtesam Ahmad
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (01): : 43 - 85
  • [28] Benchmarking Large Language Model Performance on Natural Language Processing Tasks for Pharmacoepidemiology
    Feng, Hui
    Ronzano, Francesco
    LaFleur, JuDe
    Garber, Matthew L.
    de Oliveira, Rodrigo
    Roth, Katharine
    Rough, Kathryn
    Nanavati, Jay
    El Abidine, Khaldoun Zine
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 : 70 - 70
  • [29] Unsupervised multi-sense language models for natural language processing tasks
    Roh, Jihyeon
    Park, Sungjin
    Kim, Bo-Kyeong
    Oh, Sang-Hoon
    Lee, Soo-Young
    NEURAL NETWORKS, 2021, 142 : 397 - 409
  • [30] Spanish to Mexican Sign Language glosses corpus for natural language processing tasks
    Vania Lara-Ortiz
    Rita Q. Fuentes-Aguilar
    Isaac Chairez
    Scientific Data, 12 (1)