Towards data-driven medical imaging using natural language processing in patients with suspected urolithiasis

被引:12
|
作者
Jungmann, Florian [1 ]
Kaempgen, Benedikt [2 ]
Mildenberger, Philipp [3 ]
Tsaur, Igor [4 ]
Jorg, Tobias [1 ]
Dueber, Christoph [1 ]
Mildenberger, Peter [1 ]
Kloeckner, Roman [1 ]
机构
[1] Johannes Gutenberg Univ Mainz, Dept Diagnost & Intervent Radiol, Univ Med Ctr, Langenbeckst 1, D-55131 Mainz, Germany
[2] Empolis Informat Management, Kaiserslautern, Germany
[3] Johannes Gutenberg Univ Mainz, Univ Med Ctr, IMBEI, Mainz, Germany
[4] Johannes Gutenberg Univ Mainz, Dept Urol & Pediat Urol, Univ Med Ctr, Mainz, Germany
关键词
Data science; Natural language processing; RadLex; Urolithiasis; RADIOLOGY REPORTS; CT; INFORMATION; PREVALENCE; VALIDATION; QUALITY; STONES;
D O I
10.1016/j.ijmedinf.2020.104106
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: The majority of radiological reports are still written as free text and lack structure. Further evaluation of free-text reports is difficult to achieve without a great deal of manual effort, and is not possible in everyday clinical practice. This study aims to automatically capture clinical information and positive hit rates from narrative radiological reports of suspected urolithiasis using natural language processing (NLP). Methods: Narrative reports of low dose computed tomography (CT) of the retroperitoneum from April 2016 to July 2018 (n = 1714) were analyzed using NLP. These free-text reports were automatically structured based on RadLex concepts. Manual feedback was used to test and train the NLP engine to further enhance the performance. The chi-squared test, phi coefficient, and logistic regression analysis were performed to determine the effect of clinical information on the positive hit rate of urolithiasis. Results: Urolithiasis was affirmed in 72 % of the reports; in 38 % at least one stone was described in the kidneys, and in 45 % at least one stone was described in the ureter. Clinical information, such as previous stone history and obstructive uropathy, showed a strong correlation with confirmed urolithiasis (p = 0.001). Previous stone history and the combination of obstructive uropathy and loin pain had the highest association with positive urolithiasis (p < 0.001). Conclusion: Applying this NLP approach to already existing free-text reports allows the conversion of such reports into a structured form. This may be valuable for epidemiological studies, to evaluate the appropriateness of CT examinations, or to answer a variety of research questions.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Towards Data-driven Ontologies: a Filtering Approach using Keywords and Natural Language Constructs
    de Boer, Maaike H. T.
    Verhoosel, Jack P. C.
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2285 - 2292
  • [2] Data-driven materials research enabled by natural language processing and information extraction
    Olivetti, Elsa A.
    Cole, Jacqueline M.
    Kim, Edward
    Kononova, Olga
    Ceder, Gerbrand
    Han, Thomas Yong-Jin
    Hiszpanski, Anna M.
    [J]. APPLIED PHYSICS REVIEWS, 2020, 7 (04):
  • [3] Data-driven automatic classification model for construction accident cases using natural language processing with hyperparameter tuning
    Kumi, Louis
    Jeong, Jaewook
    Jeong, Jaemin
    [J]. AUTOMATION IN CONSTRUCTION, 2024, 164
  • [4] From data to insights: how natural language processing and structured reporting advance data-driven radiology
    Fink, Matthias A.
    [J]. EUROPEAN RADIOLOGY, 2023, 33 (11) : 7494 - 7495
  • [5] From data to insights: how natural language processing and structured reporting advance data-driven radiology
    Matthias A. Fink
    [J]. European Radiology, 2023, 33 : 7494 - 7495
  • [6] Natural language spoken interface control using data-driven semantic inference
    Bellegarda, JR
    Silverman, KEA
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (03): : 267 - 277
  • [7] PROCESSING NATURAL MALAY TEXTS: A DATA-DRIVEN APPROACH
    Don, Zuraidah Mohd
    [J]. TRAMES-JOURNAL OF THE HUMANITIES AND SOCIAL SCIENCES, 2010, 14 (01): : 90 - 103
  • [8] A data-driven architecture using natural language processing to improve phenotyping efficiency and accelerate genetic diagnoses of rare disorders
    Parikh, Jignesh R.
    Genetti, Casie A.
    Aykanat, Asli
    Brownstein, Catherine A.
    Schmitz-Abe, Klaus
    Danowski, Morgan
    Quitadomo, Andrew
    Madden, Jill A.
    Yacoubian, Calum
    Gain, Richard
    Williams, Tessa
    Meskell, Mary
    Brown, Andrew
    Frith, Alison
    Rockowitz, Shira
    Sliz, Piotr
    Agrawal, Pankaj B.
    Defay, Thomas
    McDonagh, Paul
    Reynders, John
    Lefebvre, Sebastien
    Beggs, Alan H.
    [J]. HUMAN GENETICS AND GENOMICS ADVANCES, 2021, 2 (03):
  • [9] Identifying Characteristics of Patients With Suspected Stroke by Paramedics but not by Emergency Medical Dispatchers Using Natural Language Processing and Machine Learning
    Richards, Christopher T.
    Garg, Ravi P.
    Mendelson, Scott J.
    Stein-Spencer, Leslee
    Prabhakaran, Shyam
    [J]. STROKE, 2018, 49
  • [10] Automatic Corpus Extension for Data-driven Natural Language Generation
    Manishina, Elena
    Jabaian, Bassam
    Huet, Stephane
    Lefevre, Fabrice
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3624 - 3631