Development of a natural language processing algorithm for the detection of spinal metastasis based on magnetic resonance imaging reports

被引：0

作者：

Mostafa, Evan ^{[1
]}

Hui, Aaron ^{[2
]}

Aasman, Boudewijn ^{[2
]}

Chowdary, Kamlesh ^{[2
]}

Mani, Kyle ^{[2
]}

Mardakhaev, Edward ^{[2
]}

Zampolin, Richard ^{[2
]}

Blumfield, Einat ^{[2
]}

Berman, Jesse ^{[2
]}

Ramos, Rafael De La Garza ^{[3
]}

Fourman, Mitchell ^{[1
]}

Yassari, Reza ^{[3
]}

Eleswarapu, Ananth ^{[1
]}

Mirhaji, Parsa ^{[2
]}

机构：

[1] Montefiore Med Ctr, Dept Orthopaed Surg, 111 E 210th St, Bronx, NY 10467 USA

[2] Albert Einstein Coll Med, 1300 Morris Pk Ave, Bronx, NY 10461 USA

[3] Montefiore Med Ctr, Dept Neurol Surg, 111 E 210th St, Bronx, NY 10467 USA

来源：

NORTH AMERICAN SPINE SOCIETY JOURNAL | 2024年 / 19卷

关键词：

Spine; Metastatic; Cancer; MRI; Processing; Algorithm; Language;

D O I：

10.1016/j.xnsj.2024.100513

中图分类号：

R74 [神经病学与精神病学];

学科分类号：

摘要：

Background: Metastasis to the spinal column is a common complication of malignancy, potentially causing pain and neurologic injury. An automated system to identify and refer patients with spinal metastases can help overcome barriers to timely treatment. We describe the training, optimization and validation of a natural language processing algorithm to identify the presence of vertebral metastasis and metastatic epidural cord compression (MECC) from radiology reports of spinal MRIs. Methods: Reports from patients with spine MRI studies performed between January 1, 2008 and April 14, 2019 were reviewed by a team of radiologists to assess for the presence of cancer and generate a labeled dataset for model training. Using regular expression, impression sections were extracted from the reports and converted to all lower-case letters with all nonalphabetic characters removed. The reports were then tokenized and vectorized using the doc2vec algorithm. These were then used to train a neural network to predict the likelihood of spinal tumor or MECC. For each report, the model provided a number from 0 to 1 corresponding to its impression. We then obtained 111 MRI reports from outside the test set, 92 manually labeled negative and 19 with MECC to test the model's performance. Results: About 37,579 radiology reports were reviewed. About 36,676 were labeled negative, and 903 with MECC. We chose a cutoff of 0.02 as a positive result to optimize for a low false negative rate. At this threshold we found a 100% sensitivity rate with a low false positive rate of 2.2%. Conclusions: The NLP model described predicts the presence of spinal tumor and MECC in spine MRI reports with high accuracy. We plan to implement the algorithm into our EMR to allow for faster referral of these patients to appropriate specialists, allowing for reduced morbidity and increased survival.

引用

页数：6

共 50 条

[1] An Automated Natural Language Processing Algorithm to Classify Magnetic Resonance Imaging Reports Containing Positive Diagnoses of Hypertrophic Cardiomyopathy
Moon, Sungrim
Sagheb, Elham
Liu, Sijia
Chen, David
Bos, Martijn
Geske, Jeffrey B.
Noseworthy, Peter A.
Ackerman, Michael J.
Shellum, Jane L.
Chaudhry, Rajeev
Ommen, Steve R.
Araoz, Philip A.
Nishimura, Rick A.
Liu, Hongfang
Arruda-Olson, Adelaide
CIRCULATION, 2019, 140
[2] The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports
Yi Liu
Qing Liu
Chao Han
Xiaodong Zhang
Xiaoying Wang
BMC Medical Informatics and Decision Making, 19
[3] The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports
Liu, Yi
Liu, Qing
Han, Chao
Zhang, Xiaodong
Wang, Xiaoying
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
[4] Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing
Liu Yi
Zhu Li-Na
Liu Qing
Han Chao
Zhang Xiao-Dong
Wang Xiao-Ying
中华医学杂志英文版, 2019, 132 (14) : 1673 - 1680
[5] Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing
Liu, Yi
Zhu, Li-Na
Liu, Qing
Han, Chao
Zhang, Xiao-Dong
Wang, Xiao-Ying
CHINESE MEDICAL JOURNAL, 2019, 132 (14) : 1673 - 1680
[6] BERT for the Processing of Radiological Reports: An Attention-based Natural Language Processing Algorithm
Soffer, Shelly
Glicksberg, Benjamin S.
Zimlichman, Eyal
Klang, Eyal
ACADEMIC RADIOLOGY, 2022, 29 (04) : 634 - 635
[7] Development and Accuracy of Natural Language Processing-based Expression Matching to Identify and Classify Cardiomyopathy from Cardiovascular Magnetic Resonance Reports
Shenoy, Ujwala
Zhang, Lu
Jha, Mawra
Kwong, Raymond
Manning, Warren
Nezafat, Reza
Tsao, Connie
CIRCULATION, 2024, 150
[8] Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
Dewaswala, Nakeya
Chen, David
Bhopalwala, Huzefa
Kaggal, Vinod C.
Murphy, Sean P.
Bos, J. Martijn
Geske, Jeffrey B.
Gersh, Bernard J.
Ommen, Steve R.
Araoz, Philip A.
Ackerman, Michael J.
Arruda-Olson, Adelaide M.
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
[9] Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
Nakeya Dewaswala
David Chen
Huzefa Bhopalwala
Vinod C. Kaggal
Sean P. Murphy
J. Martijn Bos
Jeffrey B. Geske
Bernard J. Gersh
Steve R. Ommen
Philip A. Araoz
Michael J. Ackerman
Adelaide M. Arruda-Olson
BMC Medical Informatics and Decision Making, 22
[10] Extracting hypertrophic cardiomyopathy features from cardiac magnetic resonance reports by natural language processing
Dewaswala-Bhopalwala, N.
Chen, D.
Bhopalwala, H.
Pour, S. Hossein
Moon, S.
Bos, D.
Scott, C.
Geske, J.
Noseworthy, P.
Ommen, S. R.
Erickson, B. J.
Araoz, P. A.
Nishimura, R. A.
Ackerman, M. J.
Arruda-Olson, A. M.
EUROPEAN HEART JOURNAL, 2020, 41 : 199 - 199

← 1 2 3 4 5 →