Highdicom: a Python']Python Library for Standardized Encoding of Image Annotations and Machine Learning Model Outputs in Pathology and Radiology

被引:10
|
作者
Bridge, Christopher P. [1 ,2 ]
Gorman, Chris [3 ]
Pieper, Steven [4 ]
Doyle, Sean W. [2 ]
Lennerz, Jochen K. [5 ,6 ]
Kalpathy-Cramer, Jayashree [1 ,2 ,7 ]
Clunie, David A. [8 ]
Fedorov, Andriy Y. [7 ,9 ]
Herrmann, Markus D. [3 ,6 ]
机构
[1] Massachusetts Gen Hosp, Martinos Ctr Biomed Imaging, Boston, MA 02114 USA
[2] Mass Gen Brigham, MGH & BWH Ctr Clin Data Sci, Boston, MA USA
[3] Massachusetts Gen Hosp, Dept Pathol, Computat Pathol, Boston, MA 02114 USA
[4] Isomics Inc, Cambridge, MA USA
[5] Massachusetts Gen Hosp, Dept Pathol, Ctr Integrated Diagnost, Boston, MA 02114 USA
[6] Harvard Med Sch, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Med Sch, Dept Radiol, Boston, MA 02115 USA
[8] PixelMed Publishing LLC, Bangor, PA USA
[9] Brigham & Womens Hosp, Dept Radiol, Surg Planning Lab, 75 Francis St, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
DICOM; !text type='Python']Python[!/text; Software; Machine learning; Segmentations; Structured reports; RESOURCE; DICOM;
D O I
10.1007/s10278-022-00683-y
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Machine learning (ML) is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but the lack of interoperability between ML systems and enterprise medical imaging systems has been a major barrier for clinical integration and evaluation. The DICOM (R) standard specifies information object definitions (IODs) and services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with datasets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface (API) for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library leverages NumPy arrays for efficient data representation and ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers and researchers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source at https://github.com/herrmannlab/ highd icom.
引用
收藏
页码:1719 / 1737
页数:19
相关论文
共 23 条