Highdicom: a Python']Python Library for Standardized Encoding of Image Annotations and Machine Learning Model Outputs in Pathology and Radiology

被引:8
|
作者
Bridge, Christopher P. [1 ,2 ]
Gorman, Chris [3 ]
Pieper, Steven [4 ]
Doyle, Sean W. [2 ]
Lennerz, Jochen K. [5 ,6 ]
Kalpathy-Cramer, Jayashree [1 ,2 ,7 ]
Clunie, David A. [8 ]
Fedorov, Andriy Y. [7 ,9 ]
Herrmann, Markus D. [3 ,6 ]
机构
[1] Massachusetts Gen Hosp, Martinos Ctr Biomed Imaging, Boston, MA 02114 USA
[2] Mass Gen Brigham, MGH & BWH Ctr Clin Data Sci, Boston, MA USA
[3] Massachusetts Gen Hosp, Dept Pathol, Computat Pathol, Boston, MA 02114 USA
[4] Isomics Inc, Cambridge, MA USA
[5] Massachusetts Gen Hosp, Dept Pathol, Ctr Integrated Diagnost, Boston, MA 02114 USA
[6] Harvard Med Sch, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Med Sch, Dept Radiol, Boston, MA 02115 USA
[8] PixelMed Publishing LLC, Bangor, PA USA
[9] Brigham & Womens Hosp, Dept Radiol, Surg Planning Lab, 75 Francis St, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
DICOM; !text type='Python']Python[!/text; Software; Machine learning; Segmentations; Structured reports; RESOURCE; DICOM;
D O I
10.1007/s10278-022-00683-y
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Machine learning (ML) is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but the lack of interoperability between ML systems and enterprise medical imaging systems has been a major barrier for clinical integration and evaluation. The DICOM (R) standard specifies information object definitions (IODs) and services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with datasets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface (API) for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library leverages NumPy arrays for efficient data representation and ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers and researchers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source at https://github.com/herrmannlab/ highd icom.
引用
收藏
页码:1719 / 1737
页数:19
相关论文
共 21 条
  • [1] Highdicom: a Python Library for Standardized Encoding of Image Annotations and Machine Learning Model Outputs in Pathology and Radiology
    Christopher P. Bridge
    Chris Gorman
    Steven Pieper
    Sean W. Doyle
    Jochen K. Lennerz
    Jayashree Kalpathy-Cramer
    David A. Clunie
    Andriy Y. Fedorov
    Markus D. Herrmann
    Journal of Digital Imaging, 2022, 35 : 1719 - 1737
  • [2] mOWL: Python']Python library for machine learning with biomedical ontologies
    Zhapa-Camacho, Fernando
    Kulmanov, Maxat
    Hoehndorf, Robert
    BIOINFORMATICS, 2023, 39 (01)
  • [3] MLPro - An integrative middleware framework for standardized machine learning tasks in Python']Python
    Arend, Detlef
    Diprasetya, Mochammad Rizky
    Yuwono, Steve
    Schwung, Andreas
    SOFTWARE IMPACTS, 2022, 14
  • [4] Scikit-Weak: A Python']Python Library for Weakly Supervised Machine Learning
    Campagner, Andrea
    Lienen, Julian
    Huellermeier, Eyke
    Ciucci, Davide
    ROUGH SETS, IJCRS 2022, 2022, 13633 : 57 - 70
  • [5] APLUS: A Python']Python library for usefulness simulations of machine learning models in healthcare
    Wornow, Michael
    Ross, Elsie Gyang
    Callahan, Alison
    Shah, Nigam H.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 139
  • [6] BindsNET: A Machine Learning-Oriented Spiking Neural Networks Library in Python']Python
    Hazan, Hananel
    Saunders, Daniel J.
    Khan, Hassaan
    Patel, Devdhar
    Sanghavi, Darpan T.
    Siegelmann, Hava T.
    Kozma, Robert
    FRONTIERS IN NEUROINFORMATICS, 2018, 12
  • [7] Brain Predictability toolbox: a Python']Python library for neuroimaging-based machine learning
    Hahn, Sage
    Yuan, De Kang
    Thompson, Wesley K.
    Owens, Max
    Allgaier, Nicholas
    Garavan, Hugh
    BIOINFORMATICS, 2021, 37 (11) : 1637 - 1638
  • [8] Machine learning for libraries with Python']Python libraries: practical case in the Library of Congress of Chile
    Gonzalez, Marcelo Lorca
    SOUTH AFRICAN JOURNAL OF LIBRARIES AND INFORMATION SCIENCE, 2024, 90 (02)
  • [9] Deeptime: a Python']Python library for machine learning dynamical models from time series data
    Hoffmann, Moritz
    Scherer, Martin
    Hempel, Tim
    Mardt, Andreas
    de Silva, Brian
    Husic, Brooke E.
    Klus, Stefan
    Wu, Hao
    Kutz, Nathan
    Brunton, Steven L.
    Noe, Frank
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):
  • [10] Frouros: An open-source Python']Python library for drift detection in machine learning systems
    Sisniega, Jaime Cespedes
    Garcia, alvaro Lopez
    SOFTWAREX, 2024, 26