Automated data extraction tool (DET) for external applications in radiotherapy

Authors
Gurjar, Mruga [1]
Lindberg, Jesper [1, 2, 3]
Bjork-Eriksson, Thomas [4]
Olsson, Caroline [1, 3]
Affiliations
[1] Univ Gothenburg, Inst Clin Sci, Sahlgrenska Acad, Med Radiat Sci, Gothenburg, Sweden
[2] Sahlgrens Univ Hosp, Dept Med Phys & Biomed Engn, Gothenburg, Sweden
[3] Reg Canc Ctr West, Western Sweden Healthcare Reg, Gothenburg, Sweden
[4] Univ Gothenburg, Inst Clin Sci, Sahlgrenska Acad, Dept Oncol, Gothenburg, Sweden
Funding
Swedish Research Council
Keywords
Radiotherapy; Data extraction; Data cleaning; Automation
DOI
10.1016/j.tipsro.2022.12.001
Chinese Library Classification (CLC) number
R73 [Oncology]
Subject classification code
100214
Abstract
Purpose: Oncological Information Systems (OIS) manage information in radiotherapy (RT) departments. Due to database structure limitations, the stored information can rarely be used directly except for vendor-specific purposes. Our aim is to enable the use of such data in various external applications by creating a tool for automatic data extraction, cleaning and formatting.
Methods and materials: We used OIS data from a nine-linac RT department in Sweden (70 weeks, 2015-16). Extracted data included patients' referrals and appointments with details for RT sub-tasks. The data extraction tool that prepares the data for external use was built in the C# programming language. It used Excel automation queries to remove unassigned or duplicated values, substitute missing data and perform application-specific calculations. Descriptive statistics were used to verify the output against a manually prepared dataset from the corresponding time period.
Results: In the initial raw data, 2030 (51 %) and 907 (23 %) patients had known curative and palliative treatment intent, respectively, across 84 different cancer diagnoses. After removal of incomplete entries, 373 (10 %) patients had unknown treatment intent, which was substituted based on the known curative/palliative ratio. The automatically and manually prepared datasets differed by < 1 % for Mould, Treatment planning and Quality assurance, and by ±5 % for Fractions and Magnetic resonance imaging, with the tool overestimating in 80/140 (57 %) entries.
Conclusion: We successfully implemented a software tool to prepare ready-to-use OIS datasets for external applications. Our evaluations showed overall results close to the manually prepared dataset. The automated strategy can reduce dataset preparation time from weeks of manual work to seconds.
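The cleaning steps named in the abstract (removing unassigned or duplicated values, then substituting unknown treatment intent according to the known curative/palliative ratio) can be illustrated with a minimal sketch. This is not the authors' implementation, which was written in C# with Excel automation; it is a Python illustration under stated assumptions. The field names `patient_id` and `intent`, the function names, and the deterministic way unknown records are split are all hypothetical, since the abstract does not say how individual patients were assigned.

```python
from collections import Counter

KNOWN_INTENTS = ("curative", "palliative")


def clean_records(records):
    """Drop records without a patient id and remove exact duplicates, keeping order."""
    seen = set()
    cleaned = []
    for rec in records:
        if not rec.get("patient_id"):
            continue  # unassigned entry: no patient to attach it to
        key = (rec["patient_id"], rec.get("intent"))
        if key in seen:
            continue  # duplicated entry
        seen.add(key)
        cleaned.append(dict(rec))  # copy so the caller's data is not modified
    return cleaned


def impute_intent(records):
    """Fill unknown intents so that they follow the known curative/palliative ratio.

    Assumption: unknown records are split deterministically, with the first
    share becoming curative and the rest palliative.
    """
    known = Counter(r.get("intent") for r in records if r.get("intent") in KNOWN_INTENTS)
    total_known = sum(known.values())
    if total_known == 0:
        return records  # no ratio to base the substitution on
    unknown = [r for r in records if r.get("intent") not in KNOWN_INTENTS]
    n_curative = round(len(unknown) * known["curative"] / total_known)
    for i, rec in enumerate(unknown):
        rec["intent"] = "curative" if i < n_curative else "palliative"
    return records
```

A random draw weighted by the same ratio would be an equally plausible reading of "substituted based on the known curative/palliative ratio"; the deterministic split is used here only so that the result is reproducible.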
Pages: 7