Perspectives on making big data analytics work for oncology

Cited by: 27
Author
El Naqa, Issam [1]
Affiliation
[1] Univ Michigan, Dept Radiat Oncol, Ann Arbor, MI 48109 USA
Keywords
Big data; Oncology; Machine learning; Clinical decision support; PREDICT RADIATION PNEUMONITIS; DOSE-VOLUME; BAYESIAN NETWORK; NEURAL-NETWORK; RADIOTHERAPY OUTCOMES; TEXTURAL FEATURES; PROSTATE-CANCER; TUMOR RESPONSE; NECK-CANCER; FDG-PET
DOI
10.1016/j.ymeth.2016.08.010
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Oncology, with its unique combination of clinical, physical, technological, and biological data, provides an ideal case study for applying big data analytics to improve cancer treatment safety and outcomes. An oncology treatment course such as chemoradiotherapy can generate a large pool of information carrying the 5 Vs hallmarks of big data. This data comprises a heterogeneous mixture of patient demographics, radiation/chemo dosimetry, multimodality imaging features, and biological markers generated over a treatment period that can span a few days to several weeks. Efforts using commercial and in-house tools are underway to facilitate data aggregation, ontology creation, sharing, visualization, and varying analytics in a secure environment. However, open questions related to proper data structure representation and effective analytics tools to support oncology decision-making need to be addressed. It is recognized that oncology data constitutes a mix of structured (tabulated) and unstructured (electronic documents) data that needs to be processed to facilitate searching and subsequent knowledge discovery from relational or NoSQL databases. In this context, methods based on advanced analytics and image feature extraction for oncology applications will be discussed. On the other hand, the classical p (variables) >> n (samples) inference problem of statistical learning is challenged in the big data realm, and this is particularly true for oncology applications, where p-omics is witnessing exponential growth while the number of cancer incidences has generally plateaued over the past 5 years, leading to a quasi-linear growth in samples per patient. Within the big data paradigm, this kind of phenomenon may yield undesirable effects such as echo chamber anomalies, the Yule-Simpson reversal paradox, or misleading ghost analytics. In this work, we will present these effects as they pertain to oncology and engage small thinking methodologies to counter them, ranging from incorporating prior knowledge and using information-theoretic techniques to modern ensemble machine learning approaches, or a combination of these. We will particularly discuss the pros and cons of different approaches to improve mining of big data in oncology. © 2016 Elsevier Inc. All rights reserved.
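The Yule-Simpson reversal paradox named in the abstract can be illustrated with a small numeric sketch. The example below is not taken from the article: the two treatment arms, the two disease-stage strata, and all counts are hypothetical values chosen only to show how an arm can look better within every stratum yet worse once the strata are pooled.

```python
from fractions import Fraction

# Hypothetical (responders, patients) counts per arm and stratum.
# These numbers are invented for illustration; they are not from the article.
counts = {
    "A": {"early": (18, 20), "late": (48, 80)},
    "B": {"early": (64, 80), "late": (10, 20)},
}


def stratum_rates(arm):
    """Response rate of one arm within each stratum."""
    return {s: Fraction(r, n) for s, (r, n) in counts[arm].items()}


def pooled_rate(arm):
    """Response rate of one arm after ignoring the stratum label."""
    responders = sum(r for r, _ in counts[arm].values())
    patients = sum(n for _, n in counts[arm].values())
    return Fraction(responders, patients)


def shows_reversal(a, b):
    """True if arm a beats arm b in every stratum but loses when pooled."""
    ra, rb = stratum_rates(a), stratum_rates(b)
    wins_every_stratum = all(ra[s] > rb[s] for s in ra)
    return wins_every_stratum and pooled_rate(a) < pooled_rate(b)


if __name__ == "__main__":
    for arm in counts:
        per_stratum = {s: float(v) for s, v in stratum_rates(arm).items()}
        print(arm, per_stratum, "pooled:", float(pooled_rate(arm)))
    print("reversal:", shows_reversal("A", "B"))
```

In this sketch the reversal arises because arm A is given mostly to the harder (late-stage) stratum, so the pooled comparison is confounded by stage; stratifying on the confounder is one simple instance of the prior-knowledge approach the abstract mentions.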
Pages: 32-44
Page count: 13