Perspectives on making big data analytics work for oncology

Cited by: 27
Authors
El Naqa, Issam [1]
Affiliation
[1] Univ Michigan, Dept Radiat Oncol, Ann Arbor, MI 48109 USA
Keywords
Big data; Oncology; Machine learning; Clinical decision support; PREDICT RADIATION PNEUMONITIS; DOSE-VOLUME; BAYESIAN NETWORK; NEURAL-NETWORK; RADIOTHERAPY OUTCOMES; TEXTURAL FEATURES; PROSTATE-CANCER; TUMOR RESPONSE; NECK-CANCER; FDG-PET;
DOI
10.1016/j.ymeth.2016.08.010
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Oncology, with its unique combination of clinical, physical, technological, and biological data, provides an ideal case study for applying big data analytics to improve cancer treatment safety and outcomes. An oncology treatment course such as chemoradiotherapy can generate a large pool of information carrying the 5 Vs hallmarks of big data. These data comprise a heterogeneous mixture of patient demographics, radiation/chemo dosimetry, multimodality imaging features, and biological markers generated over a treatment period that can span a few days to several weeks. Efforts using commercial and in-house tools are underway to facilitate data aggregation, ontology creation, sharing, visualization, and varying analytics in a secure environment. However, open questions related to proper data structure representation and effective analytics tools to support oncology decision-making need to be addressed. It is recognized that oncology data constitute a mix of structured (tabulated) and unstructured (electronic documents) data that need to be processed to facilitate searching and subsequent knowledge discovery from relational or NoSQL databases. In this context, methods based on advanced analytics and image feature extraction for oncology applications will be discussed. On the other hand, the classical p (variables) >> n (samples) inference problem of statistical learning is challenged in the big data realm, and this is particularly true for oncology applications, where p-omics is witnessing exponential growth while the number of cancer incidences has generally plateaued over the past 5 years, leading to a quasi-linear growth in samples per patient. Within the big data paradigm, this kind of phenomenon may yield undesirable effects such as echo chamber anomalies, the Yule-Simpson reversal paradox, or misleading ghost analytics. In this work, we will present these effects as they pertain to oncology and engage small thinking methodologies to counter these effects, ranging from incorporating prior knowledge and using information-theoretic techniques to modern ensemble machine learning approaches, or a combination of these. We will particularly discuss the pros and cons of different approaches to improve mining of big data in oncology. (C) 2016 Elsevier Inc. All rights reserved.
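As an illustration of the Yule-Simpson reversal named in the abstract, the following is a minimal sketch, not taken from the paper. The treatment arms "A" and "B", the "early" and "late" stage strata, and all patient counts are hypothetical values chosen only to show how a pooled comparison can point the opposite way from every stratified comparison.

```python
from fractions import Fraction

# Hypothetical (responders, patients) counts per disease stage and arm.
# These numbers are invented for illustration; they are not from the paper.
counts = {
    "early": {"A": (18, 20), "B": (64, 80)},
    "late": {"A": (24, 80), "B": (4, 20)},
}


def stratified_rates(table):
    """Response rate of each arm within each stage."""
    return {
        stage: {arm: Fraction(r, n) for arm, (r, n) in arms.items()}
        for stage, arms in table.items()
    }


def pooled_rates(table):
    """Response rate of each arm after ignoring the stage variable."""
    totals = {}
    for arms in table.values():
        for arm, (r, n) in arms.items():
            prev_r, prev_n = totals.get(arm, (0, 0))
            totals[arm] = (prev_r + r, prev_n + n)
    return {arm: Fraction(r, n) for arm, (r, n) in totals.items()}


by_stage = stratified_rates(counts)
pooled = pooled_rates(counts)

# Arm A has the higher response rate inside every stage...
a_wins_each_stage = all(rates["A"] > rates["B"] for rates in by_stage.values())
# ...yet arm B looks better once the stages are pooled.
b_wins_pooled = pooled["B"] > pooled["A"]
```

In this sketch the reversal arises because arm A is assigned mostly to late-stage patients, who respond poorly under either arm, so stage acts as a confounder when it is left out of the pooled comparison.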
Pages: 32-44
Number of pages: 13