Discovering Outliers of Potential Drug Toxicities Using a Large-scale Data-driven Approach

被引:3
|
作者
Luo, Jake [1 ,2 ,3 ]
Cisler, Ron A. [1 ,2 ,3 ,4 ,5 ]
机构
[1] Univ Wisconsin Milwaukee, Ctr Biomed Data & Language Proc, Dept Hlth Informat & Adm, Milwaukee, WI 53211 USA
[2] Univ Wisconsin Milwaukee, Dept Hlth Informat & Adm, Coll Hlth Sci, Milwaukee, WI 53211 USA
[3] Univ Wisconsin Milwaukee, Coll Hlth Sci, Milwaukee, WI 53211 USA
[4] Ctr Urban Populat Hlth, Milwaukee, WI USA
[5] Univ Wisconsin Milwaukee, Joseph J Zilber Sch Publ Hlth, Milwaukee, WI USA
关键词
drug toxicity; cancer informatics; clinical trial; adverse events; outlier discovery; big clinical data;
D O I
10.4137/CIN.S39549
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
We systematically compared the adverse effects of cancer drugs to detect event outliers across different clinical trials using a data-driven approach. Because many cancer drugs are toxic to patients, better understanding of adverse events of cancer drugs is critical for developing therapies that could minimize the toxic effects. However, due to the large variabilities of adverse events across different cancer drugs, methods to efficiently compare adverse effects across different cancer drugs are lacking. To address this challenge, we present an exploration study that integrates multiple adverse event reports from clinical trials in order to systematically compare adverse events across different cancer drugs. To demonstrate our methods, we first collected data on 186,339 clinical trials from ClinicalTrials.gov and selected 30 common cancer drugs. We identified 1602 cancer trials that studied the selected cancer drugs. Our methods effectively extracted 12,922 distinct adverse events from the clinical trial reports. Using the extracted data, we ranked all 12,922 adverse events based on their prevalence in the clinical trials, such as nausea 82%, fatigue 77%, and vomiting 75.97%. To detect the significant drug outliers that could have a statistically high possibility of causing an event, we used the boxplot method to visualize adverse event outliers across different drugs and applied Grubbs' test to evaluate the significance. Analyses showed that by systematically integrating cross-trial data from multiple clinical trial reports, adverse event outliers associated with cancer drugs can be detected. The method was demonstrated by detecting the following four statistically significant adverse event cases: the association of the drug axitinib with hypertension (Grubbs' test, P<0.001), the association of the drug imatinib with muscle spasm (P<0.001), the association of the drug vorinostat with deep vein thrombosis (P<0.001), and the association of the drug afatinib with paronychia (P<0.01).
引用
收藏
页码:211 / 217
页数:7
相关论文
共 50 条
  • [1] A Data-driven Mechanism for Large-scale Data Distribution
    Shi Peichang
    Li Yiying
    Ding Bo
    Jiang Longquan
    Liu Hui
    Zhang Jie
    [J]. 2016 WORLD AUTOMATION CONGRESS (WAC), 2016,
  • [2] Data-driven Authoring of Large-scale Ecosystems
    Kapp, Konrad
    Gain, James
    Guerin, Eric
    Galin, Eric
    Peytavie, Adrien
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (06):
  • [3] Large-scale Data-driven Segmentation of Banking Customers
    Hossain, Md Monir
    Sebestyen, Mark
    Mayank, Dhruv
    Ardakanian, Omid
    Khazaei, Hamzeh
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4392 - 4401
  • [4] Evaluation of large-scale cycling environment by using the trajectory data of dockless shared bicycles: A data-driven approach
    Ni, Ying
    Wang, Shihan
    Chen, Jiaqi
    Feng, Bufan
    Yu, Rongjie
    Cai, Yilin
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2024,
  • [5] Data-driven realistic animation of large-scale forest
    School of Computer Science, Wuhan University, Wuhan 430079, China
    不详
    不详
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2008, 20 (08): : 1015 - 1022
  • [6] Large-scale mode identification and data-driven sciences
    Mukhopadhyay, Subhadeep
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (01): : 215 - 240
  • [7] Large-Scale Data-Driven Financial Risk Modeling using Big Data Technology
    Stockinger, Kurt
    Heitz, Jonas
    Bundi, Nils
    Breymann, Wolfgang
    [J]. 2018 IEEE/ACM 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING APPLICATIONS AND TECHNOLOGIES (BDCAT), 2018, : 206 - 207
  • [8] Improving large-scale hierarchical classification by rewiring: a data-driven filter based approach
    Azad Naik
    Huzefa Rangwala
    [J]. Journal of Intelligent Information Systems, 2019, 52 : 141 - 164
  • [9] Data-driven fault detection for large-scale network systems: A mixed optimization approach
    Ma, Zhen-Lei
    Li, Xiao-Jian
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2022, 426
  • [10] Data-Driven Approach to Assess Street Safety: Large-Scale Analysis of the Microscopic Design
    Wu, Wanshu
    Guo, Jinhan
    Ma, Ziying
    Zhao, Kai
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (11)