Accelerated knowledge discovery from omics data by optimal experimental design

被引:0
|
作者
Xiaokang Wang
Navneet Rai
Beatriz Merchel Piovesan Pereira
Ameen Eetemadi
Ilias Tagkopoulos
机构
[1] University of California,Department of Biomedical Engineering
[2] University of California,Genome Center
[3] University of California,Department of Computer Science
[4] University of California,Microbiology Graduate Group
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
How to design experiments that accelerate knowledge discovery on complex biological landscapes remains a tantalizing question. We present an optimal experimental design method (coined OPEX) to identify informative omics experiments using machine learning models for both experimental space exploration and model training. OPEX-guided exploration of Escherichia coli’s populations exposed to biocide and antibiotic combinations lead to more accurate predictive models of gene expression with 44% less data. Analysis of the proposed experiments shows that broad exploration of the experimental space followed by fine-tuning emerges as the optimal strategy. Additionally, analysis of the experimental data reveals 29 cases of cross-stress protection and 4 cases of cross-stress vulnerability. Further validation reveals the central role of chaperones, stress response proteins and transport pumps in cross-stress exposure. This work demonstrates how active learning can be used to guide omics data collection for training predictive models, making evidence-driven decisions and accelerating knowledge discovery in life sciences.
引用
下载
收藏
相关论文
共 50 条
  • [1] Accelerated knowledge discovery from omics data by optimal experimental design
    Wang, Xiaokang
    Rai, Navneet
    Pereira, Beatriz Merchel Piovesan
    Eetemadi, Ameen
    Tagkopoulos, Ilias
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [2] Experimental Design in Clinical 'Omics Biomarker Discovery
    Forshed, Jenny
    JOURNAL OF PROTEOME RESEARCH, 2017, 16 (11) : 3954 - 3960
  • [3] A novel deep mining model for effective knowledge discovery from omics data
    Alzubaidi, Abeer
    Tepper, Jonathan
    Lotfi, Ahmad
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 104
  • [4] Data Warehouse Design For Knowledge Discovery From Healthcare Data
    Ahmed, Aftab
    Zafar, Kashif
    Siddiqui, Abdul Basit
    Abdullah, Umair
    WORLD CONGRESS ON ENGINEERING - WCE 2013, VOL III, 2013, : 1589 - +
  • [5] Optimal experimental design for materials discovery
    Dehghannasiri, Roozbeh
    Xue, Dezhen
    Balachandran, Prasanna V.
    Yousefi, Mohammadmahdi R.
    Dalton, Lori A.
    Lookman, Turab
    Dougherty, Edward R.
    COMPUTATIONAL MATERIALS SCIENCE, 2017, 129 : 311 - 322
  • [6] Combinatorial materials sciences: Experimental strategies for accelerated knowledge discovery
    Rajan, Krishna
    ANNUAL REVIEW OF MATERIALS RESEARCH, 2008, 38 : 299 - 322
  • [7] Knowledge discovery from data?
    Pazzani, MJ
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 2000, 15 (02): : 10 - 13
  • [8] Knowledge discovery from data?
    Pazzani, Michael J.
    IEEE Intelligent Systems and Their Applications, 2000, 15 (02): : 10 - 13
  • [9] Design for Reality: Knowledge Discovery in Design and Test Data
    Abadir, Magdy
    SBCCI 2010: 23RD SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, PROCEEDINGS, 2010, : 54 - 54
  • [10] On fusion methods for knowledge discovery from multi-omics datasets
    Baldwin, Edwin
    Han, Jiali
    Luo, Wenting
    Zhou, Jin
    An, Lingling
    Liu, Jian
    Zhang, Hao Helen
    Li, Haiquan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 (18): : 509 - 517