Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research

被引:32
|
作者
David, Laurianne [1 ,2 ]
Arus-Pous, Josep [1 ,3 ]
Karlsson, Johan [4 ]
Engkvist, Ola [1 ]
Bjerrum, Esben Jannik [1 ]
Kogej, Thierry [1 ]
Kriegl, Jan M. [5 ]
Beck, Bernd [5 ]
Chen, Hongming [1 ,6 ]
机构
[1] AstraZeneca, Biopharmaceut R&D, Discovery Sci, Hit Discovery, Gothenburg, Sweden
[2] Rhein Friedrich Wilhelms Univ Bonn, Dept Life Sci Informat, B IT, Bonn, Germany
[3] Univ Bern, Dept Chem & Biochem, Bern, Switzerland
[4] AstraZeneca, Biopharmaceut R&D, Discovery Sci, Quantitat Biol, Gothenburg, Sweden
[5] Boehringer Ingelheim Pharma GmbH & Co KG, Dept Med Chem, Biberach, Germany
[6] Chem & Chem Biol Ctr, Guangzhou Regenerat Med & Hlth Guangdong Lab, Guangzhou, Guangdong, Peoples R China
基金
欧盟地平线“2020”;
关键词
Artificial intelligence; deep learning; Chemogenomics; Large-scale data; pharmaceutical industry; INTERFERENCE COMPOUNDS PAINS; HUMAN-GENOME-PROJECT; DRUG DISCOVERY; ASSAY INTERFERENCE; SCREENING LIBRARIES; TARGET PREDICTION; MICROSCOPY IMAGES; CONNECTIVITY MAP; SMALL MOLECULES; DESIGN;
D O I
10.3389/fphar.2019.01303
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
In recent years, the development of high-throughput screening (HTS) technologies and their establishment in an industrialized environment have given scientists the possibility to test millions of molecules and profile them against a multitude of biological targets in a short period of time, generating data in a much faster pace and with a higher quality than before. Besides the structure activity data from traditional bioassays, more complex assays such as transcriptomics profiling or imaging have also been established as routine profiling experiments thanks to the advancement of Next Generation Sequencing or automated microscopy technologies. In industrial pharmaceutical research, these technologies are typically established in conjunction with automated platforms in order to enable efficient handling of screening collections of thousands to millions of compounds. To exploit the ever-growing amount of data that are generated by these approaches, computational techniques are constantly evolving. In this regard, artificial intelligence technologies such as deep learning and machine learning methods play a key role in cheminformatics and bio-image analytics fields to address activity prediction, scaffold hopping, de novo molecule design, reaction/retrosynthesis predictions, or high content screening analysis. Herein we summarize the current state of analyzing large-scale compound data in industrial pharmaceutical research and describe the impact it has had on the drug discovery process over the last two decades, with a specific focus on deep-learning technologies.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Exploiting Data Sparsity for Large-Scale Matrix Computations
    Akbudak, Kadir
    Ltaief, Hatem
    Mikhalev, Aleksandr
    Charara, Ali
    Esposito, Aniello
    Keyes, David
    EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 721 - 734
  • [22] Large-scale Face Clustering Method Research Based on Deep Learning
    Wen, Zixin
    2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 731 - 734
  • [23] Alleviating Load Imbalance in Data Processing for Large-Scale Deep Learning
    Pumma, Sarunya
    Buono, Daniele
    Checconi, Fabio
    Que, Xinyu
    Feng, Wu-chun
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 262 - 271
  • [24] Efficient Large-scale Deep Learning Framework for Heterogeneous Multi-GPU Cluster
    Kim, Youngrang
    Choi, Hyeonseong
    Lee, Jaehwan
    Kim, Jik-Soo
    Jei, Hyunseung
    Roh, Hongchan
    2019 IEEE 4TH INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W 2019), 2019, : 176 - 181
  • [25] Efficient Learning of Fuzzy Logic Systems for Large-Scale Data Using Deep Learning
    Koklu, Ata
    Guven, Yusuf
    Kumbasar, Tufan
    INTELLIGENT AND FUZZY SYSTEMS, INFUS 2024 CONFERENCE, VOL 1, 2024, 1088 : 406 - 413
  • [26] Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey
    Giang Nguyen
    Stefan Dlugolinsky
    Martin Bobák
    Viet Tran
    Álvaro López García
    Ignacio Heredia
    Peter Malík
    Ladislav Hluchý
    Artificial Intelligence Review, 2019, 52 : 77 - 124
  • [27] Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey
    Nguyen, Giang
    Dlugolinsky, Stefan
    Bobak, Martin
    Viet Tran
    Lopez Garcia, Alvaro
    Heredia, Ignacio
    Malik, Peter
    Hluchy, Ladislav
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (01) : 77 - 124
  • [28] HVAC: Removing I/O Bottleneck for Large-Scale Deep Learning Applications
    Khan, Awais
    Paul, Arnab K.
    Zimmer, Christopher
    Oral, Sarp
    Dash, Sajal
    Atchley, Scott
    Wang, Feiyi
    2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022), 2022, : 324 - 335
  • [29] Computational drug repurposing by exploiting large-scale gene expression data: Strategy, methods and applications
    He, Hao
    Duo, Hongrui
    Zhang, Xiaoxi
    Zhou, Xinyi
    Zeng, Yujie
    Li, Yinghong
    Li, Bo
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [30] A Large-Scale Deep-Learning Approach for Multi-Temporal Aqua and Salt-Culture Mapping
    Diniz, Cesar
    Cortinhas, Luiz
    Pinheiro, Maria Luize
    Sadeck, Luis
    Fernandes Filho, Alexandre
    Baumann, Luis R. F.
    Adami, Marcos
    Souza-Filho, Pedro Walfir M.
    REMOTE SENSING, 2021, 13 (08)