Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research

被引:32
|
作者
David, Laurianne [1 ,2 ]
Arus-Pous, Josep [1 ,3 ]
Karlsson, Johan [4 ]
Engkvist, Ola [1 ]
Bjerrum, Esben Jannik [1 ]
Kogej, Thierry [1 ]
Kriegl, Jan M. [5 ]
Beck, Bernd [5 ]
Chen, Hongming [1 ,6 ]
机构
[1] AstraZeneca, Biopharmaceut R&D, Discovery Sci, Hit Discovery, Gothenburg, Sweden
[2] Rhein Friedrich Wilhelms Univ Bonn, Dept Life Sci Informat, B IT, Bonn, Germany
[3] Univ Bern, Dept Chem & Biochem, Bern, Switzerland
[4] AstraZeneca, Biopharmaceut R&D, Discovery Sci, Quantitat Biol, Gothenburg, Sweden
[5] Boehringer Ingelheim Pharma GmbH & Co KG, Dept Med Chem, Biberach, Germany
[6] Chem & Chem Biol Ctr, Guangzhou Regenerat Med & Hlth Guangdong Lab, Guangzhou, Guangdong, Peoples R China
基金
欧盟地平线“2020”;
关键词
Artificial intelligence; deep learning; Chemogenomics; Large-scale data; pharmaceutical industry; INTERFERENCE COMPOUNDS PAINS; HUMAN-GENOME-PROJECT; DRUG DISCOVERY; ASSAY INTERFERENCE; SCREENING LIBRARIES; TARGET PREDICTION; MICROSCOPY IMAGES; CONNECTIVITY MAP; SMALL MOLECULES; DESIGN;
D O I
10.3389/fphar.2019.01303
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
In recent years, the development of high-throughput screening (HTS) technologies and their establishment in an industrialized environment have given scientists the possibility to test millions of molecules and profile them against a multitude of biological targets in a short period of time, generating data in a much faster pace and with a higher quality than before. Besides the structure activity data from traditional bioassays, more complex assays such as transcriptomics profiling or imaging have also been established as routine profiling experiments thanks to the advancement of Next Generation Sequencing or automated microscopy technologies. In industrial pharmaceutical research, these technologies are typically established in conjunction with automated platforms in order to enable efficient handling of screening collections of thousands to millions of compounds. To exploit the ever-growing amount of data that are generated by these approaches, computational techniques are constantly evolving. In this regard, artificial intelligence technologies such as deep learning and machine learning methods play a key role in cheminformatics and bio-image analytics fields to address activity prediction, scaffold hopping, de novo molecule design, reaction/retrosynthesis predictions, or high content screening analysis. Herein we summarize the current state of analyzing large-scale compound data in industrial pharmaceutical research and describe the impact it has had on the drug discovery process over the last two decades, with a specific focus on deep-learning technologies.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Large-Scale Embedding Learning in Heterogeneous Event Data
    Gui, Huan
    Liu, Jialu
    Tao, Fangbo
    Jiang, Meng
    Norick, Brandon
    Han, Jiawei
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 907 - 912
  • [2] A Data-Centric Approach for Analyzing Large-Scale Deep Learning Applications
    Vineet, S. Sai
    Joseph, Natasha Meena
    Korgaonkar, Kunal
    Paul, Arnab K.
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, ICDCN 2023, 2023, : 282 - 283
  • [3] Deep learning for the large-scale cancer data analysis
    Tsuji, Shingo
    Aburatani, Hiroyuki
    CANCER RESEARCH, 2015, 75 (22)
  • [4] Deep-learning methods for unveiling large-scale single-cell transcriptomes
    Xilin Shen
    Xiangchun Li
    Cancer Biology & Medicine, 2023, 20 (12) : 972 - 980
  • [5] Deep-learning methods for unveiling large-scale single-cell transcriptomes
    Shen, Xilin
    Li, Xiangchun
    CANCER BIOLOGY & MEDICINE, 2023, 20 (12) : 972 - 980
  • [6] Large-Scale JPEG Image Steganalysis Using Hybrid Deep-Learning Framework
    Zeng, Jishen
    Tan, Shunquan
    Li, Bin
    Huang, Jiwu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (05) : 1200 - 1214
  • [7] Large-Scale Uncertainty Management Systems: Learning and Exploiting Your Data
    Babu, Shivnath
    Guha, Sudipto
    Munagala, Kamesh
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 995 - 998
  • [8] DDHH: A Decentralized Deep Learning Framework for Large-scale Heterogeneous Networks
    Imran, Mubashir
    Yin, Hongzhi
    Chen, Tong
    Huang, Zi
    Zhang, Xiangliang
    Zheng, Kai
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2033 - 2038
  • [9] Special issue on towards advancements in machine learning for exploiting large-scale and heterogeneous repositories
    Anwar, Sajid
    Rocha, Alvaro
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (11): : 7909 - 7911
  • [10] Special issue on towards advancements in machine learning for exploiting large-scale and heterogeneous repositories
    Sajid Anwar
    Álvaro Rocha
    Neural Computing and Applications, 2023, 35 : 7909 - 7911