Collation and data-mining of literature bioactivity data for drug discovery

被引:26
|
作者
Bellis, Louisa J. [1 ]
Akhtar, Ruth [1 ]
Al-Lazikani, Bissan [2 ]
Atkinson, Francis [1 ]
Bento, A. Patricia [1 ]
Chambers, Jon [1 ]
Davies, Mark [1 ]
Gaulton, Anna [1 ]
Hersey, Anne [1 ]
Ikeda, Kazuyoshi [1 ]
Krueger, Felix A. [1 ]
Light, Yvonne [1 ]
McGlinchey, Shaun [1 ]
Santos, Rita [1 ]
Stauch, Benjamin [1 ]
Overington, John P. [1 ]
机构
[1] EMBL EBI, Hinxton CB10 1SD, Cambs, England
[2] Inst Canc Res, Sutton SM2 5NG, Surrey, England
基金
英国惠康基金;
关键词
bioactivity data; ChEMBL; data-mining; drug discovery; structure-activity relationship (SAR); CRYSTAL-STRUCTURE; LIGAND EFFICIENCY; RECEPTOR; EXPRESSION; RHODOPSIN; CLONING;
D O I
10.1042/BST0391365
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The challenge of translating the huge amount of genomic and biochemical data into new drugs is a costly and challenging task. Historically, there has been comparatively little focus on linking the biochemical and chemical worlds. To address this need, we have developed ChEMBL, an online resource of small-molecule SAR (structure-activity relationship) data, which can be used to support chemical biology, lead discovery and target selection in drug discovery. The database contains the abstracted structures, properties and biological activities for over 700 000 distinct compounds and in excess of more than 3 million bioactivity records abstracted from over 40 000 publications. Additional public domain resources can be readily integrated into the same data model (e.g. PubChem BioAssay data). The compounds in ChEMBL are largely extracted from the primary medicinal chemistry literature, and are therefore usually 'drug-like' or 'lead-like' small molecules with full experimental context. The data cover a significant fraction of the discovery of modern drugs, and are useful in a wide range of drug design and discovery tasks. In addition to the compound data, ChEMBL also contains information for over 8000 protein, cell line and whole-organism 'targets', with over 4000 of those being proteins linked to their underlying genes. The database is searchable both chemically, using an interactive compound sketch tool, protein sequences, family hierarchies, SMILES strings, compound research codes and key words, and biologically, using a variety of gene identifiers, protein sequence similarity and protein families. The information retrieved can then be readily filtered and downloaded into various formats. ChEMBL can be accessed online at https://www.ebi.ac.uk/chembldb.
引用
收藏
页码:1365 / 1370
页数:6
相关论文
共 50 条
  • [1] Novel Data-Mining Methodologies for Adverse Drug Event Discovery and Analysis
    Harpaz, R.
    DuMouchel, W.
    Shah, N. H.
    Madigan, D.
    Ryan, P.
    Friedman, C.
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2012, 91 (06) : 1010 - 1021
  • [2] Application of Data-Mining and Knowledge Discovery in Automotive Data Engineering
    Keller, Joerg
    Bauer, Valerij
    Kwedlo, Wojciech
    [J]. LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 464 - 469
  • [3] Data-mining discovery of pattern and process in ecological systems
    Hochachka, Wesley M.
    Caruana, Rich
    Fink, Danniel
    Munson, Art
    Riedewald, Mirek
    Sorokina, Darla
    Kelling, Steve
    [J]. JOURNAL OF WILDLIFE MANAGEMENT, 2007, 71 (07): : 2427 - 2437
  • [4] In silico veritas - Data-mining and automated discovery: the truth is in there
    Allen, JF
    [J]. EMBO REPORTS, 2001, 2 (07) : 542 - 544
  • [5] Data-Mining for Sulfur and Fluorine: An Evaluation of Pharmaceuticals To Reveal Opportunities for Drug Design and Discovery
    Ilardi, Elizabeth A.
    Vitaku, Edon
    Njardarson, Jon T.
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2014, 57 (07) : 2832 - 2842
  • [6] Reconstruction and data-mining of assay cascades from literature bioacitivity data
    Overington, John P.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 246
  • [7] Clinical Data-Mining
    Guzzetta, Charles
    [J]. JOURNAL OF TEACHING IN SOCIAL WORK, 2010, 30 (03) : 353 - 355
  • [8] Clinical Data-Mining
    Joelson, Richard B.
    [J]. SOCIAL WORK IN MENTAL HEALTH, 2010, 9 (01) : 71 - 72
  • [9] DATA-MINING DYNAMITE
    KRIVDA, CD
    [J]. BYTE, 1995, 20 (10): : 97 - &
  • [10] Study on Data Mining with Drug Discovery
    Diwan, Bahul
    Bhardwaj, Shweta
    [J]. EMERGING TRENDS IN EXPERT APPLICATIONS AND SECURITY, 2019, 841 : 513 - 520