Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods

Cited by: 114
Authors
Riniker, Sereina [1]
Landrum, Gregory A. [1]
Affiliations
[1] Novartis Inst BioMed Res, Basel, Switzerland
Keywords
Visualization; Machine-learning; Similarity; Fingerprints; Dopamine D3 receptor; Ligands; Design; Models
DOI
10.1186/1758-2946-5-43
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Fingerprint similarity is a common method for comparing chemical structures. Similarity is an appealing approach because, with many fingerprint types, it provides intuitive results: a chemist looking at two molecules can understand why they have been determined to be similar. This transparency is partially lost with the fuzzier similarity methods that are often used for scaffold hopping and tends to vanish completely when molecular fingerprints are used as inputs to machine-learning (ML) models. Here we present similarity maps, a straightforward and general strategy to visualize the atomic contributions to the similarity between two molecules or the predicted probability of an ML model. We show the application of similarity maps to a set of dopamine D3 receptor ligands using atom-pair and circular fingerprints as well as two popular ML methods: random forests and naive Bayes. An open-source implementation of the method is provided.
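The abstract does not spell out how an atomic contribution is computed. A minimal sketch of one plausible scheme follows, assuming the weight of an atom is the drop in similarity when the fingerprint features set by that atom are left out. Everything in it is illustrative and not taken from this record: the toy fingerprint (element symbol plus sorted neighbor symbols), the molecule encoding as symbols and bond index pairs, the choice of Dice similarity, and the function names `atom_features`, `fingerprint`, `dice`, and `atomic_weights`. It is not the authors' open-source implementation.

```python
# Toy sketch of per-atom similarity contributions (all names are hypothetical).
# A molecule is modeled as (list of element symbols, list of bond index pairs).

def atom_features(symbols, bonds):
    """Map each atom index to the toy features (bits) that atom sets."""
    neighbors = {i: [] for i in range(len(symbols))}
    for a, b in bonds:
        neighbors[a].append(b)
        neighbors[b].append(a)
    features = {}
    for i, sym in enumerate(symbols):
        env = ",".join(sorted(symbols[j] for j in neighbors[i]))
        # Radius-0 feature (the element) and radius-1 feature (element + neighbors).
        features[i] = {sym, f"{sym}({env})"}
    return features


def fingerprint(features, skip=None):
    """Union of all atom features, optionally leaving out one atom's features."""
    fp = set()
    for i, feats in features.items():
        if i != skip:
            fp |= feats
    return fp


def dice(fp_a, fp_b):
    """Dice similarity between two feature sets."""
    total = len(fp_a) + len(fp_b)
    return 2.0 * len(fp_a & fp_b) / total if total else 0.0


def atomic_weights(ref, probe):
    """Return (base similarity, per-atom weights) for the probe molecule.

    Assumed scheme: weight = similarity with all atoms
    minus similarity with that atom's features removed.
    """
    ref_fp = fingerprint(atom_features(*ref))
    probe_feats = atom_features(*probe)
    base = dice(ref_fp, fingerprint(probe_feats))
    weights = [
        base - dice(ref_fp, fingerprint(probe_feats, skip=i))
        for i in range(len(probe_feats))
    ]
    return base, weights


# Heavy-atom graphs of ethanol (reference) and ethylamine (probe).
ethanol = (["C", "C", "O"], [(0, 1), (1, 2)])
ethylamine = (["C", "C", "N"], [(0, 1), (1, 2)])

base, weights = atomic_weights(ethanol, ethylamine)
for symbol, weight in zip(ethylamine[0], weights):
    print(f"{symbol}: {weight:+.3f}")
```

Under this sign convention, a positive weight marks an atom whose removal lowers the similarity (the shared terminal carbon here), and a negative weight marks an atom that makes the molecules less alike (the nitrogen). The same difference scheme could in principle be applied with an ML model's predicted probability in place of the similarity value; colouring atoms by these weights would give the map.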
Pages: 7