Molecular Similarity Perception Based on Machine-Learning Models

被引:2
|
作者
Gandini, Enrico [1 ]
Marcou, Gilles [2 ]
Bonachera, Fanny [2 ]
Varnek, Alexandre [2 ]
Pieraccini, Stefano [1 ]
Sironi, Maurizio [1 ]
机构
[1] Univ Milan, Dipartimento Chim, Via Golgi 19, I-20133 Milan, Italy
[2] Univ Strasbourg, Lab Chemoinformat, CNRS, UMR 7140, 4 Rue Blaise Pascal, F-67000 Strasbourg, France
关键词
molecular similarity; similarity perception; machine learning; chemical data set; CONFORMER GENERATION; CHEMISTRY; DATABASE; DRUG;
D O I
10.3390/ijms23116114
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Molecular similarity is an impressively broad topic with many implications in several areas of chemistry. Its roots lie in the paradigm that 'similar molecules have similar properties'. For this reason, methods for determining molecular similarity find wide application in pharmaceutical companies, e.g., in the context of structure-activity relationships. The similarity evaluation is also used in the field of chemical legislation, specifically in the procedure to judge if a new molecule can obtain the status of orphan drug with the consequent financial benefits. For this procedure, the European Medicines Agency uses experts' judgments. It is clear that the perception of the similarity depends on the observer, so the development of models to reproduce the human perception is useful. In this paper, we built models using both 2D fingerprints and 3D descriptors, i.e., molecular shape and pharmacophore descriptors. The proposed models were also evaluated by constructing a dataset of pairs of molecules which was submitted to a group of experts for the similarity judgment. The proposed machine-learning models can be useful to reduce or assist human efforts in future evaluations. For this reason, the new molecules dataset and an online tool for molecular similarity estimation have been made freely available.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Quantum Chemical Roots of Machine-Learning Molecular Similarity Descriptors
    Gugler, Stefan
    Reiher, Markus
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2022, : 6670 - 6689
  • [2] Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods
    Sereina Riniker
    Gregory A Landrum
    [J]. Journal of Cheminformatics, 5
  • [3] Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods
    Riniker, Sereina
    Landrum, Gregory A.
    [J]. JOURNAL OF CHEMINFORMATICS, 2013, 5
  • [4] Certified Machine-Learning Models
    Damiani, Ernesto
    Ardagna, Claudio A.
    [J]. SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 3 - 15
  • [5] Machine-Learning Studies on Spin Models
    Shiina, Kenta
    Mori, Hiroyuki
    Okabe, Yutaka
    Lee, Hwee Kuan
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [6] Machine-Learning Studies on Spin Models
    Kenta Shiina
    Hiroyuki Mori
    Yutaka Okabe
    Hwee Kuan Lee
    [J]. Scientific Reports, 10
  • [7] Improving measurements of similarity judgments with machine-learning algorithms
    Jeffrey R. Stevens
    Alexis Polzkill Saltzman
    Tanner Rasmussen
    Leen-Kiat Soh
    [J]. Journal of Computational Social Science, 2021, 4 : 613 - 629
  • [8] Improving measurements of similarity judgments with machine-learning algorithms
    Stevens, Jeffrey R.
    Saltzman, Alexis Polzkill
    Rasmussen, Tanner
    Soh, Leen-Kiat
    [J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2021, 4 (02): : 613 - 629
  • [9] Machine-learning frameworks in molecular simulation
    Kitchin, John
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [10] Advancing interpretability of machine-learning prediction models
    Trenary, Laurie
    DelSole, Timothy
    [J]. ENVIRONMENTAL DATA SCIENCE, 2022, 1