Chemoinformatics-based classification of prohibited substances employed for doping in sport

被引:29
|
作者
Cannon, Edward O. [1 ]
Bender, Andreas [1 ]
Palmer, David S. [1 ]
Mitchell, John B. O. [1 ]
机构
[1] Univ Cambridge, Dept Chem, Unilever Ctr Mol Sci Informat, Cambridge CB2 1EW, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1021/ci0601160
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Representative molecules from 10 classes of prohibited substances were taken from the World Anti-Doping Agency (WADA) list, augmented by molecules from corresponding activity classes found in the MDDR database. Together with some explicitly allowed compounds, these formed a set of 5245 molecules. Five types of fingerprints were calculated for these substances. The random forest classification method was used to predict membership of each prohibited class on the basis of each type of fingerprint, using 5-fold cross-validation. We also used a k-nearest neighbors (kNN) approach, which worked well for the smallest values of k. The most successful classifiers are based on Unity 2D fingerprints and give very similar Matthews correlation coefficients of 0.836 (kNN) and 0.829 (random forest). The kNN classifiers tend to give a higher recall of positives at the expense of lower precision. A naive Bayesian classifier, however, lies much further toward the extreme of high recall and low precision. Our results suggest that it will be possible to produce a reliable and quantitative assignment of membership or otherwise of each class of prohibited substances. This should aid the fight against the use of bioactive novel compounds as doping agents, while also protecting athletes against unjust disqualification.
引用
收藏
页码:2369 / 2380
页数:12
相关论文
共 50 条
  • [1] Doping Updating of the list of substances prohibited in sport
    Derrien, Elisa
    [J]. ACTUALITES PHARMACEUTIQUES, 2022, 61 (613): : 5 - 5
  • [2] Chemoinformatics-based enumeration of chemical libraries: a tutorial
    Saldivar-Gonzalez, Fernanda I.
    Huerta-Garcia, C. Sebastian
    Medina-Franco, Jose L.
    [J]. JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
  • [3] Chemoinformatics-based enumeration of chemical libraries: a tutorial
    Fernanda I. Saldívar-González
    C. Sebastian Huerta-García
    José L. Medina-Franco
    [J]. Journal of Cheminformatics, 12
  • [4] Prevalence, knowledge and attitude of prohibited substances use (doping) among Saudi sport players
    Al Ghobain, M.
    Konbaz, M. S.
    Almassad, A.
    Alsultan, A.
    Al Shubaili, M.
    AlShabanh, O.
    [J]. SUBSTANCE ABUSE TREATMENT PREVENTION AND POLICY, 2016, 11
  • [5] Prevalence, knowledge and attitude of prohibited substances use (doping) among Saudi sport players
    M. Al Ghobain
    M. S. Konbaz
    A. Almassad
    A. Alsultan
    M. Al Shubaili
    O. AlShabanh
    [J]. Substance Abuse Treatment, Prevention, and Policy, 11
  • [6] General practitioner knowledge of prohibited substances in sport
    Greenway, P
    Greenway, M
    [J]. BRITISH JOURNAL OF SPORTS MEDICINE, 1997, 31 (02) : 129 - 131
  • [7] General practitioner knowledge of prohibited substances in sport
    Webborn, ADJ
    [J]. BRITISH JOURNAL OF SPORTS MEDICINE, 1997, 31 (03) : 259 - 259
  • [8] SOME LEGAL PROBLEMS OF COUNTERING THE USE OF PROHIBITED SUBSTANCES AND PROHIBITED METHODS IN SPORT
    Pitulko, K., V
    Sergeeva, A. A.
    Gorbatova, M. A.
    [J]. HUMAN SPORT MEDICINE, 2021, 21 : 130 - 135
  • [9] The List of Prohibited Substances and Methods in Sport: Structure and Review Process by the World Anti-Doping Agency
    Mazzoni, Irene
    Barroso, Osquel
    Rabin, Olivier
    [J]. JOURNAL OF ANALYTICAL TOXICOLOGY, 2011, 35 (09) : 608 - 612
  • [10] Knowledge about doping and prohibited substances in a population of Cuban students
    Manuel Cabrera-Oliva, C. Victor
    Castillo-Diaz, Pablo
    Sabon-Cisneros, Ivet
    [J]. ARRANCADA, 2021, 21 (39): : 163 - 178