ChemFLuo: a web-server for structure analysis and identification of fluorescent compounds

被引:7
|
作者
Yang, Zi-Yi
Dong, Jie [4 ]
Yang, Zhi-Jiang [2 ]
Yin, Mingzhu [1 ]
Jiang, Hong-Li
Lu, Ai-Ping [3 ]
Chen, Xiang [1 ]
Hou, Ting-Jun
Cao, Dong-Sheng [2 ]
机构
[1] Cent South Univ, Xiangya Hosp, Hunan Key Lab Skin Canc & Psoriasis, Dept Dermatol,Hunan Engn Res Ctr Skin Hlth & Dis, Changsha, Hunan, Peoples R China
[2] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha 410003, Peoples R China
[3] Hong Kong Baptist Univ, Sch Chinese Med, Inst Adv Translat Med Bone & Joint Dis, Hong Kong, Peoples R China
[4] Univ Macau, Zhuhai, Peoples R China
基金
中国国家自然科学基金;
关键词
frequent hitters; false positives; fluorescent compounds; machine learning; substructure screening; public webserver; AVAILABLE [!text type='PYTHON']PYTHON[!/text] PACKAGE; INTERFERENCE; GENERATION; REACTIVITY; ARTIFACTS; BIOLOGY; DESIGN; RULES;
D O I
10.1093/bib/bbaa282
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Fluorescent detection methods are indispensable tools for chemical biology. However, the frequent appearance of potential fluorescent compound has greatly interfered with the recognition of compounds with genuine activity. Such fluorescence interference is especially difficult to identify as it is reproducible and possesses concentration-dependent characteristic. Therefore, the development of a credible screening tool to detect fluorescent compounds from chemical libraries is urgently needed in early stages of drug discovery. Results: In this study, we developed a webserver ChemFLuo for fluorescent compound detection, based on two large and high-quality training datasets containing 4906 blue and 8632 green fluorescent compounds. These molecules were used to construct a group of prediction models based on the combination of three machine learning algorithms and seven types of molecular representations. The best blue fluorescence prediction model achieved with balanced accuracy (BA)=0.858 and area under the receiver operating characteristic curve (AUC)=0.931 for the validation set, and BA=0.823 and AUC=0.903 for the test set. The best green fluorescence prediction model achieved the prediction accuracy with BA=0.810 and AUC=0.887 for the validation set, and BA=0.771 and AUC=0.852 for the test set. Besides prediction model, 22 blue and 16 green representative fluorescent substructures were summarized for the screening of potential fluorescent compounds. The comparison with other fluorescence detection tools and the application to external validation sets and large molecule libraries have demonstrated the reliability of prediction model for fluorescent compound detection. Conclusion: ChemFLuo is a public webserver to filter out compounds with undesirable fluorescent properties, which will benefit the design of high-quality chemical libraries for drug discovery.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] The DOMINO web-server for active module identification analysis
    Levi, Hagai
    Rahmanian, Nima
    Elkon, Ran
    Shamir, Ron
    [J]. BIOINFORMATICS, 2022, 38 (08) : 2364 - 2366
  • [2] Analysis of Buffer Delay in Web-Server Control
    Kjaer, Martin Ansbjerg
    Robertsson, Anders
    [J]. 2010 AMERICAN CONTROL CONFERENCE, 2010, : 1047 - 1052
  • [3] MotifNet: a web-server for network motif analysis
    Smoly, Ilan Y.
    Lerman, Eugene
    Ziv-Ukelson, Michal
    Yeger-Lotem, Esti
    [J]. BIOINFORMATICS, 2017, 33 (12) : 1907 - 1909
  • [4] Feedback algorithm and web-server for protein structure alignment
    Zhao, Zhiyu
    Fu, Bin
    Alanis, Francisco J.
    Summa, Christopher M.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (05) : 505 - 524
  • [5] Occwserv: An occam web-server
    Barnes, Fred
    [J]. Concurrent Systems Engineering Series, 2003, 61 : 251 - 268
  • [6] Occwserv: An occam Web-Server
    Barnes, F
    [J]. COMMUNICATING PROCESS ARCHITECTURES 2003, 2003, 61 : 251 - 268
  • [7] Web-server broadcast storm
    不详
    [J]. BYTE, 1996, 21 (06): : 20 - 20
  • [8] Analysis and design of admission control in web-server systems
    Robertsson, A
    Wittenmark, B
    Kihl, M
    [J]. PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 254 - 259
  • [9] SPEC fields Web-server benchmarks
    Weiss, R
    [J]. COMPUTER DESIGN, 1996, 35 (12): : 26 - 26
  • [10] Predictability of web-server traffic congestion*
    Baryshnikov, Y
    Coffman, E
    Pierre, G
    Rubenstein, D
    Squillante, M
    Yimwadsana, T
    [J]. WCW 2005: 10th International Workshop on Web Content Caching and Distribution, Proceedings, 2005, : 97 - 103