ChemFLuo: a web-server for structure analysis and identification of fluorescent compounds

Cited by: 7
Authors
Yang, Zi-Yi
Dong, Jie [4]
Yang, Zhi-Jiang [2]
Yin, Mingzhu [1]
Jiang, Hong-Li
Lu, Ai-Ping [3]
Chen, Xiang [1]
Hou, Ting-Jun
Cao, Dong-Sheng [2]
Affiliations
[1] Cent South Univ, Xiangya Hosp, Hunan Key Lab Skin Canc & Psoriasis, Dept Dermatol, Hunan Engn Res Ctr Skin Hlth & Dis, Changsha, Hunan, Peoples R China
[2] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha 410003, Peoples R China
[3] Hong Kong Baptist Univ, Sch Chinese Med, Inst Adv Translat Med Bone & Joint Dis, Hong Kong, Peoples R China
[4] Univ Macau, Zhuhai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
frequent hitters; false positives; fluorescent compounds; machine learning; substructure screening; public webserver; AVAILABLE PYTHON PACKAGE; INTERFERENCE; GENERATION; REACTIVITY; ARTIFACTS; BIOLOGY; DESIGN; RULES
DOI
10.1093/bib/bbaa282
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Fluorescent detection methods are indispensable tools for chemical biology. However, the frequent appearance of potentially fluorescent compounds greatly interferes with the recognition of compounds with genuine activity. Such fluorescence interference is especially difficult to identify because it is reproducible and concentration-dependent. A credible screening tool to detect fluorescent compounds in chemical libraries is therefore urgently needed in the early stages of drug discovery.

Results: In this study, we developed a webserver, ChemFLuo, for fluorescent compound detection, based on two large, high-quality training datasets containing 4906 blue and 8632 green fluorescent compounds. These molecules were used to construct a group of prediction models based on combinations of three machine learning algorithms and seven types of molecular representations. The best blue fluorescence prediction model achieved a balanced accuracy (BA) of 0.858 and an area under the receiver operating characteristic curve (AUC) of 0.931 on the validation set, and BA = 0.823 and AUC = 0.903 on the test set. The best green fluorescence prediction model achieved BA = 0.810 and AUC = 0.887 on the validation set, and BA = 0.771 and AUC = 0.852 on the test set. In addition to the prediction models, 22 blue and 16 green representative fluorescent substructures were summarized for the screening of potential fluorescent compounds. Comparison with other fluorescence detection tools, together with application to external validation sets and large molecular libraries, demonstrated the reliability of the prediction models for fluorescent compound detection.

Conclusion: ChemFLuo is a public webserver for filtering out compounds with undesirable fluorescent properties, which will benefit the design of high-quality chemical libraries for drug discovery.
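The abstract reports balanced accuracy (BA) and AUC for each model. As a minimal sketch of how these two metrics are defined, they can be computed in plain Python as shown here. This is not the authors' evaluation code: the function names, the 0.5 decision threshold, and the toy labels and scores are illustrative assumptions.

```python
from typing import Sequence


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of sensitivity (recall on class 1) and specificity (recall on class 0)."""
    pos = sum(1 for t in y_true if t == 1)
    neg = len(y_true) - pos
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return 0.5 * (tp / pos + tn / neg)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Uses the pairwise (Mann-Whitney) formulation; ties count as half a win.
    Quadratic in the number of samples, which is fine for a small illustration.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = fluorescent, 0 = non-fluorescent.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.3, 0.7, 0.2, 0.1, 0.4, 0.05]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

print(f"BA  = {balanced_accuracy(y_true, y_pred):.3f}")
print(f"AUC = {roc_auc(y_true, scores):.3f}")
```

BA averages the per-class recalls, so it is not inflated when non-fluorescent compounds greatly outnumber fluorescent ones. AUC is threshold-free and measures only how well the scores rank fluorescent compounds above the rest.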
Pages: 14