Toward an Optimal Approach for Variable Selection in Counter-Propagation Neural Networks: Modeling Protein-Tyrosine Kinase Inhibitory of Flavanoids Using Substituent Electronic Descriptors

被引:5
|
作者
Hemmateenejad, Bahram [1 ,2 ]
Mehdipour, Ahmadreza [2 ]
Deeb, Omar [3 ]
Sanchooli, Mahmood
Miri, Ramin [2 ]
机构
[1] Shiraz Univ, Dept Chem, Shiraz, Iran
[2] Shiraz Univ Med Sci, Med & Nat Prod Chem Res Ctr, Shiraz, Iran
[3] Al Quds Univ, Fac Pharm, Jerusalem, Israel
关键词
Counter propagation neural network; Flavanoids; Variable selection; Protein-tyrosine kinase; Substituent electronic descriptors; STRUCTURE-PROPERTY RELATIONSHIP; QUANTITATIVE STRUCTURE-ACTIVITY; SUPPORT VECTOR MACHINE; QSAR MODEL; 2ND-ORDER CALIBRATION; MOLECULAR DESCRIPTORS; PATTERN-RECOGNITION; LEARNING-METHODS; RANDOM FOREST; HUMAN PLASMA;
D O I
10.1002/minf.201100081
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Counter propagation neural network (CPNN) is one of the attractive tools of classification in QSAR studies. A major obstacle in classification by CPNN is finding the best subset of variables. In this study, the performance of some different feature selection algorithms including F score-based ranking, eigenvalue ranking of PCs obtained from data set, Non-Error-Rate (NER) ranking of both descriptors and PCs, and 3-way handling of data, Parallel Factor Analysis (PARAFAC), was evaluated in order to find the best classification model. The methods were applied for modeling protein-tyrosine kinase inhibitory of some flavonoid derivatives using substituent electronic descriptors (SED) as novel source of electronic descriptors. The results showed that the best performance was achieved by F-score ranking while the NER ranking of principal components (PCs) showed very fluctuate results and the worst performance was belonging to PARAFAC-CPNN. Furthermore, comparison of results of these nonlinear algorithms with linear discriminate analysis method revealed better predictions by the former.
引用
收藏
页码:939 / 949
页数:11
相关论文
empty
未找到相关数据