A context evaluation approach for structural comparison of proteins using cross entropy over n-gram modelling

被引:0
|
作者
Razmara, Jafar [1 ]
Deris, Safaai B. [1 ]
Parvizpour, Sepideh [2 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Johor Baharu, Malaysia
[2] Univ Teknol Malaysia, Fac Biosci & Med Engn, Johor Baharu, Malaysia
关键词
Protein structure comparison; Structure alignment; Sequence alignment; Text modelling; STRUCTURE ALIGNMENT; STRUCTURE DATABASE; SEARCH; SIMILARITY; ALPHABET; TOOL;
D O I
10.1016/j.compbiomed.2013.07.022
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The structural comparison of proteins is a vital step in structural biology that is used to predict and analyse a new unknown protein function. Although a number of different techniques have been explored, the study to develop new alternative methods is still an active research area. The present paper introduces a text modelling-based technique for the structural comparison of proteins. The method models the secondary and tertiary structure of proteins in two linear sequences and then applies them to the comparison of two structures. The technique used for pairwise comparison of the sequences has been adopted from computational linguistics and its well-known techniques for analysing and quantifying textual sequences. To this end, an n-gram modelling technique is used to capture regularities between sequences, and then, the cross-entropy concept is employed to measure their similarities. Several experiments are conducted to evaluate the performance of the method and compare it with other commonly used programs. The assessments for information retrieval evaluation demonstrate that the technique has a high running speed, which is similar to other linear encoding methods, such as 3D-BLAST, SARST, and TS-AMIR, whereas its accuracy is comparable to CE and TM-align, which are high accuracy comparison tools. Accordingly, the results demonstrate that the algorithm has high efficiency compared with other state-of-the-art methods. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1614 / 1621
页数:8
相关论文
共 42 条
  • [21] An Evaluation of Page Segment Recommendation System using User's Notes and N-Gram Models
    Thunnom, Burin
    Ramingwong, Lachana
    2015 4TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION ICIEV 15, 2015,
  • [22] Improving Sentiment Classification Accuracy of Financial News using N-gram Approach and Feature Weighting Methods
    Foroozan, S.
    Murad, M. A. Azmi
    Sharef, N. M.
    Latiff, A. R. Abdul
    2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SECURITY (ICISS), 2015, : 211 - 214
  • [23] Log Posterior Approach in Learning Rules Generated using N-Gram based Edit distance for Keyword Search
    Priya, M.
    Kalpana, R.
    JOURNAL OF INTELLIGENT SYSTEMS, 2018, 27 (04) : 555 - 563
  • [24] Automatic Phrase Boundary Labeling of Speech Synthesis Database Using Context-Dependent HMMs and N-Gram Prior Distributions
    Chen, Qian
    Ling, Zhen-Hua
    Yang, Chen-Yu
    Dai, Li-Rong
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1581 - 1585
  • [25] Regret cross-efficiency evaluation using attitudinal entropy approach
    Pan, Hao
    Yang, Guo-liang
    Chen, Xiao-lei
    Lou, Yuan-yu
    Wang, Teng
    Guan, Zhong-cheng
    HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2024, 11 (01):
  • [26] Comparison of Semantic Similarity for Different Languages Using the Google n-gram Corpus and Second-Order Co-occurrence Measures
    Joubarne, Colette
    Inkpen, Diana
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 6657 : 216 - 221
  • [27] Filtering Spam Mail in Non-Segmented Languages Using Hybrid Approach: the Integration of Stopword Removal, N-gram Extraction and Classification Techniques
    Khumsong, Ployphailin
    Chumwatana, Todsanai
    Augsirikul, Supanit
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2016, 2016, : 373 - 378
  • [28] A comprehensive experimental and modelling approach for the evaluation of cross-over fluxes in Vanadium Redox Flow Battery
    Cecchetti, Marco
    Toja, Francesco
    Casalegno, Andrea
    Zago, Matteo
    JOURNAL OF ENERGY STORAGE, 2023, 68
  • [29] Evaluation of movie piracy using an integrated approach of interpretive structural modelling and MICMAC analysis
    Gupta, Pradeep Kumar
    Venkataramani, Bhama
    INTERNATIONAL JOURNAL OF INDIAN CULTURE AND BUSINESS MANAGEMENT, 2015, 11 (01) : 43 - 58
  • [30] Estimating parameters and structural change in CGE models using a Bayesian cross-entropy estimation approach
    Go, Delfin S.
    Lofgren, Hans
    Ramos, Fabian Mendez
    Robinson, Sherman
    ECONOMIC MODELLING, 2016, 52 : 790 - 811