novel feature selection based on apriori property and correlation analysis for protein sequence classification using MapReduce

被引:2
|
作者
Bhavani, R. [1 ]
Sadasivam, G. Sudha [2 ]
机构
[1] Govt Coll Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
[2] PSG Coll Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
关键词
apriori property; sequence classification; correlation analysis; feature subset selection; MapReduce; bioinformatics; STRUCTURAL CLASS;
D O I
10.1504/IJDMB.2017.10006248
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection is a crucial step in classification of protein sequences into existing superfamilies. Classifying protein sequences into different families based on their sequence patterns is helpful in predicting the structure and function of protein. This paper proposes a novel feature selection algorithm which first transforms the protein sequences into feature vectors and reduces the size of the feature vector based on the apriori property and correlation measure using MapReduce programming on Hadoop framework. Experimental results show that the proposed method of feature selection reduces the features by 99% and also improves accuracy by 5% to 6%.
引用
收藏
页码:255 / 265
页数:11
相关论文
共 50 条
  • [21] Gene selection and classification using correlation feature selection based binary bat algorithm with greedy crossover
    Seetharaman, Akila
    Sundersingh, Allin Christe
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (05):
  • [22] Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training
    Nasir, Inzamam Mashood
    Khan, Muhammad Attique
    Yasmin, Mussarat
    Shah, Jamal Hussain
    Gabryel, Marcin
    Scherer, Rafal
    Damasevicius, Robertas
    SENSORS, 2020, 20 (23) : 1 - 18
  • [23] Sarcasm classification: A novel approach by using Content Based Feature Selection Method
    Kumar, H. M. Keerthi
    Harish, B. S.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 378 - 386
  • [24] Correlation-based feature selection strategy in neural classification
    Michalak, Krzysztof
    Kwasnicka, Halina
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 741 - 746
  • [25] A Novel Sequence-Based Method for Phosphorylation Site Prediction with Feature Selection and Analysis
    He, Zhi-Song
    Shi, Xiao-He
    Kong, Xiang-Ying
    Zhu, Yu-Bei
    Chou, Kuo-Chen
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (01): : 70 - 78
  • [26] Intelligent IoT Traffic Classification Using Novel Search Strategy for Fast-Based-Correlation Feature Selection in Industrial Environments
    Egea, Santiago
    Rego Manez, Albert
    Carro, Belen
    Sanchez-Esguevillas, Antonio
    Lloret, Jaime
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (03): : 1616 - 1624
  • [27] Diagnosis of Bipolar Disease Using Correlation-Based Feature Selection with Different Classification Methods
    Cigdem, Ozkan
    Sulucay, Aysu
    Yilmaz, Arif
    Oguz, Kaya
    Demirel, Hasan
    Kitis, Omer
    Eker, Cagdas
    Gonul, Ali Saffet
    Unay, Devrim
    2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 456 - 459
  • [28] Gene Microarray Cancer Classification using Correlation Based Feature Selection Algorithm and Rules Classifiers
    Al-Batah, Mohammad
    Zaqaibeh, Belal
    Alomari, Saleh Ali
    Alzboon, Mowafaq Salem
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2019, 15 (08) : 62 - 73
  • [29] An effective ensemble classification framework using random forests and a correlation based feature selection technique
    Chutia, Dibyajyoti
    Bhattacharyya, Dhruba Kumar
    Sarma, Jaganath
    Raju, Penumetcha Narasa Lakshmi
    TRANSACTIONS IN GIS, 2017, 21 (06) : 1165 - 1178
  • [30] Feature Selection Based Classification of Sentiment Analysis using Biogeography Optimization Algorithm
    Shahid, Ramsha
    Javed, Sobia Tariq
    Zafar, Kashif
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN ELECTRICAL ENGINEERING AND COMPUTATIONAL TECHNOLOGIES (ICIEECT), 2017,