Feature Selection for Chemical Compound Extraction using Wrapper Approach with Naive Bayes Classifier

被引:0
|
作者
Alshaikhdeeb, Basel [1 ]
Ahmad, Kamsuriah [1 ]
机构
[1] Natl Univ Malaysia, Dept Strateg Informat Syst, Bangi, Selangor, Malaysia
关键词
Biomedical Named Entity Recognition; Chemical Compounds Extraction; Naive Bayes; Feature Selection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Biomedical Entity extraction is the process of identifying biomedical instances such as disorders, viruses, proteins, genes and others. One of these instances is the chemical compound which caught many researchers' attentions regarding the challenging task of extracting them. In fact, most of the studies that have been proposed for chemical compounds extraction have relied on supervised machine learning techniques regarding its ability to adopt a statistical model rather than handcrafted rules. However, the key characteristic of the use of supervised machine learning techniques lies on the utilized features. There is a wide range of features that have been used in the previous studies for the process of extracting chemical compounds. Hence, the need of accommodating a feature selection task in order to determine the best combination of features is becoming imperative. Therefore, this paper aims to apply a combination of Naive Bayes classification method with the Wrapper Subset Selection approach to identify the best features. Results showed that the proposed combination has the ability to identify the best combination of features which consists of Capitalization, Punctuation, Prefix and Part-Of-Speech Tagging by achieving 0.72 of f-measure. Such result has been compared to the state of the art and it demonstrated competitive performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Speeding up incremental wrapper feature subset selection with Naive Bayes classifier
    Bermejo, Pablo
    Gamez, Jose A.
    Puerta, Jose M.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 55 : 140 - 147
  • [2] An Efficient Feature Selection using Parallel Cuckoo Search and Naive Bayes classifier
    Sujana, T. Sai
    Rao, N. Madhu Sudana
    Reddy, Raja Sekar
    [J]. 2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 167 - 172
  • [3] Naive Bayes Classifier with Feature Selection to Identify Phage Virion Proteins
    Feng, Peng-Mian
    Ding, Hui
    Chen, Wei
    Lin, Hao
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [4] Feature extraction and classification of proteomics data using stationary wavelet transform and naive Bayes classifier
    Liu Dan
    Huang Yuan-yuan
    Ma Chen-xiang
    [J]. 2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [5] Intrusion Detection using Naive Bayes Classifier with Feature Reduction
    Mukherjee, Saurabh
    Sharma, Neelam
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 119 - 128
  • [6] Intrusion Detection Model Using Chi Square Feature Selection and Modified Naive Bayes Classifier
    Thaseen, I. Sumaiya
    Kumar, Ch. Aswani
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON BIG DATA AND CLOUD COMPUTING CHALLENGES (ISBCC - 16'), 2016, 49 : 81 - 91
  • [7] Advanced Naive Bayes Text Classifier with Embedded Feature Weighting Approach
    Kim, Han-joon
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2009, 12 (03): : 607 - 620
  • [8] Naive bayes face/nonface classifier: A study of preprocessing and feature extraction techniques
    Phung, SL
    Bouzerdoum, A
    Chai, D
    Watson, A
    [J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 1385 - 1388
  • [9] Predicting peptide bond conformation using feature selection and the Naive Bayes approach
    Exarchos, Kostas P.
    Exarchos, Themis P.
    Papaloukas, Costas
    Troganis, Anastassios N.
    Fotiadis, Dimitrios I.
    [J]. 2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 5009 - +
  • [10] Naive Feature Selection: Sparsity in Naive Bayes
    Askari, Armin
    d'Aspremont, Alex
    El Ghaoui, Laurent
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1813 - 1821