Gene/protein name recognition based on support vector machine using dictionary as features

被引:33
|
作者
Mitsumori, T
Fation, S
Murata, M
Doi, K
Doi, H
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300101, Japan
[2] Natl Inst Informat & Commun Technol, Kyoto 6190289, Japan
关键词
D O I
10.1186/1471-2105-6-S1-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Automated information extraction from biomedical literature is important because a vast amount of biomedical literature has been published. Recognition of the biomedical named entities is the first step in information extraction. We developed an automated recognition system based on the SVM algorithm and evaluated it in Task 1.A of BioCreAtIvE, a competition for automated gene/protein name recognition. Results: In the work presented here, our recognition system uses the feature set of the word, the part-of-speech (POS), the orthography, the prefix, the suffix, and the preceding class. We call these features "internal resource features", i.e., features that can be found in the training data. Additionally, we consider the features of matching against dictionaries to be external resource features. We investigated and evaluated the effect of these features as well as the effect of tuning the parameters of the SVM algorithm. We found that the dictionary matching features contributed slightly to the improvement in the performance of the f-score. We attribute this to the possibility that the dictionary matching features might overlap with other features in the current multiple feature setting. Conclusion: During SVM learning, each feature alone had a marginally positive effect on system performance. This supports the fact that the SVM algorithm is robust on the high dimensionality of the feature vector space and means that feature selection is not required.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Facial expression recognition using a combination of multiple facial features and support vector machine
    Tsai, Hung-Hsu
    Chang, Yi-Cheng
    SOFT COMPUTING, 2018, 22 (13) : 4389 - 4405
  • [22] Recognition of people reoccurrences using bag-of-features representation and support vector machine
    Liu, Kun
    Yang, Jie
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 573 - 577
  • [23] Ear recognition using features inspired by visual cortex and support vector machine technique
    Yaqubi, Mahboubeh
    Faez, Karim
    Motamed, Sara
    2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 533 - +
  • [24] Facial expression recognition using a combination of multiple facial features and support vector machine
    Hung-Hsu Tsai
    Yi-Cheng Chang
    Soft Computing, 2018, 22 : 4389 - 4405
  • [25] FACIAL EXPRESSION RECOGNITION BASED ON LOCAL BINARY PATTERN FEATURES AND SUPPORT VECTOR MACHINE
    Nhan Thi Cao
    An Hoa Ton-That
    Hyung Il Choi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (06)
  • [26] Automatic recognition system for concrete cracks with support vector machine based on crack features
    Wang, Rui
    Chen, Rui-Qi
    Guo, Xin-Xin
    Liu, Jia-Xuan
    Yu, Hai-Ying
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [27] PIPELINE LEAKAGE RECOGNITION BASED ON THE PROJECTION SINGULAR VALUE FEATURES AND SUPPORT VECTOR MACHINE
    Wang Mingda
    Zhang Laibin
    Liang Wei
    Hu Jinqiu
    PROCEEDINGS OF THE ASME INTERNATIONAL PIPELINE CONFERENCE 2010, VOL 3, 2010, : 471 - 476
  • [28] Performance Analysis of Zone Based Features for Online Handwritten Gurmukhi Script Recognition using Support Vector Machine
    Verma, Karun
    Sharma, R. K.
    PROGRESS IN SYSTEMS ENGINEERING, 2015, 366 : 747 - 753
  • [29] Prediction of Protein-Protein Interactions Based on Molecular Interface Features and the Support Vector Machine
    Zhou, Weiqiang
    Yan, Hong
    Fan, Xiaodan
    Hao, Quan
    CURRENT BIOINFORMATICS, 2013, 8 (01) : 3 - 8
  • [30] Identification of osteoporosis based on gene biomarkers using support vector machine
    Lv, Nanning
    Zhou, Zhangzhe
    He, Shuangjun
    Shao, Xiaofeng
    Zhou, Xinfeng
    Feng, Xiaoxiao
    Qian, Zhonglai
    Zhang, Yijian
    Liu, Mingming
    OPEN MEDICINE, 2022, 17 (01): : 1216 - 1227