Gene/protein name recognition based on support vector machine using dictionary as features

被引:33
|
作者
Mitsumori, T
Fation, S
Murata, M
Doi, K
Doi, H
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300101, Japan
[2] Natl Inst Informat & Commun Technol, Kyoto 6190289, Japan
关键词
D O I
10.1186/1471-2105-6-S1-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Automated information extraction from biomedical literature is important because a vast amount of biomedical literature has been published. Recognition of the biomedical named entities is the first step in information extraction. We developed an automated recognition system based on the SVM algorithm and evaluated it in Task 1.A of BioCreAtIvE, a competition for automated gene/protein name recognition. Results: In the work presented here, our recognition system uses the feature set of the word, the part-of-speech (POS), the orthography, the prefix, the suffix, and the preceding class. We call these features "internal resource features", i.e., features that can be found in the training data. Additionally, we consider the features of matching against dictionaries to be external resource features. We investigated and evaluated the effect of these features as well as the effect of tuning the parameters of the SVM algorithm. We found that the dictionary matching features contributed slightly to the improvement in the performance of the f-score. We attribute this to the possibility that the dictionary matching features might overlap with other features in the current multiple feature setting. Conclusion: During SVM learning, each feature alone had a marginally positive effect on system performance. This supports the fact that the SVM algorithm is robust on the high dimensionality of the feature vector space and means that feature selection is not required.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] A Target Recognition Algorithm Based on Support Vector Machine
    Ding, Yan
    Jin, Weiqi
    Yu, Yuhong
    Wang, Han
    2008 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTICAL SYSTEMS AND OPTOELECTRONIC INSTRUMENTS, 2009, 7156
  • [42] Emotion Recognition of Electromyography based on Support Vector Machine
    Yang Guangying
    Yang Shanxiao
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 298 - 301
  • [43] Handwritten Digit Recognition Based on Support Vector Machine
    Gao, Xinwen
    Guan, Benbo
    Yu, Liqing
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INFORMATION SCIENCES, MACHINERY, MATERIALS AND ENERGY (ICISMME 2015), 2015, 126 : 941 - 944
  • [44] Radar target recognition based on Support Vector Machine
    Zhang, L
    Zhou, WD
    Jiao, LC
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1453 - 1456
  • [45] Prediction of Membrane Protein Types by Using Support Vector Machine Based on composite vector
    Wang, Ting
    Hu, Xiu Zhen
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 1499 - 1503
  • [46] Corn breed recognition based on support vector machine
    Cheng, Hong
    Shi, Zhixing
    Yao, Wei
    Wang, Lei
    Pang, Lixin
    Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2009, 40 (03): : 180 - 183
  • [47] License plate recognition based on Support Vector Machine
    Abdullah, Siti Norul Huda Sheikh
    Omar, Khairuddin
    Sahran, Shahnorbanun
    Khalid, Marzuki
    2009 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS, VOLS 1 AND 2, 2009, : 78 - 82
  • [48] Intelligent target recognition based on the support vector machine
    Ding, Ai-Ling
    Liu, Fang
    Yao, Xia
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2001, 28 (06): : 743 - 746
  • [49] An algorithm of gait recognition based on support vector machine
    Du, Libin
    Shao, Wenxin
    Journal of Computational Information Systems, 2011, 7 (13): : 4710 - 4715
  • [50] Stratum Recognition Method Based on Support Vector Machine
    Wu Wei-jiang
    Li Guo-he
    Li Hong-qi
    2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 317 - 320