A SVM-based Compound-word Recognition Method in Information Security

被引:0
|
作者
Li, Shixian [2 ]
Zhang, Lei [2 ]
Han, Bo [1 ]
Lei, Tingrui [1 ]
Wang, Qing [1 ]
Peng, Tao [3 ]
Cao, Peng [3 ]
机构
[1] Wuhan Univ, Int Sch Software, Wuhan 430072, Peoples R China
[2] China Informat Technol Secur Evaluat Ctr, Beijing, Peoples R China
[3] RuiDa Informat Secur Ind Co Ltd, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
depth first traversal; compound-word; SVM; domain contrast corpus;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the emergence of mobile Internet, Internet of things and cloud computing, the domain of information security is in a rapid development. As a result, a constant stream of compound-words describing new concepts and new technologies has arisen. However, the existing dictionary does not collect those new compound-words in time, so it cannot identify them correctly. In order to solve this problem, this paper presents a SVM-based compound-word recognition method in information security. The method is based on the outputs of the existing word segmentation system. It constructs adjacent atom-word digraph according to the statistical co-occurrence features and lexical rules. Next, it produces compound-word candidate set through deep traverse the digraph by the longest match principle. It further filters the candidate set by using a SVM classifier with the help of domain contrast corpus and computer dictionary. We use this method to identify new compound-words from 2200 vulnerability description texts. It achieves a precision of 82.25% and recall of 77.44%. The results show that our method is able to effectively identify new compound-words in information security from large scale of corpus.
引用
收藏
页码:837 / 841
页数:5
相关论文
共 50 条
  • [1] A SVM-based compound-word recognition method in information security
    Li, Shixian
    Zhang, Lei
    Han, Bo
    Lei, Tingrui
    Wang, Qing
    Peng, Tao
    Cao, Peng
    [J]. Proceedings - 2013 10th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2013, 2013, : 837 - 841
  • [2] The Compound-Word Effect in Associative Recognition
    Ahmad, Fahad Naveed
    [J]. CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2012, 66 (04): : 295 - 295
  • [3] SVM-Based Method for Perceptual Image Recognition
    Yang, Cheng
    Zhu, Bin
    An, Fang
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 264 - 267
  • [4] A new information fusion method for SVM-Based robotic audio-visual emotion recognition
    Han, Meng-Ju
    Hsu, Jing-Huai
    Song, Kai-Tai
    Chang, Fuh-Yu
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2464 - +
  • [5] Power quality events recognition using a SVM-based method
    Cerqueira, Augusto Santiago
    Ferreira, Danton Diego
    Ribeiro, Moises Vidal
    Duque, Carlos Augusto
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2008, 78 (09) : 1546 - 1552
  • [6] A novel SVM-based method for moving video objects recognition
    Kong, Xiaodong
    Luo, Qingshan
    Zeng, Guihua
    [J]. ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 136 - +
  • [7] SVM-based Hypertext Information Categorization
    Qing, Liu
    [J]. 2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 1351 - 1353
  • [8] EHPred: an SVM-based method for epoxide hydrolases recognition and classification.
    Jia J.
    Yang L.
    Zhang Z.Z.
    [J]. Journal of Zhejiang University SCIENCE B, 2006, 7 (1): : 1 - 6
  • [9] SVM-based feature extraction for face recognition
    Kim, Sang-Ki
    Park, Youn Jung
    Toh, Kar-Ann
    Lee, Sangyoun
    [J]. PATTERN RECOGNITION, 2010, 43 (08) : 2871 - 2881
  • [10] SVM-based PQ disturbance recognition system
    Huang, Jiansheng
    Jiang, Zhuhan
    Rylands, Leanne
    Negnevitsky, Michael
    [J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2018, 12 (02) : 328 - 334