A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

被引:13
|
作者
Dasgupta, Soham [2 ]
Piplai, Aritran [1 ]
Kotal, Anantaa [1 ]
Joshi, Anupam [1 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
[2] Mallya Aditi Int Sch, Bengaluru, Karnataka, India
关键词
Named Entity Recognition; Deep Learning; Cybersecurity; Artificial Intelligence;
D O I
10.1109/BigData50022.2020.9378482
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition (NER) is important in the cybersecurity domain. It helps researchers extract cyber threat information from unstructured text sources. The extracted cyber-entities or key expressions can be used to model a cyber-attack described in an open-source text. A large number of generalpurpose NER algorithms have been published that work well in text analysis. These algorithms do not perform well when applied to the cybersecurity domain. In the field of cybersecurity, the open-source text available varies greatly in complexity and underlying structure of the sentences. General-purpose NER algorithms can misrepresent domain-specific words, such as "malicious" and "javascript". In this paper, we compare the recent deep learning-based NER algorithms on a cybersecurity dataset. We created a cybersecurity dataset collected from various sources, including "Microsoft Security Bulletin" and "Adobe Security Updates". Some of these approaches proposed in literature were not used for Cybersecurity. Others are innovations proposed by us. This comparative study helps us identify the NER algorithms that are robust and can work well in sentences taken from a large number of cybersecurity sources. We tabulate their performance on the test set and identify the best NER algorithm for a cybersecurity corpus. We also discuss the different embedding strategies that aid in the process of NER for the chosen deep learning algorithms.
引用
收藏
页码:2596 / 2604
页数:9
相关论文
共 50 条
  • [1] Named entity recognition based on deep learning
    Ji, Zhenyan
    Kong, Deyan
    Liu, Wei
    Dong, Wei
    Sang, Yanjuan
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (06): : 1603 - 1615
  • [2] A Comparative Study of Named Entity Recognition for Hindi Using Sequential Learning Algorithms
    Krishnarao, Awaghad Ashish
    Gahlot, Himanshu
    Srinet, Amit
    Kushwaha, D. S.
    [J]. 2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1163 - 1168
  • [3] Adversarial Active Learning for Named Entity Recognition in Cybersecurity
    Li, Tao
    Hu, Yongjin
    Ju, Ankang
    Hu, Zhuoran
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 66 (01): : 407 - 420
  • [4] Military Named Entity Recognition Method Based on Deep Learning
    Wang, Xuefeng
    Yang, Ruopeng
    Lu, Yiwei
    Wu, Qingfeng
    [J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 479 - 483
  • [5] Subsequence Based Deep Active Learning for Named Entity Recognition
    Radmard, Puria
    Fathullah, Yassir
    Lipani, Aldo
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4310 - 4321
  • [6] Automatic Configuration of Deep Learning Algorithms for an Arabic Named Entity Recognition System
    Azroumahli, Chaimae
    Mouhib, Ibtihal
    El Younoussi, Yacine
    Badir, Hassan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 106 - 113
  • [7] MetaboListem and TABoLiSTM: Two Deep Learning Algorithms for Metabolite Named Entity Recognition
    Yeung, Cheng S.
    Beck, Tim
    Posma, Joram M.
    [J]. METABOLITES, 2022, 12 (04)
  • [8] A Survey on Deep Learning for Named Entity Recognition
    Li, Jing
    Sun, Aixin
    Han, Jianglei
    Li, Chenliang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (01) : 50 - 70
  • [9] Turkish Named Entity Recognition with Deep Learning
    Gunes, Asim
    Tantug, A. Cuneyd
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [10] A Deep Learning Solution to Named Entity Recognition
    Murthy, V. Rudra
    Bhattacharyya, Pushpak
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 427 - 438