Text Mining in Cybersecurity: A Systematic Literature Review

被引:20
|
作者
Ignaczak, Luciano [1 ]
Goldschmidt, Guilherme [1 ]
Da Costa, Cristiano Andre [1 ]
Righi, Rodrigo Da Rosa [1 ]
机构
[1] Unisinos Univ, Lab Software Innovat, Sao Leopoldo, Brazil
关键词
Cybersecurity; text mining; natural language processing; systematic literature review; LINGUISTIC STEGANOGRAPHY; INFORMATION SECURITY; SENTIMENT ANALYSIS; ARTIFICIAL-INTELLIGENCE; THREAT INTELLIGENCE; SEMANTIC ANALYSIS; NEURAL-NETWORKS; SPAM DETECTION; DATA BREACHES; SOCIAL MEDIA;
D O I
10.1145/3462477
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The growth of data volume has changed cybersecurity activities, demanding a higher level of automation. In this new cybersecurity landscape, text mining emerged as an alternative to improve the efficiency of the activities involving unstructured data. This article proposes a Systematic Literature Review (SLR) to present the application of text mining in the cybersecurity domain. Using a systematic protocol, we identified 2,196 studies, out of which 83 were summarized. As a contribution, we propose a taxonomy to demonstrate the different activities in the cybersecurity domain supported by text mining. We also detail the strategies evaluated in the application of text mining tasks and the use of neural networks to support activities involving unstructured data. The work also discusses text classification performance aiming its application in real-world solutions. The SLR also highlights open gaps for future research, such as the analysis of non-English content and the intensification in the usage of neural networks.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] Cybersecurity Analysis via Process Mining: A Systematic Literature Review
    Macak, Martin
    Daubner, Lukas
    Sani, Mohammadreza Fani
    Buhnova, Barbora
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT I, 2022, 13087 : 393 - 407
  • [2] Crowdsourcing: a systematic review of the literature using text mining
    Pavlidou, Ioanna
    Papagiannidis, Savvas
    Tsui, Eric
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2020, 120 (11) : 2041 - 2065
  • [3] Text mining applications in psychiatry: a systematic literature review
    Abbe, Adeline
    Grouin, Cyril
    Zweigenbaum, Pierre
    Falissard, Bruno
    [J]. INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2016, 25 (02) : 86 - 100
  • [4] Text-mining Techniques and Tools for Systematic Literature Reviews: A Systematic Literature Review
    Feng, Luyi
    Chiam, Yin Kia
    Lo, Sin Kuang
    [J]. 2017 24TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2017), 2017, : 41 - 50
  • [5] Process mining usage in cybersecurity and software reliability analysis: A systematic literature review
    Macak, Martin
    Daubner, Lukas
    Sani, Mohammadreza Fani
    Buhnova, Barbora
    [J]. ARRAY, 2022, 13
  • [6] A Systematic Literature Review of Sexual Harassment Studies with Text Mining
    Karami, Amir
    Spinel, Melek Yildiz
    White, C. Nicole
    Ford, Kayla
    Swan, Suzanne
    [J]. SUSTAINABILITY, 2021, 13 (12)
  • [7] Twitter and Research: A Systematic Literature Review Through Text Mining
    Karami, Amir
    Lundy, Morgan
    Webb, Frank
    Dwivedi, Yogesh K.
    [J]. IEEE ACCESS, 2020, 8 (08): : 67698 - 67717
  • [8] Systematic Literature Review Of Hate Speech Detection With Text Mining
    Rini
    Utami, Ema
    Hartanto, Anggit Dwi
    [J]. PROCEEDINGS OF ICORIS 2020: 2020 THE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS), 2020, : 228 - 233
  • [9] Text Mining, Clustering and Sentiment analysis: A systematic Literature Review
    Hoti, Mergim H.
    Ajdari, Jaumin
    Hamiti, Mentor
    Zenuni, Xhemal
    [J]. 2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 302 - 307
  • [10] Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
    Caballero-Julia, Daniel
    Campillo, Philippe
    [J]. MATHEMATICS, 2021, 9 (16)