Analyzing CVE Database Using Unsupervised Topic Modelling

被引:7
|
作者
Vanamala, Mounika [1 ]
Yuan, Xiaohong [1 ]
Bandaru, Kanishka [2 ]
机构
[1] North Carolina A&T State Univ, Dept Comp Sci, Greensboro, NC 27411 USA
[2] Birla Inst Technol & Sci, Comp Sci Engn, Hyderabad, India
关键词
Probabilistic Topic Modeling; Latent Dirichlet Allocation; Topic Modelling; CVE; OWASP;
D O I
10.1109/CSCI49370.2019.00019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes our study of the vulnerability reports in the Common Vulnerability and Exposures (CVE) database by using topic modeling on the description texts of the vulnerabilities. Prevalent vulnerability types were found, and new trends of vulnerabilities were discovered by studying the 121,716 unique CVE entries that are reported from January 1999 to July 2019. The topics found through topic modeling were mapped to OWASP Top 10 vulnerabilities. It was found that the OWASP vulnerabilities A2: 2017-Broken Authentication, A4:2017-XML External Entities (XXE), and A5:2017-Broken Access Control increased, yet the vulnerability A7:2017-Cross-Site Scripting (XSS) had a steep decrease over the period of 20 years.
引用
收藏
页码:72 / 77
页数:6
相关论文
共 50 条
  • [1] Using Topic Modelling for Analyzing Panamanian Parliamentary Proceedings with Neural and Statistical Methods
    Contreras, Kenji
    Verbel, Gabriel
    Sanchez, Jose
    Sanchez-Galan, Javier E.
    PROCEEDINGS OF THE 2022 IEEE 40TH CENTRAL AMERICA AND PANAMA CONVENTION (CONCAPAN), 2022,
  • [2] Analyzing Scientific Publications using Domain-Specific Word Embedding and Topic Modelling
    Singhal, Trisha
    Liu, Junhua
    Blessing, Lucienne T. M.
    Lim, Kwan Hui
    Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021, 2021, : 4965 - 4973
  • [3] Analyzing Scientific Publications using Domain-Specific Word Embedding and Topic Modelling
    Singhal, Trisha
    Liu, Junhua
    Blessing, Lucienne T. M.
    Lim, Kwan Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4965 - 4973
  • [4] Unsupervised Topic Modelling for Multi-Party Spoken Discourse
    Purver, Matthew
    Koerding, Konrad P.
    Griffiths, Thomas L.
    Tenenbaum, Joshua B.
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 17 - 24
  • [5] Using structural topic modelling to reveal patterns in reports on opioid drugs in a pharmacovigilance database
    Losch, Lea
    Brown, Patrick
    van Hunsel, Florence
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2022, 31 (09) : 1003 - 1006
  • [6] Using structural topic modelling to reveal patterns in reports on opioid drugs in a pharmacovigilance database
    Losch, Lea
    Brown, Patrick
    Van Hunsel, Florence
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2022, 31 : 641 - 641
  • [7] Using topic modelling for unsupervised annotation of electronic health records to identify an outbreak of disease in UK dogs
    Noble, Peter-John Mantyla
    Appleton, Charlotte
    Radford, Alan David
    Nenadic, Goran
    PLOS ONE, 2021, 16 (12):
  • [8] Analyzing the history of Cognition using Topic Models
    Priva, Uriel Cohen
    Austerweil, Joseph L.
    COGNITION, 2015, 135 : 4 - 9
  • [9] Unsupervised title and abstract screening for systematic review: a retrospective case-study using topic modelling methodology
    Agnes Natukunda
    Leacky K. Muchene
    Systematic Reviews, 12
  • [10] Unsupervised title and abstract screening for systematic review: a retrospective case-study using topic modelling methodology
    Natukunda, Agnes
    Muchene, Leacky K. K.
    SYSTEMATIC REVIEWS, 2023, 12 (01)