Analyzing CVE Database Using Unsupervised Topic Modelling

被引:7
|
作者
Vanamala, Mounika [1 ]
Yuan, Xiaohong [1 ]
Bandaru, Kanishka [2 ]
机构
[1] North Carolina A&T State Univ, Dept Comp Sci, Greensboro, NC 27411 USA
[2] Birla Inst Technol & Sci, Comp Sci Engn, Hyderabad, India
关键词
Probabilistic Topic Modeling; Latent Dirichlet Allocation; Topic Modelling; CVE; OWASP;
D O I
10.1109/CSCI49370.2019.00019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes our study of the vulnerability reports in the Common Vulnerability and Exposures (CVE) database by using topic modeling on the description texts of the vulnerabilities. Prevalent vulnerability types were found, and new trends of vulnerabilities were discovered by studying the 121,716 unique CVE entries that are reported from January 1999 to July 2019. The topics found through topic modeling were mapped to OWASP Top 10 vulnerabilities. It was found that the OWASP vulnerabilities A2: 2017-Broken Authentication, A4:2017-XML External Entities (XXE), and A5:2017-Broken Access Control increased, yet the vulnerability A7:2017-Cross-Site Scripting (XSS) had a steep decrease over the period of 20 years.
引用
收藏
页码:72 / 77
页数:6
相关论文
共 50 条
  • [31] Mining Contentious Documents Using an Unsupervised Topic Model Based Approach
    Trabelsi, Amine
    Zaiane, Osmar R.
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 550 - 559
  • [32] Unsupervised Satellite Image Classification Using Markov Field Topic Model
    Xu, Kan
    Yang, Wen
    Liu, Gang
    Sun, Hong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2013, 10 (01) : 130 - 134
  • [33] Understanding Russian Information Operations Using Unsupervised Multilingual Topic Modeling
    Chew, Peter A.
    Turnley, Jessica G.
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, 2017, 10354 : 102 - 107
  • [34] TOPIC IDENTIFICATION OF SPOKEN DOCUMENTS USING UNSUPERVISED ACOUSTIC UNIT DISCOVERY
    Kesiraju, Santosh
    Pappagari, Raghavendra
    Ondel, Lucas
    Burget, Lukas
    Dehak, Najim
    Khudanpur, Sanjeev
    Cernocky, Jan Honza
    Gangashetty, Suryakanth V.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5745 - 5749
  • [35] Discovery of activity composites using topic models: An analysis of unsupervised methods
    Seiter, Julia
    Amft, Oliver
    Rossi, Mirco
    Troster, Gerhard
    PERVASIVE AND MOBILE COMPUTING, 2014, 15 : 215 - 227
  • [36] Analyzing Novice Programmers' EEG Signals using Unsupervised Algorithms
    Swansi, Vanlalhruaii
    Herradura, Tita
    Suarez, Merlin Teodosia
    25TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2017): TECHNOLOGY AND INNOVATION: COMPUTER-BASED EDUCATIONAL SYSTEMS FOR THE 21ST CENTURY, 2017, : 113 - 115
  • [37] Analyzing Harmonic Monitoring Data Using Supervised and Unsupervised Learning
    Asheibi, Ali
    Stirling, David
    Sutanto, Danny
    IEEE TRANSACTIONS ON POWER DELIVERY, 2009, 24 (01) : 293 - 301
  • [38] Identifying and Analyzing Atypical Flights by Using Supervised and Unsupervised Approaches
    Clachar, Sophine A.
    TRANSPORTATION RESEARCH RECORD, 2015, (2471) : 10 - 18
  • [39] Analyzing Gaze Behavior Using Object Detection and Unsupervised Clustering
    Venuprasad, Pranav
    Xu, Li
    Huang, Enoch
    Gilman, Andrew
    Chukoskie, Leanne
    Cosman, Pamela
    ETRA'20 FULL PAPERS: ACM SYMPOSIUM ON EYE TRACKING RESEARCH AND APPLICATIONS, 2020,
  • [40] Detecting Similar Linked Datasets Using Topic Modelling
    Roeder, Michael
    Ngomo, Axel-Cyrille Ngonga
    Ermilov, Ivan
    Both, Andreas
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 3 - 19