Automated U.S Diplomatic Cables Security Classification: Topic Model Pruning vs. Classification Based on Clusters

被引:0
|
作者
Alzhrani, Khudran [1 ]
Rudd, Ethan M. [1 ,2 ]
Chow, C. Edward [1 ]
Boult, Terrance E. [1 ,2 ]
机构
[1] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80907 USA
[2] Univ Colorado, VAST Lab, Colorado Springs, CO 80907 USA
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The U.S Government has been the target for cyber-attacks from all over the world. Just recently, former President Obama accused the Russian government of the leaking emails to Wikileaks and declared that the U.S. might be forced to respond. While Russia denied involvement, it is clear that the U.S. has to take some defensive measures to protect its data infrastructure. Insider threats have been the cause of other sensitive information leaks too, including the infamous Edward Snowden incident. Most of the recent leaks were in the form of text. Due to the nature of text data, security classifications are assigned manually. In an adversarial environment, insiders can leak texts through E-mail, printers, or any untrusted channels. The optimal defense is to automatically detect the unstructured text security class and enforce the appropriate protection mechanism without degrading services or daily tasks. Unfortunately, existing Data Leak Prevention (DLP) systems are not well suited for detecting unstructured texts. In this paper, we compare two recent approaches in the literature for text security classification, evaluating them on actual sensitive text data from the WikiLeaks dataset.
引用
收藏
页数:6
相关论文
共 7 条
  • [1] ON THE CLASSIFICATION OF PERSONAL MEANING: THEORY-GOVERNED TYPOLOGY VS. EMPIRICISM-BASED CLUSTERS
    Vollstedt, Maike
    PROCEEDINGS OF THE 35TH CONFERENCE OF THE INTERNATIONAL GROUP FOR PSYCHOLOGY OF MATHEMATICS EDUCATION, VOL. 4: DEVELOPING MATHEMATICAL THINKING, 2011, : 321 - 328
  • [2] SoK: Machine vs. machine - A systematic classification of automated machine learning-based CAPTCHA solvers
    Dionysiou, Antreas
    Athanasopoulos, Elias
    COMPUTERS & SECURITY, 2020, 97
  • [3] USTW Vs. STW: A Comparative Analysis for Exam Question Classification based on Bloom’s Taxonomy
    Gani M.O.
    Ayyasamy R.K.
    Fui T.
    Sangodiah A.
    Mendel, 2022, 28 (02) : 25 - 40
  • [4] Automated Detection and Classification of Positive vs. Negative Robot Interactions With Children With Autism Using Distance-Based Features
    Feil-Seifer, David
    Mataric, Maja J.
    PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, : 323 - 330
  • [5] Automated text classification of opinion vs. news French press articles. A comparison of transformer and feature-based approaches
    Escou, Louis
    Descampe, Antonin
    Fairon, Cedrick
    LANGUAGE & COMMUNICATION, 2024, 99 : 129 - 140
  • [6] Novel nested patch-based feature extraction model for automated Parkinson's Disease symptom classification using MRI images
    Kaplan, Ela
    Altunisik, Erman
    Firat, Yasemin Ekmekyapar
    Barua, Prabal Datta
    Dogan, Sengul
    Baygin, Mehmet
    Demir, Fahrettin Burak
    Tuncer, Turker
    Palmer, Elizabeth
    Tan, Ru-San
    Yu, Ping
    Soar, Jeffrey
    Fujita, Hamido
    Acharya, U. Rajendra
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 224
  • [7] RETRACTED: An Intelligent Security Classification Model of Driver's Driving Behavior Based on V2X in IoT Networks (Retracted Article)
    Dai, Songyin
    Zhong, Yuan
    Xu, Cheng
    Liu, Hongzhe
    Yuan, Jiazheng
    Wang, Pengfei
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022