CROSS-LINGUAL CYBERSECURITY ANALYTICS IN THE INTERNATIONAL DARK WEB WITH ADVERSARIAL DEEP REPRESENTATION LEARNING

被引:21
|
作者
Ebrahimi, Mohammadreza [1 ]
Chai, Yidong [2 ]
Samtani, Sagar [3 ]
Chen, Hsinchun [4 ]
机构
[1] Univ S Florida, Sch Informat Syst & Management, Tampa, FL 33620 USA
[2] Hefei Univ Technol, Sch Management, Anhua 230009, Peoples R China
[3] Indiana Univ, Dept Operat & Decis Technol, Bloomington, IN 47405 USA
[4] Univ Arizona, Dept Management Informat Syst, Tucson, AZ 85721 USA
基金
美国国家科学基金会;
关键词
Cybersecurity analytics; dark web; automated hacker asset detection; cross-lingual knowledge transfer; adversarial learning; computational design science;
D O I
10.25300/MISQ/2022/16618
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
International dark web platforms operating within multiple geopolitical regions and languages host a myriad of hacker assets such as malware, hacking tools, hacking tutorials, and malicious source code. Cybersecurity analytics organizations employ machine learning models trained on human-labeled data to automatically detect these assets and bolster their situational awareness. However, the lack of human-labeled training data is prohibitive when analyzing foreign-language dark web content. In this research note, we adopt the computational design science paradigm to develop a novel IT artifact for cross-lingual hacker asset detection (CLHAD). CLHAD automatically leverages the knowledge learned from English content to detect hacker assets in non-English dark web platforms. CLHAD encompasses a novel Adversarial deep representation learning (ADREL) method, which generates multilingual text representations using generative adversarial networks (GANs). Drawing upon the state of the art in cross-lingual knowledge transfer, ADREL is a novel approach to automatically extract transferable text representations and facilitate the analysis of multilingual content. We evaluate CLHAD on Russian, French, and Italian dark web platforms and demonstrate its practical utility in hacker asset profiling, and conduct a proof-of-concept case study. Our analysis suggests that cybersecurity managers may benefit more from focusing on Russian to identify sophisticated hacking assets. In contrast, financial hacker assets are scattered among several dominant dark web languages. Managerial insights for security managers are discussed at operational and strategic levels.
引用
收藏
页码:1209 / 1226
页数:18
相关论文
共 50 条
  • [31] MetaXL: Meta Representation Transformation for Low-resource Cross-lingual Learning
    Xia, Mengzhou
    Zheng, Guoqing
    Mukherjee, Subhabrata
    Shokouhi, Milad
    Neubig, Graham
    Awadallah, Ahmed Hassan
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 499 - 511
  • [32] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    Applied Intelligence, 2022, 52 : 3156 - 3174
  • [33] Cross-Lingual Propagation for Deep Sentiment Analysis
    Dong, Xin
    de Melo, Gerard
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5771 - 5778
  • [34] ON THE STUDY OF GENERATIVE ADVERSARIAL NETWORKS FOR CROSS-LINGUAL VOICE CONVERSION
    Sisman, Berrak
    Zhang, Mingyang
    Dong, Minghui
    Li, Haizhou
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 144 - 151
  • [35] Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS
    Kim, Min-Kyung
    Chang, Joon-Hyuk
    INTERSPEECH 2022, 2022, : 4556 - 4560
  • [36] Cross-Lingual Event Detection via Optimized Adversarial Training
    Guzman-Nateras, Luis F.
    Minh Van Nguyen
    Thien Huu Nguyen
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5588 - 5599
  • [37] Distillation Language Adversarial Network for Cross-lingual Sentiment Analysis
    Wang, Deheng
    Yang, Aimin
    Zhou, Yongmei
    Xie, Fenfang
    Ouyang, Zhouhao
    Peng, Sancheng
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 45 - 50
  • [38] Self-Attention with Cross-Lingual Position Representation
    Ding, Liang
    Wang, Longyue
    Tao, Dacheng
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1679 - 1685
  • [39] Adversarial Attack against Cross-lingual Knowledge Graph Alignment
    Zhang, Zeru
    Zhang, Zijie
    Zhou, Yang
    Wu, Lingfei
    Wu, Sixing
    Han, Xiaoying
    Dou, Dejing
    Che, Tianshi
    Yan, Da
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5320 - 5337
  • [40] Reliability of electric vehicle charging infrastructure: A cross-lingual deep learning approach
    Liu, Yifan
    Francis, Azell
    Hollauer, Catharina
    Lawson, M. Cade
    Shaikh, Omar
    Cotsman, Ashley
    Bhardwaj, Khushi
    Banboukian, Aline
    Li, Mimi
    Webb, Anne
    Asensio, Omar Isaac
    COMMUNICATIONS IN TRANSPORTATION RESEARCH, 2023, 3