CROSS-LINGUAL CYBERSECURITY ANALYTICS IN THE INTERNATIONAL DARK WEB WITH ADVERSARIAL DEEP REPRESENTATION LEARNING

被引:21
|
作者
Ebrahimi, Mohammadreza [1 ]
Chai, Yidong [2 ]
Samtani, Sagar [3 ]
Chen, Hsinchun [4 ]
机构
[1] Univ S Florida, Sch Informat Syst & Management, Tampa, FL 33620 USA
[2] Hefei Univ Technol, Sch Management, Anhua 230009, Peoples R China
[3] Indiana Univ, Dept Operat & Decis Technol, Bloomington, IN 47405 USA
[4] Univ Arizona, Dept Management Informat Syst, Tucson, AZ 85721 USA
基金
美国国家科学基金会;
关键词
Cybersecurity analytics; dark web; automated hacker asset detection; cross-lingual knowledge transfer; adversarial learning; computational design science;
D O I
10.25300/MISQ/2022/16618
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
International dark web platforms operating within multiple geopolitical regions and languages host a myriad of hacker assets such as malware, hacking tools, hacking tutorials, and malicious source code. Cybersecurity analytics organizations employ machine learning models trained on human-labeled data to automatically detect these assets and bolster their situational awareness. However, the lack of human-labeled training data is prohibitive when analyzing foreign-language dark web content. In this research note, we adopt the computational design science paradigm to develop a novel IT artifact for cross-lingual hacker asset detection (CLHAD). CLHAD automatically leverages the knowledge learned from English content to detect hacker assets in non-English dark web platforms. CLHAD encompasses a novel Adversarial deep representation learning (ADREL) method, which generates multilingual text representations using generative adversarial networks (GANs). Drawing upon the state of the art in cross-lingual knowledge transfer, ADREL is a novel approach to automatically extract transferable text representations and facilitate the analysis of multilingual content. We evaluate CLHAD on Russian, French, and Italian dark web platforms and demonstrate its practical utility in hacker asset profiling, and conduct a proof-of-concept case study. Our analysis suggests that cybersecurity managers may benefit more from focusing on Russian to identify sophisticated hacking assets. In contrast, financial hacker assets are scattered among several dominant dark web languages. Managerial insights for security managers are discussed at operational and strategic levels.
引用
收藏
页码:1209 / 1226
页数:18
相关论文
共 50 条
  • [41] Cross-Lingual Sentiment Analysis in Deep Learning: A Comparative Study of Multilingual Approaches
    Kumar, Rishabh
    Kumar, Rajat
    Singh, Ritik
    Katarya, Rahul
    2023 14th International Conference on Computing Communication and Networking Technologies, ICCCNT 2023, 2023,
  • [42] Cross-lingual analysis of English and Chinese web search
    Lin, Peiguang
    Zhang, Tong
    Xia, Menglong
    Zhou, Jin
    Nie, Peiyao
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2018, 14 (04) : 376 - 399
  • [43] Document Similarity for Arabic and Cross-Lingual Web Content
    Salhi, Ali
    Yahya, Adnan H.
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, 2018, 782 : 134 - 146
  • [44] Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates
    Qi, Kunxun
    Wan, Hai
    Du, Jianfeng
    Chen, Haolan
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1910 - 1923
  • [45] Cross-lingual deep learning model for gender-based emotion detection
    Bhattacharya, Sudipta
    Mishra, Brojo Kishore
    Borah, Samarjeet
    Das, Nabanita
    Dey, Nilanjan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 25969 - 26007
  • [46] A cross-lingual framework for web news taxonomy integration
    Yang, Cheng-Zen
    Chen, Che-Min
    Chen, Ing-Xiang
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 270 - +
  • [47] Cross-lingual deep learning model for gender-based emotion detection
    Sudipta Bhattacharya
    Brojo Kishore Mishra
    Samarjeet Borah
    Nabanita Das
    Nilanjan Dey
    Multimedia Tools and Applications, 2024, 83 (9) : 25969 - 26007
  • [48] Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision
    Cao, Yixin
    Hou, Lei
    Li, Juanzi
    Liu, Zhiyuan
    Li, Chengjiang
    Chen, Xu
    Dong, Tiansi
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 227 - 237
  • [49] Weakly-Supervised Concept-based Adversarial Learning for Cross-lingual Word Embeddings
    Wang, Haozhou
    Henderson, James
    Merlo, Paola
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4419 - 4430
  • [50] Personalized Microblog Sentiment Classification via Adversarial Cross-lingual Muti-task Learning
    Wang, Weichao
    Feng, Shi
    Gao, Wei
    Wang, Daling
    Zhang, Yifei
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 338 - 348