CROSS-LINGUAL CYBERSECURITY ANALYTICS IN THE INTERNATIONAL DARK WEB WITH ADVERSARIAL DEEP REPRESENTATION LEARNING

被引:21
|
作者
Ebrahimi, Mohammadreza [1 ]
Chai, Yidong [2 ]
Samtani, Sagar [3 ]
Chen, Hsinchun [4 ]
机构
[1] Univ S Florida, Sch Informat Syst & Management, Tampa, FL 33620 USA
[2] Hefei Univ Technol, Sch Management, Anhua 230009, Peoples R China
[3] Indiana Univ, Dept Operat & Decis Technol, Bloomington, IN 47405 USA
[4] Univ Arizona, Dept Management Informat Syst, Tucson, AZ 85721 USA
基金
美国国家科学基金会;
关键词
Cybersecurity analytics; dark web; automated hacker asset detection; cross-lingual knowledge transfer; adversarial learning; computational design science;
D O I
10.25300/MISQ/2022/16618
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
International dark web platforms operating within multiple geopolitical regions and languages host a myriad of hacker assets such as malware, hacking tools, hacking tutorials, and malicious source code. Cybersecurity analytics organizations employ machine learning models trained on human-labeled data to automatically detect these assets and bolster their situational awareness. However, the lack of human-labeled training data is prohibitive when analyzing foreign-language dark web content. In this research note, we adopt the computational design science paradigm to develop a novel IT artifact for cross-lingual hacker asset detection (CLHAD). CLHAD automatically leverages the knowledge learned from English content to detect hacker assets in non-English dark web platforms. CLHAD encompasses a novel Adversarial deep representation learning (ADREL) method, which generates multilingual text representations using generative adversarial networks (GANs). Drawing upon the state of the art in cross-lingual knowledge transfer, ADREL is a novel approach to automatically extract transferable text representations and facilitate the analysis of multilingual content. We evaluate CLHAD on Russian, French, and Italian dark web platforms and demonstrate its practical utility in hacker asset profiling, and conduct a proof-of-concept case study. Our analysis suggests that cybersecurity managers may benefit more from focusing on Russian to identify sophisticated hacking assets. In contrast, financial hacker assets are scattered among several dominant dark web languages. Managerial insights for security managers are discussed at operational and strategic levels.
引用
收藏
页码:1209 / 1226
页数:18
相关论文
共 50 条
  • [21] Cross-Lingual Entity Linking for Web Tables
    Luo, Xusheng
    Luo, Kangqi
    Chen, Xianyang
    Zhu, Kenny Q.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 362 - 369
  • [22] Searching the Web for Cross-lingual Parallel Data
    El-Kishky, Ahmed
    Koehn, Philipp
    Schwenk, Holger
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2417 - 2420
  • [23] Cross-Lingual Learning with Distributed Representations
    Pikuliak, Matus
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8032 - 8033
  • [24] Deep Learning for Mandarin-Tibetan Cross-Lingual Speech Synthesis
    Zhang, Weizhao
    Yang, Hongwu
    Bu, Xiaolong
    Wang, Lili
    IEEE ACCESS, 2019, 7 : 167884 - 167894
  • [25] Deep Multilabel Multilingual Document Learning for Cross-Lingual Document Retrieval
    Feng, Kai
    Huang, Lan
    Xu, Hao
    Wang, Kangping
    Wei, Wei
    Zhang, Rui
    ENTROPY, 2022, 24 (07)
  • [26] A Deep Transfer Learning Method for Cross-Lingual Natural Language Inference
    Bandyopadhyay, Dibyanayan
    De, Arkadipta
    Gain, Baban
    Saikh, Tanik
    Ekbal, Asif
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3084 - 3092
  • [27] Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER
    Keung, Phillip
    Lu, Yichao
    Bhardwaj, Vikas
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1355 - 1360
  • [28] CISA: Chinese Information Structure Analysis for Scientific Writing with Cross-lingual Adversarial Learning
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5832 - 5834
  • [29] Adversarial Cross-Lingual Transfer Learning for Slot Tagging of Low-Resource Languages
    He, Keqing
    Yan, Yuanmeng
    Xu, Weiran
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [30] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Ghanbari, Elham
    Shakery, Azadeh
    APPLIED INTELLIGENCE, 2022, 52 (03) : 3156 - 3174