Dark Web Text Classification by Learning through SVM Optimization

被引:4
|
作者
Murty, Ch A. S. [1 ]
Rughani, Parag H. [2 ]
机构
[1] Ctr Dev Adv Comp C DAC, Hyderabad, India
[2] Natl Forens Sci Univ, Digital Forens, Gandhinagar, Gujarat, India
关键词
Darkweb; SVM; classification; Darkweb content classification;
D O I
10.12720/jait.13.6.624-631
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Darkweb has become the largest repository of unauthorized information compared to the surface web because of its benefit of anonymity and privacy. With these anonymity and privacy features, the dark web is also becoming a safe place for illegal activities and hence an increase of dark web usage and size of the onion-based URLs. With the increasing use of dark web users, it is the need for cybercrime investigators across the globe to classify dark web data for understanding various illegal activities to control and categorize URLs hosting such illicit activities with feature engineering. In this research, the Support Vector Machines (SVM) algorithm is used to understand the algorithm's efficiency for a proposed model to classify dark web data with optimization techniques. Text-based keywords from more than 1800 websites were collected by applying feature engineering techniques and the system's performance was evaluated with the SVM approach. The results are very encouraging as the Precision, Recall, and F-measure values are 0.83, 0.90 & 0.96 achieved with a dataset of 1800 URLs.
引用
收藏
页码:624 / 631
页数:8
相关论文
共 50 条
  • [1] Research on Web Text Classification Algorithm Based on Improved CNN and SVM
    Wang, Zhiquan
    Qu, Zhiyi
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1958 - 1961
  • [2] An improved web text classification algorithm based on SVM-KNN
    Cao, Jianfang
    Chen, Junjie
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING, PTS 1-3, 2013, 278-280 : 1305 - 1308
  • [3] Improving SVM text classification performance through threshold adjustment
    Shanahan, JG
    Roma, N
    MACHINE LEARNING: ECML 2003, 2003, 2837 : 361 - 372
  • [4] A Novel Active Learning Method Using SVM for Text Classification
    Mohamed Goudjil
    Mouloud Koudil
    Mouldi Bedda
    Noureddine Ghoggali
    Machine Intelligence Research, 2018, (03) : 290 - 298
  • [5] A Novel Active Learning Method Using SVM for Text Classification
    Goudjil M.
    Koudil M.
    Bedda M.
    Ghoggali N.
    International Journal of Automation and Computing, 2018, 15 (03) : 290 - 298
  • [6] Web Services Ontology Population through Text Classification
    Reyes-Ortiz, Jose A.
    Bravo, Maricela
    Pablo, Hugo
    PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 491 - 495
  • [7] Curriculum learning and evolutionary optimization into deep learning for text classification
    Alfredo Arturo Elías-Miranda
    Daniel Vallejo-Aldana
    Fernando Sánchez-Vega
    A. Pastor López-Monroy
    Alejandro Rosales-Pérez
    Victor Muñiz-Sanchez
    Neural Computing and Applications, 2023, 35 : 21129 - 21164
  • [8] Curriculum learning and evolutionary optimization into deep learning for text classification
    Elias-Miranda, Alfredo Arturo
    Vallejo-Aldana, Daniel
    Sanchez-Vega, Fernando
    Lopez-Monroy, A. Pastor
    Rosales-Perez, Alejandro
    Muniz-Sanchez, Victor
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 21129 - 21164
  • [9] Learning Semantic Text Features for Web Text-Aided Image Classification
    Wang, Dongzhe
    Mao, Kezhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 2985 - 2996
  • [10] Research on Text Classification Method of LDA-SVM Based on PSO Optimization
    Wang, Qing
    Peng, Rongqun
    Wang, Jiaqiang
    Xie, Yushu
    Zhou, Yangfan
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1974 - 1978