CRATOR a CRAwler for TOR: Turning Dark Web Pages into Open Source INTelligence

被引:0
|
作者
De Pascale, Daniel [1 ]
Cascavilla, Giuseppe [2 ]
Tamburri, Damian A. [2 ]
Van Den Heuvel, Willem Jan [1 ]
机构
[1] Tilburg Univ, JADS, sHertogenbosch, Netherlands
[2] Eindhoven Univ Technol, JADS, Eindhoven, Netherlands
来源
关键词
Law Enforcement Agency; TOR; Dark Web; crawler; Open Source Intelligence;
D O I
10.1007/978-3-031-70890-9_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dark web crawling is a complex process that involves specific methodologies and techniques to navigate the Tor network and extract data from hidden services. This study proposes a dark web crawler designed to extract pages handling security protocols, such as CAPTCHAs, efficiently. Our approach uses a combination of seed URL lists, link analysis, and scanning to discover new content. We also incorporate methods for user-agent rotation and proxy usage to maintain anonymity and avoid detection. We evaluate the effectiveness of our crawler using metrics such as coverage, performance, and robustness. Our results demonstrate that our crawler effectively extracts pages handling security protocols while preserving anonymity and avoiding detection. Our proposed dark web crawler can be used for several applications, including threat intelligence, cybersecurity, and online investigations.
引用
收藏
页码:144 / 161
页数:18
相关论文
共 20 条
  • [1] TorBot: Open Source Intelligence Tool for Dark Web
    Narayanan, P. S.
    Ani, R.
    King, Akeem T. L.
    INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019, 2020, 89 : 187 - 195
  • [2] Web Mining for Open Source Intelligence
    Best, Clive
    PROCEEDINGS OF THE 12TH INTERNATIONAL INFORMATION VISUALISATION, 2008, : 321 - 325
  • [3] Assessing the Health of the Dark Web: An Analysis of Dark Web Open Source Software Projects
    Onyango, Samuel
    Steenvoorden, Emilie
    Scholten, Joram
    Jansen, Slinger
    AGILE PROCESSES IN SOFTWARE ENGINEERING AND EXTREME PROGRAMMING - WORKSHOPS (XP 2021), 2021, 426 : 125 - 134
  • [4] A crawler architecture for harvesting the clear, social, and dark web for IoT-related cyber-threat intelligence
    Koloveas, Paris
    Chantzios, Thanasis
    Tryfonopoulos, Christos
    Skiadopoulos, Spiros
    2019 IEEE WORLD CONGRESS ON SERVICES (IEEE SERVICES 2019), 2019, : 3 - 8
  • [5] Using Open Source Intelligence as a Tool for Reliable Web Searching
    Rai B.K.
    Verma R.
    Tiwari S.
    SN Computer Science, 2021, 2 (5)
  • [6] Social Networks and Web Security: Implications on Open Source Intelligence
    Ansari, Fahad
    Akhlaq, Monis
    Rauf, A.
    2013 2ND NATIONAL CONFERENCE ON INFORMATION ASSURANCE (NCIA), 2013, : 79 - 82
  • [7] Creating database-backed library web pages using open source tools
    Jukes, Eric
    PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2008, 42 (03) : 323 - 326
  • [8] Creating database-backed library web pages: Using open source tools
    Nicol, Erica Carlson
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2007, 33 (01): : 151 - 151
  • [9] Creating database-backed library web pages: Using open source tools
    Calvert, Philip
    ELECTRONIC LIBRARY, 2007, 25 (04): : 484 - 485
  • [10] Creating database-backed library web pages: Using open source tools
    Tran, Lan Anh
    ONLINE INFORMATION REVIEW, 2008, 32 (03) : 454 - 455