Identifying Persons of Interest in Digital Forensics Using NLP-Based AI

被引:0
|
作者
Adkins, Jonathan [1 ,2 ]
Al Bataineh, Ali [2 ,3 ]
Khalaf, Majd [2 ,3 ]
机构
[1] Norwich Univ, Senator Patrick Leahy Sch Cybersecur & Adv Comp, Northfield, VT 05663 USA
[2] Norwich Univ, Artificial Intelligence Ctr, Northfield, VT 05663 USA
[3] Norwich Univ, Dept Elect & Comp Engn, Northfield, VT 05663 USA
关键词
Artificial Intelligence (AI); criminology; digital forensics; Natural Language Processing (NLP); sentiment analysis; topic modeling; word vector cosine distance;
D O I
10.3390/fi16110426
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The field of digital forensics relies on expertise from multiple domains, including computer science, criminology, and law. It also relies on different toolsets and an analyst's expertise to parse enormous amounts of user-generated data to find clues that help crack a case. This process of investigative analysis is often done manually. Artificial Intelligence (AI) can provide practical solutions to efficiently mine enormous amounts of data to find useful patterns that can be leveraged to investigate crimes. Natural Language Processing (NLP) is a subdomain of research under AI that deals with problems involving unstructured data, specifically language. The domain of NLP includes several tools to parse text, including topic modeling, pairwise correlation, word vector cosine distance measurement, and sentiment analysis. In this research, we propose a digital forensic investigative technique that uses an ensemble of NLP tools to identify a person of interest list based on a corpus of text. Our proposed method serves as a type of human feature reduction, where a total pool of suspects is filtered down to a short list of candidates who possess a higher correlation with the crime being investigated.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Test case information extraction from requirements specifications using NLP-based unified boilerplate approach
    Lim, Jin Wei
    Chiew, Thiam Kian
    Su, Moon Ting
    Ong, Simying
    Subramaniam, Hema
    Mustafa, Mumtaz Begum
    Chiam, Yin Kia
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 211
  • [22] Calculation of embodied GHG emissions in early building design stages using BIM and NLP-based semantic model healing
    Forth, Kasimir
    Abualdenien, Jimmy
    Borrmann, Andre
    ENERGY AND BUILDINGS, 2023, 284
  • [23] Discovering Message Templates on Large Scale Bitcoin Abuse Reports Using a Two-Fold NLP-Based Clustering Method
    Choi, Jinho
    Lee, Taehwa
    Kim, Kwanwoo
    Seo, Minjae
    Cui, Jian
    Shin, Seungwon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (04) : 824 - 827
  • [24] An AI-based Security System using Computer Vision and NLP Conversion System
    Karim, Md Rajaul
    Chowdhury, Punam
    Rahman, Latifur
    Kazary, Sumaya
    2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
  • [25] Eliciting file relationships using metadata based associations for digital forensics
    Sriram Raghavan
    S. V. Raghavan
    CSI Transactions on ICT, 2014, 2 (1) : 49 - 64
  • [26] A blockchain based private framework for facilitating digital forensics using IoT
    Suri, Bhawna
    Taneja, Shweta
    Sharma, Siddharth
    Verma, Vishwajeet
    Parashar, Divyanshi
    Sikka, Parth
    Arora, Monika
    Ahmad, Sayed Sayeed
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2023, 26 (05): : 1249 - 1263
  • [27] NLP-based clinical text classification and sentiment analyses of complex medical transcripts using transformer model and machine learning classifiers
    Pratiyush Guleria
    Neural Computing and Applications, 2025, 37 (1) : 341 - 366
  • [28] The Impact of Subordination Type and Finiteness on Second Language Development in Timed Impromptu Writing: An NLP-Based Analysis Using the Subordination Sophistication Analyzer
    Hwang, Haerim
    WRITTEN COMMUNICATION, 2025, 42 (01) : 193 - 222
  • [29] Identifying Tourist Places of Interest Based on Digital Imprints: Towards a Sustainable Smart City
    Encalada, Luis
    Boavida-Portugal, Ines
    Ferreira, Carlos Cardoso
    Rocha, Jorge
    SUSTAINABILITY, 2017, 9 (12)
  • [30] An Agile Digital Platform to Support Population Health-A Case Study of a Digital Platform to Support Patients with Delirium Using IoT, NLP, and AI
    Tanniru, Mohan R.
    Agarwal, Nimit
    Sokan, Amanda
    Hariri, Salim
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (11)