CRAICLJul 10, 2023

ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The Unknown

arXiv:2307.10195v1116 citationsh-index: 32
Originality Synthesis-oriented
AI Analysis

It evaluates a disruptive tool for digital forensic investigators, but the findings are incremental as they highlight existing risks and suitability limitations.

This paper assesses ChatGPT's impact on digital forensics by testing GPT-4 across use cases like artifact understanding and anomaly detection, concluding it has limited low-risk applications due to issues like data privacy and inaccuracies.

The disruptive application of ChatGPT (GPT-3.5, GPT-4) to a variety of domains has become a topic of much discussion in the scientific community and society at large. Large Language Models (LLMs), e.g., BERT, Bard, Generative Pre-trained Transformers (GPTs), LLaMA, etc., have the ability to take instructions, or prompts, from users and generate answers and solutions based on very large volumes of text-based training data. This paper assesses the impact and potential impact of ChatGPT on the field of digital forensics, specifically looking at its latest pre-trained LLM, GPT-4. A series of experiments are conducted to assess its capability across several digital forensic use cases including artefact understanding, evidence searching, code generation, anomaly detection, incident response, and education. Across these topics, its strengths and risks are outlined and a number of general conclusions are drawn. Overall this paper concludes that while there are some potential low-risk applications of ChatGPT within digital forensics, many are either unsuitable at present, since the evidence would need to be uploaded to the service, or they require sufficient knowledge of the topic being asked of the tool to identify incorrect assumptions, inaccuracies, and mistakes. However, to an appropriately knowledgeable user, it could act as a useful supporting tool in some circumstances.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes