Project Name: Phishing E-mail detection and reply system powered by LLMs

Introduction:

This project features the utilization of advanced deep learning methodologies, specifically Bidirectional Encoder Representations from Transformers (BERT), in conjunction with Mistral 7B Instruct - a sophisticated Large Language Model (LLM), to detect and counter phishing emails. Our work encompasses data preprocessing, feature extraction, and model training, followed by the evaluation of the model's performance in classifying emails accurately. Furthermore, we showcase responses generated based on the provided input and prompt template. The results indicate that the BERT-based model achieved 99.3% accuracy in distinguishing between phishing and non-phishing emails.

Features:

Email classification: We created and trained a BERT model capable of classifying emails with 99.3% accuracy.
Reply generation: We use Mistral 7B Instruct to generate replies based on the provided email.

Prerequisites:

Before you begin, ensure you have met the following requirements:

Python: We recommend Python 3.9 because running this project requires specific versions of libraries, which are not available on the latest version of Python.
Ram: Running both models requires at least 8GB of RAM, we recommend 16.

Installation:

Clone the repository: git clone https://github.com/mrmrjing/PhishingEmailClassification.git
Navigate to the project directory: cd PhishingEmailClassification
Install dependencies: pip install -r requirements.txt
Run the AntiPhishingSystem.ipynb Jupiter notebook

Warning Running the program for the first time will trigger a download of the Mistral 7B model, which takes ~5GB of disk space.

Warning v2 Git clone will try to download email_classification_model.h5 and combined_data.csv, which are big files. It might be required to download them separately.

Usage:

This system takes in a file named email.json and returns reply.json, both files have the same structure. See the example:

{
    "sender": "af25@outlook.com",
    "subject": "Threat detection",
    "body": "Our system indicates that there has been suspicious activity detected on your account and we require your immediate attention to verify your account information to prevent any unauthorized access. Please click on the following link to proceed with the verification process: Failure to verify your account within the next 24 hours may result in temporary suspension or permanent closure of your account."
}

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
AntiPhishingSystem.drawio		AntiPhishingSystem.drawio
AntiPhishingSystem.ipynb		AntiPhishingSystem.ipynb
EmailClassification.ipynb		EmailClassification.ipynb
Phishing E-mail detection and reply system powered by LLMs.pdf		Phishing E-mail detection and reply system powered by LLMs.pdf
PhishingEmailClassification.pdf		PhishingEmailClassification.pdf
README.md		README.md
ReplyGeneration.ipynb		ReplyGeneration.ipynb
combined_data.csv		combined_data.csv
email.json		email.json
email_classification_model.h5		email_classification_model.h5
reply.json		reply.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project Name: Phishing E-mail detection and reply system powered by LLMs

Introduction:

Features:

Prerequisites:

Installation:

Usage:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

mrmrjing/PhishingEmailClassification

Folders and files

Latest commit

History

Repository files navigation

Project Name: Phishing E-mail detection and reply system powered by LLMs

Introduction:

Features:

Prerequisites:

Installation:

Usage:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages