🛡️ PhishGuard – Phishing URL Detection System

PhishGuard is a real-time phishing URL detection tool built for users to quickly and easily check the safety of links. Using a combination of machine learning, feature-based analysis, and blacklist lookups, it flags potentially dangerous URLs and provides users with immediate feedback.

Developed as a final project for CS166 – Information Security at San José State University, Spring 2025.

🚀 Features

Phishing Detection using fine-tuned BERT model
URL Analysis based on structure, SSL status, and domain patterns
Google Safe Browsing API integration for blacklist checking
Scan History with timestamped results saved in local database
Modern Frontend built with React and Tailwind CSS
Real-Time Feedback – get instant results after scanning

🏗️ Project Architecture

Frontend:

Built with React and Tailwind CSS
Users can input URLs or scan QR codes
Displays results in real-time with clear safe/unsafe indicators

Backend:

Developed with Flask
Uses TensorFlow to fine-tune and serve a pre-trained BERT model
Integrates with Google Safe Browsing API
Stores scan history using SQLite3

🧠 Machine Learning Model

Originally planned to use Scikit-learn, but due to time constraints and data limitations, we pivoted to a pre-trained BERT model
Fine-tuned using a phishing URL dataset from Kaggle
Model classifies URLs as phishing or legitimate based on learned patterns

📦 Installation

Requires Python 3.8+, Node.js, and pip

1. Clone the repo

git clone https://github.com/your-username/phishguard.git
cd phishguard

2. Set up the backend

cd backend
pip install -r requirements.txt
python app.py

3. Set up the frontend

cd frontend
npm install
npm start

🧪 Sample Usage

Enter a URL in the input field.
Click "Scan".
Get an instant verdict: Safe ✅ or Unsafe ❌.
Visit the Scan History page to review past results.

📌 Known Issues

Some false positives/negatives due to limited dataset
Google Safe Browsing API occasionally misses newer threats
Currently using local SQLite database – not optimized for large-scale usage

📈 Future Improvements

Create a browser extension for real-time link warnings
Train and evaluate custom Scikit-learn models for comparison
Replace SQLite with PostgreSQL or MongoDB for scalability
Dockerize the backend for easier deployment

👨‍💻 Contributors

Han Ngo – Backend, API Integration, Data Processing
Kundyz Serzhankyzy – Frontend Development, UI Design
Uyen Pham – Machine Learning, Data Analysis, Frontend Integration

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
backend		backend
models/__pycache__		models/__pycache__
src		src
u_backend		u_backend
.env		.env
.gitignore		.gitignore
README.md		README.md
eslint.config.js		eslint.config.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
requirements.txt		requirements.txt
tailwind.config.js		tailwind.config.js
test_results.json		test_results.json
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🛡️ PhishGuard – Phishing URL Detection System

🚀 Features

🏗️ Project Architecture

🧠 Machine Learning Model

📦 Installation

1. Clone the repo

2. Set up the backend

3. Set up the frontend

🧪 Sample Usage

📌 Known Issues

📈 Future Improvements

👨‍💻 Contributors

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

uynphm/phishing-detection-tool

Folders and files

Latest commit

History

Repository files navigation

🛡️ PhishGuard – Phishing URL Detection System

🚀 Features

🏗️ Project Architecture

🧠 Machine Learning Model

📦 Installation

1. Clone the repo

2. Set up the backend

3. Set up the frontend

🧪 Sample Usage

📌 Known Issues

📈 Future Improvements

👨‍💻 Contributors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages