🛡️ Phishing URL Detection

This project implements a robust machine learning model designed to accurately detect phishing URLs by analyzing a diverse set of URL and website features. The goal is to enhance cybersecurity by identifying potentially malicious URLs before they can cause harm.

📊 Data Description

The input dataset (located in the valid_data/ directory) contains engineered features representing various characteristics of URLs. Each feature is encoded as:

1 → Positive or benign attribute
0 → Neutral or unknown attribute
-1 → Negative or suspicious attribute

These feature vectors are used by the model to classify each URL as:

1 → Phishing
0 → Legitimate

🔁 Project Workflow

Load and preprocess the validated URL feature dataset i.e. valid_data/test.csv
Use a trained machine learning model to assess phishing risk.
Saves predictions to predicted_output/output.csv for review and further analysis.

📈 Insights & Performance Report

✅ Accuracy: ~92%
📌 Precision and Recall: Over 90%
🔍 Key Predictors: SSL certificate status, URL length, domain registration length
📦 Handles a broad spectrum of phishing tactics for generalized robustness

This model serves as an automated solution for phishing URL detection, enabling proactive defense in cybersecurity systems.

🚀 Deployment

The model and application were containerized and deployed using:

Docker: For consistent containerized environment
AWS ECR: To store and manage Docker images
AWS EC2: As the hosting server to run the container in production

🔧 Setup Steps :

📥 Clone the repository :

git clone https://github.com/iam-salma/CrisisAid-news-and-awareness-website.git
cd CrisisAid-news-and-awareness-website

🐍 Make sure you have Python 3 installed. :

Here’s the official link to install Python 3: 🔗 https://www.python.org/downloads/
📦 Create a virtual environment :
```
python -m venv venv
```
⚙️ Activate the virtual environment

On Windows :
```
.\venv\Scripts\activate
```
On macOS/Linux :
```
source venv/bin/activate
```
📌 Install dependencies :
```
pip install -r requirements.txt
```
🗝️ Create .env folder to store secrets :

store your MONGO_DB_URL
🏃To Run the Project:
```
python main.py
```

ENJOY 😊🎉

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github/workflows		.github/workflows
Artifacts/06_10_2025_18_00_48		Artifacts/06_10_2025_18_00_48
Network_Data		Network_Data
__pycache__		__pycache__
data_schema		data_schema
final_model		final_model
my-notes		my-notes
networksecurity		networksecurity
prediction_output		prediction_output
static		static
templates		templates
valid_data		valid_data
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
app.py		app.py
main.py		main.py
push_data.py		push_data.py
requirements.txt		requirements.txt
setup.py		setup.py
test_mongodb.py		test_mongodb.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🛡️ Phishing URL Detection

📊 Data Description

🔁 Project Workflow

📈 Insights & Performance Report

🚀 Deployment

🔧 Setup Steps :

About

Uh oh!

Releases

Packages

Languages

License

iam-salma/phishing-url-detection-ml

Folders and files

Latest commit

History

Repository files navigation

🛡️ Phishing URL Detection

📊 Data Description

🔁 Project Workflow

📈 Insights & Performance Report

🚀 Deployment

🔧 Setup Steps :

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages