🧠 Disaster Tweet Classification – ML & NLP Comparison + Streamlit Deployment

This project classifies tweets as disaster-related or not using both traditional ML models and a fine-tuned DistilBERT transformer model. It was originally developed for the Kaggle NLP Disaster Tweets Competition and includes a deployed Streamlit app for real-time prediction.


🚀 Project Highlights

  • ✅ Built and evaluated 3 models: Naive Bayes, Logistic Regression, and DistilBERT
  • 🧪 Preprocessed tweet data using regex cleaning and keyword merging
  • 📊 Compared performance across models (F1 score was the main metric)
  • 🖥️ Deployed the final classifier using Streamlit for interactive use
  • 🥇 DistilBERT performed best on the validation set and Kaggle leaderboard

📦 Models Compared

| Model | Validation F1 | Kaggle Score | Notes |
| --- | --- | --- | --- |
| Naive Bayes | ~0.79 | ~0.79 | Simple, fast baseline using CountVectorizer. Surprisingly well calibrated (Brier: 0.128) with high AUC (0.90), indicating strong discrimination and decent probability estimates. |
| Logistic Regression | ~0.78 | ~0.79 | Linear baseline with similar classification performance. Slightly lower AUC (0.89) and worse calibration (Brier: 0.141), highlighting that theoretical expectations don’t always match real-world behavior. |
| DistilBERT | 0.79+ | 0.83 | Fine-tuned transformer that outperforms both baselines on classification (F1, AUC: 0.90) and probability calibration (Brier: 0.108). More robust and generalizable on unseen data. |

📝 Calibration Insight:
While Logistic Regression is typically assumed to offer better-calibrated probabilities than Naive Bayes, this dataset showed the opposite: Naive Bayes achieved a lower Brier score (0.128 vs. 0.141), suggesting more reliable probability estimates in this specific case.
DistilBERT, despite being more complex, provided both the best classification performance and the best-calibrated predictions — with the lowest Brier score (0.108). This reinforces the importance of measuring calibration empirically, especially when using model probabilities for downstream decisions.
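
A minimal sketch of how the traditional baselines and the reported metrics (F1, AUC, Brier score) can be reproduced with scikit-learn. The file name, column names, and train/validation split below are illustrative assumptions, not the repository's exact code.

```python
# Sketch only: assumed file/column names from the Kaggle competition data.
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.model_selection import train_test_split
from sklearn.metrics import f1_score, roc_auc_score, brier_score_loss

df = pd.read_csv("train.csv")  # assumed: Kaggle 'text' and 'target' columns
X_train, X_val, y_train, y_val = train_test_split(
    df["text"], df["target"], test_size=0.2, random_state=42, stratify=df["target"]
)

# Bag-of-words features + Naive Bayes baseline.
vectorizer = CountVectorizer()
X_train_vec = vectorizer.fit_transform(X_train)
X_val_vec = vectorizer.transform(X_val)

nb = MultinomialNB()
nb.fit(X_train_vec, y_train)

# Discrimination (F1, AUC) and calibration (Brier score) on the validation set.
val_probs = nb.predict_proba(X_val_vec)[:, 1]
val_preds = (val_probs >= 0.5).astype(int)
print("F1:   ", f1_score(y_val, val_preds))
print("AUC:  ", roc_auc_score(y_val, val_probs))
print("Brier:", brier_score_loss(y_val, val_probs))
```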


💡 What can be discussed:

  • Trade-off between simplicity and performance: NB and LR are lightweight and easy to train; DistilBERT is heavier but significantly better.
  • Discrimination vs. calibration: AUC shows how well a model separates classes; Brier score shows how accurate its predicted probabilities are (see the reliability-diagram sketch after this list).
  • Model reliability: Even simple models like NB can outperform expectations on calibration — theory should guide, but data should decide.
  • Real-world readiness: If calibrated probabilities are needed (e.g., for risk thresholds or cost-sensitive applications), DistilBERT is the most reliable; NB may be a viable fallback if resources are constrained.
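
A short sketch of how the discrimination-vs-calibration point can be inspected visually with a reliability diagram, assuming matplotlib is available; `y_val` and `val_probs` are the illustrative names from the baseline sketch above.

```python
# Reliability diagram: bin predicted probabilities and compare them to the
# observed fraction of positives (perfect calibration lies on the y = x line).
import matplotlib.pyplot as plt
from sklearn.calibration import calibration_curve

prob_true, prob_pred = calibration_curve(y_val, val_probs, n_bins=10)

plt.plot(prob_pred, prob_true, marker="o", label="Naive Bayes")
plt.plot([0, 1], [0, 1], linestyle="--", label="Perfectly calibrated")
plt.xlabel("Mean predicted probability")
plt.ylabel("Fraction of positives")
plt.legend()
plt.show()
```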

🧹 Preprocessing

  • Removed:
    • User mentions (@user)
    • URLs and HTML tags
    • Special characters and extra whitespace
  • Lowercased text
  • Merged keyword and text fields during training (see the cleaning sketch after this list)
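
A minimal sketch of the cleaning steps described above, using Python's `re` module; the exact patterns and column names in the repository may differ.

```python
import re

def clean_tweet(text: str) -> str:
    """Illustrative cleaning: mentions, URLs, HTML tags, special chars, whitespace."""
    text = re.sub(r"@\w+", " ", text)                    # user mentions (@user)
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)   # URLs
    text = re.sub(r"<[^>]+>", " ", text)                 # HTML tags
    text = re.sub(r"[^a-zA-Z0-9\s]", " ", text)          # special characters
    text = re.sub(r"\s+", " ", text).strip()             # extra whitespace
    return text.lower()                                  # lowercase

# Merge the keyword and text fields before vectorising (assumed column names):
# df["combined"] = (df["keyword"].fillna("") + " " + df["text"]).apply(clean_tweet)
```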

🧠 DistilBERT Fine-Tuning

  • Base model: distilbert-base-uncased
  • Trained using:
    • AdamW optimizer
    • Learning rate scheduler
    • Early stopping (based on validation F1)
  • Used Hugging Face transformers + PyTorch (see the fine-tuning sketch after this list)
  • Best validation F1: 0.7929
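
A condensed sketch of the fine-tuning loop described above, using Hugging Face `transformers` and PyTorch. The learning rate, epoch count, patience, and the `train_loader`, `val_loader`, and `evaluate_f1` names are illustrative assumptions, not the repository's exact settings.

```python
import torch
from torch.optim import AdamW
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          get_linear_schedule_with_warmup)

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)

num_epochs = 3                                       # assumed value
optimizer = AdamW(model.parameters(), lr=2e-5)       # AdamW optimizer
scheduler = get_linear_schedule_with_warmup(         # learning rate scheduler
    optimizer, num_warmup_steps=0,
    num_training_steps=len(train_loader) * num_epochs,  # train_loader: assumed DataLoader
)

best_f1, patience, bad_epochs = 0.0, 2, 0
for epoch in range(num_epochs):
    model.train()
    for batch in train_loader:                       # dicts with input_ids, attention_mask, labels
        outputs = model(**batch)
        outputs.loss.backward()
        optimizer.step()
        scheduler.step()
        optimizer.zero_grad()

    val_f1 = evaluate_f1(model, val_loader)          # assumed helper returning validation F1
    if val_f1 > best_f1:
        best_f1, bad_epochs = val_f1, 0
        model.save_pretrained("best_model")          # keep the best checkpoint
    else:
        bad_epochs += 1
        if bad_epochs >= patience:                   # early stopping on validation F1
            break
```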

🌐 Live Demo – Streamlit App

Try out the classifier in your browser:
👉 Disaster Tweet Classifier (Streamlit App)

Choose a model, enter a tweet, and classify it in real time (see the app sketch after the model list below).

🔧 Models in the App:

  • ✅ Naive Bayes
  • ✅ Logistic Regression
  • ✅ DistilBERT
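
A minimal sketch of how such an app can be wired up in Streamlit. The artifact names (`vectorizer.joblib`, `nb_model.joblib`, `logreg_model.joblib`) are assumptions, and the DistilBERT path is omitted for brevity; this is not the deployed app's exact code.

```python
import joblib
import streamlit as st

st.title("Disaster Tweet Classifier")

# Assumed artifacts: a fitted CountVectorizer and fitted scikit-learn classifiers.
vectorizer = joblib.load("vectorizer.joblib")
models = {
    "Naive Bayes": joblib.load("nb_model.joblib"),
    "Logistic Regression": joblib.load("logreg_model.joblib"),
}

choice = st.selectbox("Choose a model", list(models.keys()))
tweet = st.text_area("Enter a tweet")

if st.button("Classify") and tweet:
    features = vectorizer.transform([tweet])
    prob = models[choice].predict_proba(features)[0, 1]
    label = "Disaster-related" if prob >= 0.5 else "Not disaster-related"
    st.write(f"{label} (probability: {prob:.2f})")
```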

🧪 Next Steps

  • 🧹 Improve text preprocessing pipeline using nltk or spacy
  • 🔄 Add additional features like location or tweet metadata
  • 🚀 Optimize DistilBERT loading time with quantization or other model compression techniques (see the quantization sketch below)
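
One possible approach to the quantization item above is PyTorch dynamic quantization, sketched below. This is a suggestion for future work, not something already in the repository; the checkpoint path is the illustrative one from the fine-tuning sketch.

```python
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("best_model")  # assumed path

# Dynamic quantization converts Linear-layer weights to int8, shrinking the
# model on disk and typically speeding up CPU inference.
quantized_model = torch.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)

torch.save(quantized_model.state_dict(), "distilbert_quantized.pt")
```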

📚 Tools & Libraries

  • Streamlit for UI deployment
  • scikit-learn for traditional ML models
  • Hugging Face Transformers for DistilBERT
  • PyTorch for fine-tuning
  • Pandas, Joblib, TQDM, Regex for preprocessing and utilities

Important Notice

The code in this repository is proprietary and protected by copyright law. Unauthorized copying, distribution, or use of this code is strictly prohibited. By accessing this repository, you agree to the following terms:

  • Do Not Copy: You are not permitted to copy any part of this code for any purpose.
  • Do Not Distribute: You are not permitted to distribute this code, in whole or in part, to any third party.
  • Do Not Use: You are not permitted to use this code, in whole or in part, for any purpose without explicit permission from the owner.

If you have any questions or require permission, please contact the repository owner.

Thank you for your cooperation.
