Fake News Detection Using Machine Learning

Project Overview

This project implements a fake news detection system using machine learning algorithms, specifically Naive Bayes and Support Vector Machine (SVM) classifiers. The system processes text data and classifies news articles as either "FAKE" or "REAL".

Project Structure

The project is organized into three main Python files, each handling different aspects of the data lifecycle:

1. Data Preprocessing (`index.py`)

class DataPreprocessor:
    def __init__(self, data_path):
        self.data_path = data_path
        # ...

Key functionalities:

Data loading and cleaning
Feature preparation
Text processing pipeline setup
Train-test split management

2. Naive Bayes Classifier (`naive_bayes_model.py`)

class NaiveBayesClassifier:
    def __init__(self):
        self.pipeline = Pipeline([
            ('vect', CountVectorizer()),
            ('tfidf', TfidfTransformer()),
            ('clf', MultinomialNB())
        ])

Features:

Text vectorization using CountVectorizer
TF-IDF transformation
Model training and evaluation
Single text prediction capability

3. SVM Classifier (`svm_model.py`)

class SVMClassifier:
    def __init__(self, kernel='linear'):
        self.pipeline = Pipeline([
            ('vect', CountVectorizer()),
            ('tfidf', TfidfTransformer()),
            ('clf', SVC(kernel=kernel))
        ])

Features:

Linear kernel SVM implementation
Text preprocessing pipeline
Model evaluation metrics
Prediction functionality

Performance Metrics

Both models are evaluated using:

Accuracy Score
Confusion Matrix
Classification Report
Training and Prediction Time

Sample Results

Naive Bayes Performance:

Accuracy: ~95.58%
Training Time: ~0.03 seconds
Prediction Time: ~0.015 seconds

SVM Performance:

Accuracy: ~93.05%
Training Time: ~51.82 seconds
Prediction Time: ~16.56 seconds

Usage Guide

1. Data Preprocessing

from index import DataPreprocessor

# Initialize preprocessor
preprocessor = DataPreprocessor('path_to_your_dataset.csv')
df = preprocessor.load_data()
X_train, X_test, y_train, y_test = preprocessor.prepare_features()

2. Training Naive Bayes Model

from naive_bayes_model import NaiveBayesClassifier

nb_classifier = NaiveBayesClassifier()
nb_classifier.train(X_train, y_train)
nb_classifier.evaluate(X_test, y_test)

3. Training SVM Model

from svm_model import SVMClassifier

svm_classifier = SVMClassifier()
svm_classifier.train(X_train, y_train)
svm_classifier.evaluate(X_test, y_test)

4. Making Predictions

# For single text prediction
text = "Your news article text here"
prediction = classifier.predict_single(text)

Key Features

Modular and maintainable code structure
Comprehensive evaluation metrics
Easy-to-use interface
Support for both batch and single-text predictions
Performance timing measurements

Dependencies

pandas
scikit-learn
numpy
time

Future Improvements

Add support for more classification algorithms
Implement cross-validation
Add feature importance analysis
Implement model persistence
Add data visualization components

For more detailed information and presentation materials, please refer to the presentation: [Fake News Detection Using ML Presentation]

Project Conclusions

Both models show strong performance in detecting fake news
Naive Bayes offers faster training and prediction times
SVM provides slightly better precision but with longer processing times
The system demonstrates practical applicability for real-world news classification

Note: This project is for educational purposes and should be used as part of a broader fact-checking strategy.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
FAKE NEWS DETECTION USING DATA MINING Modified (2).pptx		FAKE NEWS DETECTION USING DATA MINING Modified (2).pptx
LICENSE		LICENSE
Readme.md		Readme.md
index.py		index.py
naive_bayes_model.py		naive_bayes_model.py
svm_model.py		svm_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fake News Detection Using Machine Learning

Project Overview

Project Structure

1. Data Preprocessing (`index.py`)

2. Naive Bayes Classifier (`naive_bayes_model.py`)

3. SVM Classifier (`svm_model.py`)

Performance Metrics

Sample Results

Naive Bayes Performance:

SVM Performance:

Usage Guide

1. Data Preprocessing

2. Training Naive Bayes Model

3. Training SVM Model

4. Making Predictions

Key Features

Dependencies

Future Improvements

Project Conclusions

About

Uh oh!

Releases

Packages

Languages

License

yellatp/Fake-News-Classifier

Folders and files

Latest commit

History

Repository files navigation

Fake News Detection Using Machine Learning

Project Overview

Project Structure

1. Data Preprocessing (index.py)

2. Naive Bayes Classifier (naive_bayes_model.py)

3. SVM Classifier (svm_model.py)

Performance Metrics

Sample Results

Naive Bayes Performance:

SVM Performance:

Usage Guide

1. Data Preprocessing

2. Training Naive Bayes Model

3. Training SVM Model

4. Making Predictions

Key Features

Dependencies

Future Improvements

Project Conclusions

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Data Preprocessing (`index.py`)

2. Naive Bayes Classifier (`naive_bayes_model.py`)

3. SVM Classifier (`svm_model.py`)

Packages