This project leverages a BERT-based deep learning model to classify text articles as either AI-generated or human-written. Using PyTorch and the Hugging Face transformers
library, the project implements fine-tuning of a pre-trained BERT model for binary classification.
The primary goal of this project is to classify text data into two categories:
- AI-generated
- Human-written
The workflow includes:
- Tokenizing text data using a BERT tokenizer.
- Defining a PyTorch dataset and data loader for text and labels.
- Building and training a custom BERT-based classifier.
- Evaluating the model using stratified cross-validation.
- Saving the trained model for deployment or further analysis.
- Pre-trained BERT Model: Fine-tunes `bert-base-uncased` for text classification.
- Custom Dataset Class: Implements a PyTorch-compatible dataset class for efficient data handling.
- Cross-Validation: Uses Stratified K-Fold cross-validation to ensure robust evaluation.
- Evaluation Metrics: Calculates accuracy, F1 score, precision, and recall.
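Stratified K-fold evaluation with these metrics could be wired up as in the sketch below. The fold count and `train_and_predict` are placeholders (assumptions); in the real project that function would fine-tune BERT on the training fold and predict on the validation fold:

```python
# Sketch of stratified K-fold evaluation with the four metrics above.
# train_and_predict is a stand-in for the actual fine-tuning loop.
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score

def train_and_predict(train_idx, val_idx, texts, labels):
    # Placeholder: a real run would train on texts[train_idx] and
    # return model predictions for texts[val_idx].
    return labels[val_idx]

texts = np.array(["example text"] * 10)
labels = np.array([0, 1] * 5)  # balanced dummy labels

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
scores = []
for train_idx, val_idx in skf.split(texts, labels):
    preds = train_and_predict(train_idx, val_idx, texts, labels)
    y_true = labels[val_idx]
    scores.append({
        "accuracy": accuracy_score(y_true, preds),
        "f1": f1_score(y_true, preds),
        "precision": precision_score(y_true, preds),
        "recall": recall_score(y_true, preds),
    })

mean_acc = np.mean([s["accuracy"] for s in scores])
```

Stratification keeps the AI/human class ratio the same in every fold, so per-fold metrics are comparable even on small datasets.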
- Python 3.7+
- Libraries:
  - `torch`
  - `transformers`
  - `pandas`
  - `numpy`
  - `scikit-learn`
- Experiment with advanced pre-trained models like RoBERTa or DeBERTa.
- Handle class imbalance with techniques like oversampling or weighted loss functions.
- Extend the model to support multiclass classification for other types of text.
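Of the ideas above, a weighted loss is the smallest change to try first. A minimal sketch, assuming inverse-frequency class weights (the weighting scheme is an illustrative choice, not specified by the project):

```python
# Sketch of weighted cross-entropy for class imbalance.
# Inverse-frequency weighting is an illustrative assumption.
import torch
import torch.nn as nn

labels = torch.tensor([0, 0, 0, 0, 1])      # imbalanced: 4 human, 1 AI
counts = torch.bincount(labels, minlength=2).float()
weights = counts.sum() / (2 * counts)       # rarer class gets a larger weight

loss_fn = nn.CrossEntropyLoss(weight=weights)
logits = torch.zeros(len(labels), 2)        # dummy classifier outputs
loss = loss_fn(logits, labels)
```

With these weights, misclassifying the rare class costs four times as much as misclassifying the common one, which counteracts the model's tendency to predict the majority class.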