This project focuses on analyzing and correcting grammatical errors in text using various Natural Language Processing (NLP) techniques and machine learning models.
The project involves the following steps:
- Data Loading and Preprocessing: Loading a dataset of ungrammatical and corrected sentences, cleaning the data, and preparing it for analysis and model training.
- Exploratory Data Analysis (EDA): Performing EDA to understand the characteristics of the data, including error type frequencies, sentence length distributions, and common grammatical patterns.
- Model Training and Evaluation: Training different grammatical error correction (GEC) models, such as T5-based models, and evaluating their performance using metrics like BLEU.
- Error Analysis and Visualization: Analyzing common error patterns and visualizing them using techniques like word clouds and frequency distributions.
- Grammar Correction with Happy Transformer: Applying the trained model to new text via the Happy Transformer library.
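The loading, cleaning, and EDA steps above can be sketched as follows. This is a minimal illustration, not the project's actual notebook code: the column names ("Ungrammatical Statement", "Standard English") are assumptions about the CSV layout, and a small in-memory sample stands in for the real file.

```python
import pandas as pd

# Small in-memory stand-in for "Grammar Correction.csv"; the real file
# would be read with pd.read_csv("Grammar Correction.csv").
# Column names here are assumptions about the dataset's layout.
df = pd.DataFrame({
    "Ungrammatical Statement": [
        "She go to school every day.",
        "They was happy about the result.",
    ],
    "Standard English": [
        "She goes to school every day.",
        "They were happy about the result.",
    ],
})

# Basic cleaning: drop missing values and duplicate pairs.
df = df.dropna().drop_duplicates()

# Simple EDA: token counts per sentence, the basis of a
# sentence-length distribution plot.
df["src_len"] = df["Ungrammatical Statement"].str.split().str.len()
df["tgt_len"] = df["Standard English"].str.split().str.len()

print(df[["src_len", "tgt_len"]].describe())
```

From here, `matplotlib` or `seaborn` histograms over `src_len`/`tgt_len` give the length distributions mentioned above.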
The project uses the "Grammar Correction.csv" dataset, which contains pairs of ungrammatical and corrected sentences.
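Each pair in the dataset can be compared with an edit distance to quantify how far the ungrammatical sentence is from its correction. The project lists the Levenshtein library for this; the stdlib-only sketch below computes the same character-level metric, so no dependency is needed to follow the idea.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits (insertions,
    deletions, substitutions) needed to turn a into b."""
    # Dynamic programming over one rolling row.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]

# Distance between an ungrammatical sentence and its correction:
# "go" -> "goes" is two inserted characters.
print(levenshtein("She go to school.", "She goes to school."))
```

Aggregating this distance over all pairs highlights which sentences need heavy rewriting versus a one-character fix.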
The following libraries are used in this project:
- pandas
- nltk
- matplotlib
- seaborn
- textstat
- Levenshtein
- textblob
- wordcloud
- transformers
- happytransformer
- optuna
- evaluate
- Clone the repository:
  git clone <repository_url>
- Install the required libraries:
  pip install -r requirements.txt
- Run the Jupyter Notebook:
  jupyter notebook Grammatical_Error_Correction.ipynb
The project shows promising results in grammatical error correction, with the best-performing model reaching a high BLEU score on the test set.
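BLEU scores a model's output against the reference correction by n-gram overlap. A minimal check using NLTK (already in the dependency list); the sentences here are illustrative, not taken from the dataset.

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

reference = "She goes to school every day .".split()
hypothesis = "She goes to school every day .".split()  # perfect correction
partial = "She go to school every day .".split()       # one word wrong

# Smoothing avoids zero scores when a higher-order n-gram has no match.
smooth = SmoothingFunction().method1
perfect_score = sentence_bleu([reference], hypothesis, smoothing_function=smooth)
partial_score = sentence_bleu([reference], partial, smoothing_function=smooth)

print(f"perfect: {perfect_score:.3f}, partial: {partial_score:.3f}")
```

An exact match scores 1.0; the single wrong word lowers every n-gram precision, so the partial hypothesis scores strictly less.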
- Explore more advanced GEC models and techniques.
- Fine-tune models on larger and more diverse datasets.
- Develop a user-friendly interface for grammar correction.
Contributions are welcome! Please feel free to open issues or pull requests.