Arabic Question Generation Project

Overview

This project focuses on automatic question generation in Arabic using deep learning and NLP techniques. It leverages datasets such as Arabic-SQuAD, ARCD, MLQA, and TydiQA to train and evaluate models for generating high-quality, answerable questions from Arabic context passages.

Project Structure

Notebooks:
- arabic-question-generation.ipynb: Main notebook for question generation experiments and evaluation.
- arabic-question-generation-preprocessing.ipynb: Data loading, cleaning, and preprocessing steps.
- arabic-question-generation-train and predict.ipynb: Model training and prediction pipeline.
- try with another data/: Additional experiments with ARCD and TydiQA datasets.
Scripts:
- test script.py: Utility functions for text preprocessing and testing.
Results:
- Generated Questions.pdf, Model test result with bert score and answerability.pdf: Output and evaluation reports.

Key Features

- Preprocessing and normalization of Arabic text (diacritics removal, punctuation and spacing cleanup, unification of Alef variants).
- Question generation with transformer models (T5, mT5).
- Evaluation using BLEU, ROUGE, and BERT-based metrics (BERTScore).
- Support for multiple Arabic QA datasets.
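
The normalization steps listed above can be sketched roughly as follows. This is a minimal illustration, not the code from the notebooks or from test script.py: the function name `normalize_arabic`, the exact character ranges, and the order of the steps are all assumptions.

```python
import re

# Assumed set of diacritics (tashkeel): fathatan through sukun, plus superscript Alef.
DIACRITICS = re.compile(r"[\u064B-\u0652\u0670]")
# Tatweel (kashida) is a purely decorative elongation character.
TATWEEL = "\u0640"
# Alef with madda, hamza above, and hamza below are mapped to bare Alef.
ALEF_VARIANTS = re.compile(r"[\u0622\u0623\u0625]")
# Whitespace that appears before Arabic or Latin punctuation.
SPACE_BEFORE_PUNCT = re.compile(r"\s+([\u060C\u061B\u061F.!:,])")


def normalize_arabic(text: str) -> str:
    """Hypothetical helper: strip diacritics, unify Alef, tidy spacing."""
    text = DIACRITICS.sub("", text)
    text = text.replace(TATWEEL, "")
    text = ALEF_VARIANTS.sub("\u0627", text)
    text = SPACE_BEFORE_PUNCT.sub(r"\1", text)
    # Collapse runs of whitespace into a single space.
    return re.sub(r"\s+", " ", text).strip()


if __name__ == "__main__":
    print(normalize_arabic("أَحْمَدُ   ذَهَبَ إِلَى المَدْرَسَةِ ؟"))
```

Whether Alef unification helps depends on the tokenizer and the model checkpoint, so it is worth applying the same normalization to both training data and inference inputs.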

How to Run

Install Requirements: Install all dependencies using the provided requirements file:
```
pip install -r requirements.txt
```
(The first cells of each notebook contain additional setup details.)
Data Preparation: Place the datasets in the data/ directory.
Run Notebooks: Run the notebooks in this order: preprocessing, then training/prediction, then the main experiments.
Testing: Use test script.py for standalone text preprocessing or model testing.
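
For the data preparation step, a loader along these lines may be a useful starting point. It is a sketch under the assumption that the dataset files follow the SQuAD v1.1 JSON layout (`data` → `paragraphs` → `qas`), which, to my knowledge, is how Arabic-SQuAD and ARCD are distributed. The names `QAExample`, `iter_squad_examples`, and `load_squad_file` are hypothetical and do not come from the notebooks.

```python
import json
from typing import Iterator, List, NamedTuple


class QAExample(NamedTuple):
    context: str
    question: str
    answer: str


def iter_squad_examples(squad: dict) -> Iterator[QAExample]:
    """Yield one (context, question, answer) triple per answerable question."""
    for article in squad.get("data", []):
        for paragraph in article.get("paragraphs", []):
            context = paragraph["context"]
            for qa in paragraph.get("qas", []):
                answers = qa.get("answers", [])
                if not answers:
                    # Skip questions with no gold answer.
                    continue
                # Keep only the first gold answer for simplicity.
                yield QAExample(context, qa["question"], answers[0]["text"])


def load_squad_file(path: str) -> List[QAExample]:
    """Read a SQuAD-style JSON file (the path is supplied by the caller)."""
    with open(path, encoding="utf-8") as f:
        return list(iter_squad_examples(json.load(f)))
```

Each triple can then be turned into a model input (context plus answer) and a target (question) for training.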

Example Usage

See the notebooks for step-by-step code and explanations. Example context and generated questions are provided in the results files.
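
As a rough idea of what inference might look like outside the notebooks, here is a sketch using the Hugging Face `transformers` library. Several things are assumptions: the `answer: ... context: ...` input format, the function names, and the generation settings. The default `google/mt5-small` is the public base checkpoint, which is not fine-tuned for question generation, so it should be replaced with the checkpoint produced by the training notebook.

```python
def build_input(context: str, answer: str) -> str:
    """Assumed prompt format; the notebooks may format inputs differently."""
    return f"answer: {answer} context: {context}"


def generate_question(
    context: str,
    answer: str,
    model_name: str = "google/mt5-small",  # placeholder: use your fine-tuned checkpoint
    max_new_tokens: int = 48,
) -> str:
    # Imported lazily so that build_input can be used without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    inputs = tokenizer(
        build_input(context, answer),
        return_tensors="pt",
        truncation=True,
        max_length=512,
    )
    # Beam search with a small beam; these values are illustrative only.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Loading the model inside the function keeps the sketch short; for batch generation, load the tokenizer and model once and reuse them.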

Mohammed2372/Arabic-Question-Generation
