JuniorGPT is a lightweight implementation of the GPT (Generative Pre-trained Transformer) architecture focused on generating text in the style of Shakespeare. This repository contains code to train a model on a subset of Shakespeare's works and then generate text resembling the Bard's style.
JuniorGPT is designed to:
- Load Shakespeare text data.
- Tokenize the data and create a vocabulary.
- Define and initialize a GPT-like architecture.
- Train the model.
- Generate Shakespearean-style text.
Requirements:
- Python 3.x
- PyTorch (a recent version)
- CUDA (optional, for GPU-accelerated training)
Usage:
- Place your Shakespeare dataset, named input.txt, in the root directory.
- Run the script; it will train the model and then generate text samples.
- Find the generated Shakespearean-style text in the output.txt file.
The script itself is organized into the following stages.

Initial Setup: hyperparameters, the GPU check, the manual seed, and other configuration details.
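A sketch of what this stage looks like; the specific values below are placeholders rather than the script's actual settings:

```python
import torch

# Placeholder hyperparameters; the script's actual values may differ.
batch_size = 32      # sequences per training batch
block_size = 128     # maximum context length
max_iters = 5000     # total training steps
eval_interval = 500  # how often to estimate the loss
learning_rate = 3e-4

torch.manual_seed(1337)  # placeholder seed, for reproducibility
device = 'cuda' if torch.cuda.is_available() else 'cpu'  # the GPU check
```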
Data Preparation:
- Loading the Shakespeare dataset.
- Tokenizing the text at the character level and building a vocabulary of its unique characters (sketched below).
- Splitting the data into training and validation sets.
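Tokenization is character-level: the vocabulary is simply the set of unique characters in the corpus, and each character maps to an integer id. A minimal sketch of this stage, assuming input.txt sits in the working directory and a 90/10 train/validation split (the script's actual ratio may differ):

```python
import torch

# Read the raw corpus.
with open('input.txt', 'r', encoding='utf-8') as f:
    text = f.read()

# The vocabulary is the sorted set of unique characters.
chars = sorted(set(text))
vocab_size = len(chars)

# Character <-> integer lookup tables.
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for i, ch in enumerate(chars)}
encode = lambda s: [stoi[c] for c in s]             # string -> list of ids
decode = lambda ids: ''.join(itos[i] for i in ids)  # list of ids -> string

# Encode the whole corpus and split it (the 90/10 ratio is an assumption).
data = torch.tensor(encode(text), dtype=torch.long)
n = int(0.9 * len(data))
train_data, val_data = data[:n], data[n:]
```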
Model Architecture:
- Definition of the sub-modules (MultiHeadAttention, FeedFoward, and Block) used by the main GPT model.
- The main model, GPTLanguageModel, defined in terms of these sub-modules; one block is sketched below.
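For orientation, here is a minimal sketch of how such sub-modules typically compose into a transformer block, using the class names listed above (including FeedFoward, spelled as in the script); the exact layer sizes, dropout placement, and other details of the actual model may differ:

```python
import torch
import torch.nn as nn
from torch.nn import functional as F

class Head(nn.Module):
    """One head of causal self-attention."""
    def __init__(self, n_embd, head_size, block_size, dropout):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        # Lower-triangular mask so each position attends only to the past.
        self.register_buffer('tril', torch.tril(torch.ones(block_size, block_size)))
        self.dropout = nn.Dropout(dropout)

    def forward(self, x):
        B, T, C = x.shape
        k, q, v = self.key(x), self.query(x), self.value(x)
        wei = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5  # scaled dot-product
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float('-inf'))
        wei = self.dropout(F.softmax(wei, dim=-1))
        return wei @ v

class MultiHeadAttention(nn.Module):
    """Several attention heads in parallel, projected back to n_embd."""
    def __init__(self, n_embd, n_head, block_size, dropout):
        super().__init__()
        head_size = n_embd // n_head
        self.heads = nn.ModuleList(
            [Head(n_embd, head_size, block_size, dropout) for _ in range(n_head)])
        self.proj = nn.Linear(n_embd, n_embd)
        self.dropout = nn.Dropout(dropout)

    def forward(self, x):
        out = torch.cat([h(x) for h in self.heads], dim=-1)
        return self.dropout(self.proj(out))

class FeedFoward(nn.Module):
    """Position-wise feed-forward network with a 4x expansion."""
    def __init__(self, n_embd, dropout):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd), nn.ReLU(),
            nn.Linear(4 * n_embd, n_embd), nn.Dropout(dropout))

    def forward(self, x):
        return self.net(x)

class Block(nn.Module):
    """Transformer block: attention then MLP, each with a residual connection."""
    def __init__(self, n_embd, n_head, block_size, dropout):
        super().__init__()
        self.sa = MultiHeadAttention(n_embd, n_head, block_size, dropout)
        self.ffwd = FeedFoward(n_embd, dropout)
        self.ln1 = nn.LayerNorm(n_embd)
        self.ln2 = nn.LayerNorm(n_embd)

    def forward(self, x):
        x = x + self.sa(self.ln1(x))    # pre-norm residual attention
        x = x + self.ffwd(self.ln2(x))  # pre-norm residual MLP
        return x
```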
Training Loop:
- The model is trained using the AdamW optimizer.
- Training and validation loss are estimated every eval_interval steps; a condensed sketch follows.
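A condensed sketch of the loop, reusing the placeholders from the setup sketch and the train/val splits from the data-preparation sketch. It assumes the model returns (logits, loss) when targets are supplied, and an estimate_loss helper that averages the loss over a few random batches per split; both are assumptions based on similar character-level GPT scripts:

```python
import torch

model = GPTLanguageModel().to(device)  # the model described above
optimizer = torch.optim.AdamW(model.parameters(), lr=learning_rate)

def get_batch(split):
    """Sample a random batch of (input, target) sequences from one split."""
    data = train_data if split == 'train' else val_data
    ix = torch.randint(len(data) - block_size, (batch_size,))
    x = torch.stack([data[i:i + block_size] for i in ix])          # inputs
    y = torch.stack([data[i + 1:i + block_size + 1] for i in ix])  # targets, shifted by one
    return x.to(device), y.to(device)

for step in range(max_iters):
    if step % eval_interval == 0:
        losses = estimate_loss()  # assumed helper: mean train/val loss over a few batches
        print(f"step {step}: train loss {losses['train']:.4f}, val loss {losses['val']:.4f}")

    xb, yb = get_batch('train')
    logits, loss = model(xb, yb)  # assumed to return the loss alongside the logits
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()
```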
Text Generation:
- A context tensor is initialized with zeros.
- The model autoregressively generates Shakespearean-style text, which is saved to output.txt (sketched below).
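The generation stage in sketch form, assuming the model exposes a generate method that appends max_new_tokens sampled tokens to the running context (a common pattern in similar implementations; the method name, signature, and token count here are assumptions), and reusing decode from the tokenization sketch:

```python
import torch

# Seed the model with a single zero token as context.
context = torch.zeros((1, 1), dtype=torch.long, device=device)

# Autoregressively sample tokens, then decode them back into characters.
ids = model.generate(context, max_new_tokens=500)[0].tolist()  # assumed method/signature
sample = decode(ids)

print(sample)  # the sample echoed to the console
with open('output.txt', 'w', encoding='utf-8') as f:
    f.write(sample)
```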
At the end of training, a sample of the generated text is printed to the console; the full output is also saved to output.txt in the root directory.