
Transformer Implementation

This project implements a Transformer model from scratch, trains and evaluates it on several datasets, and compares its performance against PyTorch's nn.Transformer.

Features

  • Custom implementation of the Transformer architecture.
  • Positional encoding, multi-head attention, and feedforward layers implemented manually (a positional-encoding sketch follows this list).
  • Training and evaluation on multiple datasets:
    • WikiText-2: A dataset for language modeling tasks.
    • Multi30k (EN-DE): English to German translation.
    • Multi30k (EN-FR): English to French translation.
  • BLEU score and loss metrics for performance comparison.
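As an illustration of the hand-written components, here is a minimal sketch of sinusoidal positional encoding in PyTorch. The class name, hyperparameters, and batch-first tensor layout are assumptions for illustration and may not match the modules in this repository.

```python
import math
import torch
import torch.nn as nn

class PositionalEncoding(nn.Module):
    """Sinusoidal positional encoding (illustrative sketch; the repository's
    own module may differ in naming and details)."""

    def __init__(self, d_model: int, max_len: int = 5000, dropout: float = 0.1):
        super().__init__()
        self.dropout = nn.Dropout(p=dropout)

        position = torch.arange(max_len).unsqueeze(1)          # (max_len, 1)
        div_term = torch.exp(
            torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model)
        )
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(position * div_term)            # even dimensions
        pe[:, 1::2] = torch.cos(position * div_term)            # odd dimensions
        self.register_buffer("pe", pe.unsqueeze(0))             # (1, max_len, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        x = x + self.pe[:, : x.size(1)]
        return self.dropout(x)
```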

Comparisons

We evaluate our implementation against PyTorch's nn.Transformer (a minimal baseline setup is sketched after this list). The comparison includes:

  • Training time.
  • Model accuracy (BLEU scores).
  • Convergence behavior.
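For reference, the baseline can be instantiated directly from PyTorch. The hyperparameters below are illustrative and not necessarily the ones used in this repository's experiments.

```python
import torch.nn as nn

# PyTorch baseline used for comparison (illustrative hyperparameters).
baseline = nn.Transformer(
    d_model=512,
    nhead=8,
    num_encoder_layers=6,
    num_decoder_layers=6,
    dim_feedforward=2048,
    dropout=0.1,
    batch_first=True,
)

# src: (batch, src_len, d_model), tgt: (batch, tgt_len, d_model)
# out = baseline(src, tgt,
#                tgt_mask=nn.Transformer.generate_square_subsequent_mask(tgt_len))
```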

Datasets

  1. WikiText-2:
    • Used for language modeling.
    • Tokenized and preprocessed with the basic_english tokenizer.
  2. Multi30k:
    • Two settings: English-to-German (EN-DE) and English-to-French (EN-FR).
    • Preprocessed with spaCy tokenizers for English, German, and French (a tokenization sketch follows this list).
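A minimal sketch of obtaining these tokenizers via torchtext is shown below. The spaCy model names are assumptions, and the models must be downloaded separately before use.

```python
from torchtext.data.utils import get_tokenizer

# WikiText-2: simple lowercasing tokenizer that splits off punctuation.
en_basic = get_tokenizer("basic_english")
print(en_basic("The quick brown fox."))   # e.g. ['the', 'quick', 'brown', 'fox', '.']

# Multi30k: spaCy tokenizers (install models first, e.g.
#   python -m spacy download de_core_news_sm).
en_tok = get_tokenizer("spacy", language="en_core_web_sm")
de_tok = get_tokenizer("spacy", language="de_core_news_sm")
fr_tok = get_tokenizer("spacy", language="fr_core_news_sm")
print(de_tok("Zwei Hunde spielen im Schnee."))
```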

Results

Models are evaluated on BLEU score and loss. Detailed analysis can be found in the logs and plots generated during training.
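For reference, BLEU can be computed on tokenized candidate and reference sentences with torchtext. The sentences below are hypothetical examples, not outputs of this repository's models.

```python
from torchtext.data.metrics import bleu_score

# Hypothetical tokenized model output and reference translation.
candidates = [["a", "dog", "is", "running", "in", "snow"]]
references = [[["a", "dog", "is", "running", "in", "the", "snow"]]]

# Corpus-level BLEU in [0, 1], using up to 4-grams by default.
print(bleu_score(candidates, references))
```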

Usage

  1. Clone the repository:
    git clone https://github.com/JonathanWry/transformerImpl.git
  2. Install dependencies: pip install -r requirements.txt
  3. Run the training script: python train.py

Authors

Jonathan Wang, Conny Zhou
