This repository contains implementations of Sequence-to-Sequence (Seq2Seq) neural models for English → Hindi Neural Machine Translation (NMT), including a baseline architecture and an improved attention-based model.
- Dataset: 100k parallel English–Hindi sentences (70k for training, 30k for testing)
- Epochs: 15
- Batch Size: 64
- Framework: PyTorch
- Loss Function: Negative Log Likelihood Loss (NLLLoss)
- Optimizer: Adam (a minimal sketch of this loss/optimizer setup follows this list)
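The notebooks pair NLLLoss with the Adam optimizer as listed above. The snippet below is only an illustrative sketch of that wiring on dummy tensors; the vocabulary size, hidden size, and layer names are assumptions for the example, not values taken from the notebooks.

```python
import torch
import torch.nn as nn

# Illustrative sizes only; the notebooks define their own vocabulary and dimensions.
VOCAB_SIZE, HIDDEN_SIZE, BATCH_SIZE = 5000, 256, 64

# Stand-in output layer: NLLLoss expects log-probabilities,
# so the model's final projection is followed by log_softmax.
output_layer = nn.Linear(HIDDEN_SIZE, VOCAB_SIZE)
criterion = nn.NLLLoss()
optimizer = torch.optim.Adam(output_layer.parameters())

# One dummy training step on random data, just to show the loss/optimizer flow.
hidden_states = torch.randn(BATCH_SIZE, HIDDEN_SIZE)
targets = torch.randint(0, VOCAB_SIZE, (BATCH_SIZE,))

log_probs = torch.log_softmax(output_layer(hidden_states), dim=-1)
loss = criterion(log_probs, targets)

optimizer.zero_grad()
loss.backward()
optimizer.step()
```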
- Seq2Seq: based on Sequence to Sequence Learning with Neural Networks (Sutskever et al.), comprising Encoder (LSTM) → Context Vector → Decoder (LSTM) → Word Predictor. A sketch of this pipeline is shown below.
- Seq2Seq with Attention: based on Neural Machine Translation by Jointly Learning to Align and Translate (Bahdanau et al.), which introduces an attention mechanism for better alignment and handling of long sentences. A sketch of the attention step follows the baseline example.
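A minimal sketch of the baseline encoder → context vector → decoder → word predictor pipeline. Class names, embedding/hidden sizes, and the assumed `<sos>` index are illustrative only; see seq2seq.ipynb for the actual implementation.

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Embeds the source sentence and compresses it into a context vector."""
    def __init__(self, src_vocab, emb_dim=256, hid_dim=512):
        super().__init__()
        self.embedding = nn.Embedding(src_vocab, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)

    def forward(self, src):                      # src: (batch, src_len)
        embedded = self.embedding(src)
        _, (hidden, cell) = self.lstm(embedded)  # final states act as the context vector
        return hidden, cell

class Decoder(nn.Module):
    """Generates the target sentence one token at a time from the context vector."""
    def __init__(self, tgt_vocab, emb_dim=256, hid_dim=512):
        super().__init__()
        self.embedding = nn.Embedding(tgt_vocab, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.word_predictor = nn.Linear(hid_dim, tgt_vocab)

    def forward(self, tgt_token, hidden, cell):  # tgt_token: (batch, 1)
        embedded = self.embedding(tgt_token)
        output, (hidden, cell) = self.lstm(embedded, (hidden, cell))
        log_probs = torch.log_softmax(self.word_predictor(output), dim=-1)
        return log_probs, hidden, cell

# Example: encode a dummy English batch, then predict the first Hindi token.
encoder, decoder = Encoder(src_vocab=4000), Decoder(tgt_vocab=6000)
src = torch.randint(0, 4000, (2, 10))            # dummy source batch
hidden, cell = encoder(src)
sos = torch.zeros(2, 1, dtype=torch.long)        # assumed <sos> index 0
log_probs, hidden, cell = decoder(sos, hidden, cell)
```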
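The attention model replaces the single fixed context vector with a weighted sum over all encoder states, recomputed at every decoder step. Below is a minimal Bahdanau-style additive attention sketch on dummy tensors; the module name, layer names, and dimensions are assumptions, and seq2seq_attention.ipynb remains the reference implementation.

```python
import torch
import torch.nn as nn

class AdditiveAttention(nn.Module):
    """Bahdanau-style attention: scores each encoder state against the
    current decoder state and returns a weighted context vector."""
    def __init__(self, hid_dim=512):
        super().__init__()
        self.W_dec = nn.Linear(hid_dim, hid_dim)
        self.W_enc = nn.Linear(hid_dim, hid_dim)
        self.v = nn.Linear(hid_dim, 1)

    def forward(self, dec_hidden, enc_outputs):
        # dec_hidden: (batch, hid_dim); enc_outputs: (batch, src_len, hid_dim)
        scores = self.v(torch.tanh(
            self.W_dec(dec_hidden).unsqueeze(1) + self.W_enc(enc_outputs)
        ))                                              # (batch, src_len, 1)
        weights = torch.softmax(scores, dim=1)          # alignment over source tokens
        context = (weights * enc_outputs).sum(dim=1)    # (batch, hid_dim)
        return context, weights.squeeze(-1)

# Example: one decoder step attending over 10 dummy source positions.
attention = AdditiveAttention()
enc_outputs = torch.randn(2, 10, 512)
dec_hidden = torch.randn(2, 512)
context, weights = attention(dec_hidden, enc_outputs)
print(weights.sum(dim=1))   # each row of attention weights sums to 1
```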
- Baseline Seq2Seq Model: produces syntactically correct outputs, but they are often repetitive or contextually weak.
- Attention-based Model: Better at focusing on important input words, resulting in translations more relevant to the target sentence, especially for longer sequences.
- Observation: Attention improves translation quality and contextual accuracy compared to plain Seq2Seq.
- seq2seq.ipynb → Basic Seq2Seq model implementation
- seq2seq_attention.ipynb → Seq2Seq model with attention mechanism
- Paper2.csv → Sample dataset used for experiments
- report → Report explaining the project, with a few tested examples and observations
- Open and run the notebooks: seq2seq.ipynb and seq2seq_attention.ipynb
- Adjust the dataset path and training hyperparameters as needed.
- Python 3.x
- PyTorch
- NumPy, Matplotlib
- Jupyter Notebook
This project is licensed under the MIT License.