NLP_Project

The QANTA Diplomacy project aims to develop a model that predicts whether messages exchanged between players in the game Diplomacy are deceptive or truthful. The project uses in-game conversations and associated metadata to make predictions, with evaluation based on how accurately deceptive and truthful messages are identified.

Overview

This repository contains the code for several models developed as part of the QANTA Diplomacy project. Both baseline and novel models are provided, with support for inference through command line scripts and Jupyter Notebook (.ipynb) files.

Execution Instructions

Several commands below use absolute Windows paths (C:\Git\NLP_Project\...) from the original development environment. Replace them with the corresponding paths on your machine.

Baseline Models

The baseline models are stored in the python_scripts folder. They include a Bag of Words model and a Context-LSTM model.

Bag of Words Model

Navigate to the python_scripts folder:
```
cd python_scripts
```

Train the model:

```
python baseline_BOW.py --data_path C:\Git\NLP_Project\NLP_Project\data --save_path models_BOW/ --max_iter 15 --power_threshold 4
```
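
As a minimal sketch, the same training command can be assembled from a relative path rather than a machine-specific absolute one. The flags are the ones shown above; the helper name build_bow_train_command and the assumption that the dataset sits in a data folder one level above python_scripts are illustrative and not part of the repository.

```python
import sys
from pathlib import Path


def build_bow_train_command(data_dir, save_dir="models_BOW/", max_iter=15, power_threshold=4):
    """Return the argument list for baseline_BOW.py (hypothetical helper)."""
    return [
        sys.executable, "baseline_BOW.py",
        "--data_path", str(Path(data_dir)),
        "--save_path", save_dir,
        "--max_iter", str(max_iter),
        "--power_threshold", str(power_threshold),
    ]


# Assumption: the dataset lives in ../data relative to python_scripts.
command = build_bow_train_command(Path("..") / "data")
print(" ".join(command[1:]))

# To launch training from inside python_scripts, uncomment:
# import subprocess
# subprocess.run(command, check=True)
```

Passing the arguments as a list avoids shell quoting problems when a path contains spaces or parentheses.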

Run inference:

```
python predictions_BOW.py --model_path models_BOW/SENDER_with_power_model.pkl --vectorizer_path models_BOW/SENDER_with_power_vectorizer.pkl --message "I promise I won't attack your territory next turn."
```
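
For readers unfamiliar with the approach, the sketch below shows one way a bag-of-words vector can be combined with a power feature. It is not taken from baseline_BOW.py: the tokenizer, the function names, and the way the threshold is turned into two binary indicators are all assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and keep runs of letters, digits, and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocabulary(messages):
    """Map every token seen in the training messages to a column index."""
    vocab = {}
    for message in messages:
        for token in tokenize(message):
            vocab.setdefault(token, len(vocab))
    return vocab


def featurize(message, vocab, power_delta, power_threshold=4):
    """Token counts followed by two binary power indicators.

    Assumption: the power feature marks whether the score difference between
    the two players exceeds the threshold in either direction.
    """
    counts = Counter(tokenize(message))
    vector = [counts.get(token, 0) for token in vocab]  # dict keeps index order
    vector.append(1 if power_delta > power_threshold else 0)
    vector.append(1 if power_delta < -power_threshold else 0)
    return vector


train_messages = ["I promise I won't attack", "Let us attack Austria"]
vocab = build_vocabulary(train_messages)
features = featurize(
    "I promise I won't attack your territory next turn.", vocab, power_delta=5
)
print(features)
```

Words that never appeared in the training messages (here "your", "territory", "next", "turn") are simply dropped, which is the usual behaviour of a fixed-vocabulary vectorizer.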

Context-LSTM + Power Model

Navigate to the python_scripts folder:
```
cd python_scripts
```
Train the model:
```
python baseline_context_LSTM.py
```

Run inference:

```
python predictions_context_LSTM.py --model_path models_lstm/best_model.pt --sample_message "I promise I won't attack your territory next turn." --power_delta 4
```

Novel Models

The novel models are stored in the folder novel_python_scripts and come in two versions: with and without ConceptNet.

Novel Model without ConceptNet

Navigate to the Without_ConceptNet subfolder of novel_python_scripts:
```
cd novel_python_scripts/Without_ConceptNet
```

Train the model:

```
python train.py --train_path C:\Git\NLP_Project\NLP_Project\data\train.jsonl --val_path C:\Git\NLP_Project\NLP_Project\data\validation.jsonl --test_path C:\Git\NLP_Project\NLP_Project\data\test.jsonl --model_name roberta-base --batch_size 32 --epochs 5 --lr 5e-6 --use_game_scores --oversample_factor 30 --truth_focal_weight 4.0 --gradient_accumulation_steps 2 --output_dir outputs
```
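
The training command combines --batch_size 32 with --gradient_accumulation_steps 2 and --oversample_factor 30. How train.py interprets these flags is not shown in this README, so the sketch below only illustrates their conventional meaning: repeating examples of one class a fixed number of times, and multiplying the batch size by the accumulation steps to get the number of examples per optimizer update. The field name label, the label values, and both helper functions are hypothetical.

```python
import random


def oversample(examples, label_key, target_label, factor, seed=0):
    """Repeat every example with the target label `factor` times, then shuffle."""
    result = []
    for example in examples:
        copies = factor if example[label_key] == target_label else 1
        result.extend([example] * copies)
    random.Random(seed).shuffle(result)
    return result


def effective_batch_size(batch_size, gradient_accumulation_steps):
    """Examples contributing to one optimizer step under gradient accumulation."""
    return batch_size * gradient_accumulation_steps


# Toy data with hypothetical field names and label values.
toy = [
    {"text": "a", "label": "deceptive"},
    {"text": "b", "label": "truthful"},
    {"text": "c", "label": "truthful"},
]
balanced = oversample(toy, "label", "deceptive", factor=30)
print(len(balanced))
print(effective_batch_size(32, 2))
```

Which class the repository oversamples is an assumption here; the function takes the target label as an argument so either reading can be tried.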

Run inference:

```
python inference.py --test_path C:\Git\NLP_Project\NLP_Project\data\test.jsonl --model_path C:\Git\NLP_Project\NLP_Project\novel_models\kaggle\working\best_macro_f1_model.pt --model_name roberta-base --batch_size 32 --use_game_scores --output_file predictions.jsonl
```
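
The checkpoint name best_macro_f1_model.pt suggests that the best model is selected by macro-averaged F1, although this README does not state the metric explicitly. As a minimal sketch of that metric, the code below computes F1 per class and averages the two values. The label strings and function names are assumptions chosen for illustration.

```python
def f1_for_class(y_true, y_pred, positive):
    """F1 score treating `positive` as the class of interest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # also covers the case where the class is never predicted
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred, classes=("truthful", "deceptive")):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1_for_class(y_true, y_pred, c) for c in classes) / len(classes)


# Hypothetical labels: a model that always answers "truthful".
y_true = ["truthful", "truthful", "truthful", "deceptive"]
y_pred = ["truthful", "truthful", "truthful", "truthful"]
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(accuracy, macro_f1(y_true, y_pred))
```

In this toy case the accuracy is 0.75 while the macro F1 is only 3/7, because the deceptive class scores zero. This is why a macro-averaged metric is a common choice when one class is rare.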

Novel Model with ConceptNet

Navigate to the With_ConceptNet subfolder of novel_python_scripts:
```
cd novel_python_scripts/With_ConceptNet
```
Set up the required dependencies:
```
python setup.py --setup_all
```

Train the model:

```
python train.py --train_path C:\Git\NLP_Project\NLP_Project\data\train.jsonl --val_path C:\Git\NLP_Project\NLP_Project\data\validation.jsonl --test_path C:\Git\NLP_Project\NLP_Project\data\test.jsonl --conceptnet_path data/numberbatch-en.txt --model_name roberta-base --batch_size 32 --epochs 5 --lr 5e-6 --use_game_scores --oversample_factor 30 --truth_focal_weight 4.0 --gradient_accumulation_steps 2 --output_dir outputs
```

Run inference:

```
python inference.py --test_path C:\Git\NLP_Project\NLP_Project\data\test.jsonl --model_path "C:\Git\NLP_Project\NLP_Project\novel_models(With Conceptnet)\kaggle\working\best_macro_f1_model.pt" --conceptnet_path data/numberbatch-en.txt --model_name roberta-base --batch_size 32 --use_game_scores --output_file predictions.jsonl
```
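
Both commands in this section take --conceptnet_path data/numberbatch-en.txt. As a rough illustration of what such a file contains, the sketch below parses the plain-text layout in which ConceptNet Numberbatch embeddings are distributed: a header line with the row count and vector dimension, then one term per line followed by its vector values. The function name load_numberbatch and the tiny in-memory sample are illustrative; how train.py actually reads the file is not documented here.

```python
import io


def load_numberbatch(lines, wanted=None):
    """Parse 'count dim' header plus 'term v1 v2 ...' rows into a dict.

    `wanted` optionally restricts loading to a set of terms, which keeps
    memory use low when only the dataset vocabulary is needed.
    """
    iterator = iter(lines)
    header = next(iterator).split()
    dim = int(header[1])
    vectors = {}
    for line in iterator:
        parts = line.rstrip().split(" ")
        term, values = parts[0], parts[1:]
        if wanted is not None and term not in wanted:
            continue
        if len(values) != dim:
            continue  # skip malformed rows
        vectors[term] = [float(v) for v in values]
    return dim, vectors


# Tiny made-up sample in the same layout; real vectors have 300 dimensions.
sample = io.StringIO("3 2\nattack 0.1 -0.2\npromise 0.3 0.4\nterritory 0.0 0.5\n")
dim, vectors = load_numberbatch(sample, wanted={"attack", "promise"})
print(dim, sorted(vectors))

# With the real file (path taken from the commands above):
# with open("data/numberbatch-en.txt", encoding="utf-8") as handle:
#     dim, vectors = load_numberbatch(handle)
```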

Drive Link

You can access the models and additional data via the following Google Drive link:

https://drive.google.com/drive/folders/1zNyJ8Cs1Vzt1ohzljB5PHJHfILcK5Xcz?usp=sharing

