Dataset and Implementation of the paper "LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference"
pip install -r requirements.txt
The code requires English and German spaCy models for tokenization and linguistic analysis. Run the following commands:
python -m spacy download en_core_web_sm
python -m spacy download de_core_news_md
├── esnli
│ ├── annotation_esnli.jsonl (annotation results for e-SNLI)
│ ├── subset_explcat1.jsonl (subset: one explanation category per item)
│ ├── subset_explcat2.jsonl (subset: two explanation categories per item)
│ ├── subset_explcat3.jsonl (subset: three or more explanation categories per item)
├── livenli
│ ├── annotation_livenli.jsonl (annotation results for LiveNLI)
├── varierr
│ ├── annotation_varierr.jsonl (annotation results for VariErr)
├── explanation_annotation.py
├── explanation_annotation_highlight.py
├── human_validation_annotation.py
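All annotation files above are in JSONL format (one JSON object per line). A minimal loader sketch, in case you want to inspect the data outside the provided scripts; the field names in the usage comment are illustrative and may not match the actual annotation schema:

```python
import json

def load_jsonl(path):
    """Read a JSONL file (one JSON object per line) into a list of dicts."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]

# Hypothetical usage -- field names are illustrative, not guaranteed:
# records = load_jsonl("esnli/annotation_esnli.jsonl")
# for rec in records[:3]:
#     print(rec.get("premise"), rec.get("hypothesis"), rec.get("label"))
```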
The annotation interfaces are built with Streamlit (https://github.com/streamlit/streamlit).
To run the interactive web apps, follow the Quickstart instructions in the Streamlit guide (https://github.com/streamlit/streamlit).
Then, open a terminal and run:
$ streamlit run explanation_annotation.py
$ streamlit run explanation_annotation_highlight.py
$ streamlit run human_validation_annotation.py
├── classification_iaa
│ ├── annotator0_iaa.jsonl # classification annotation of annotator 0
│ ├── annotator1_iaa.jsonl # classification annotation of annotator 1
├── highlight_iaa
│ ├── annotator0_iaa_highlight.jsonl # highlight annotation of annotator 0
│ ├── annotator1_iaa_highlight.jsonl # highlight annotation of annotator 1
├── iaa.ipynb # notebook for inter-annotator agreement (IAA) analysis
├── bert.ipynb # notebook for fine-tuning BERT
├── roberta.ipynb # notebook for fine-tuning RoBERTa
├── llm_explanation_classifier.py # script for LLM-based explanation classification
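The IAA notebook above compares the two annotators' classification files. The exact metrics it reports live in `iaa.ipynb`; as a stand-in, here is a minimal pure-Python sketch of Cohen's kappa, a standard agreement measure for two annotators labeling the same items:

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators over the same items."""
    assert len(labels_a) == len(labels_b) and labels_a
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels
    po = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each annotator's marginal label distribution
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    pe = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (po - pe) / (1 - pe) if pe < 1 else 1.0
```

Note this is an illustration of the metric, not the notebook's implementation (which may use `sklearn.metrics.cohen_kappa_score` or report additional statistics).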
├── classification_results
│ ├── deepseek
│ │ ├── predictions_deepseek-v3_baseline.jsonl
│ │ ├── predictions_deepseek-v3_instruction.jsonl
│ │ ├── predictions_deepseek-v3_one_example.jsonl
│ │ ├── predictions_deepseek-v3_one_example_instruction.jsonl
│ │ ├── predictions_deepseek-v3_two_example.jsonl
│ │ ├── predictions_deepseek-v3_two_example_instruction.jsonl
│ ├── gpt3.5
│ │ ├── predictions_gpt3.5_baseline.jsonl
│ │ ├── predictions_gpt3.5_instruction.jsonl
│ │ ├── predictions_gpt3.5_one_example.jsonl
│ │ ├── predictions_gpt3.5_one_example_instruction.jsonl
│ │ ├── predictions_gpt3.5_two_example.jsonl
│ │ ├── predictions_gpt3.5_two_example_instruction.jsonl
│ ├── gpt4o
│ │ ├── predictions_gpt4o_baseline.jsonl
│ │ ├── predictions_gpt4o_instruction.jsonl
│ │ ├── predictions_gpt4o_one_example.jsonl
│ │ ├── predictions_gpt4o_one_example_instruction.jsonl
│ │ ├── predictions_gpt4o_two_example.jsonl
│ │ ├── predictions_gpt4o_two_example_instruction.jsonl
│ ├── llama
│ │ ├── predictions_llama_baseline.jsonl
│ │ ├── predictions_llama_instruction.jsonl
│ │ ├── predictions_llama_one_example.jsonl
│ │ ├── predictions_llama_one_example_instruction.jsonl
│ │ ├── predictions_llama_two_example.jsonl
│ │ ├── predictions_llama_two_example_instruction.jsonl
├── README.md # instructions for running llm_explanation_classifier.py
├── llm_explanation
│ ├── Deepseek # generated explanations using ``deepseek-chat``
│ │ ├── deepseek_classify_and_generate.zip
│ │ ├── deepseek_highlight_index.zip
│ │ ├── deepseek_highlight_marked.zip
│ │ ├── deepseek_label.zip
│ │ ├── deepseek_taxonomy_filtered.zip
│ ├── GPT4o # generated explanations using ``gpt4o``
│ │ ├── gpt4o_classify_and_generate.zip
│ │ ├── gpt4o_highlight_index.zip
│ │ ├── gpt4o_highlight_marked.zip
│ │ ├── gpt4o_label.zip
│ │ ├── gpt4o_taxonomy_filtered.zip
│ ├── Llama # generated explanations using ``Llama-3.2-3B-Instruct``
│ │ ├── llama_classify_and_generate.zip
│ │ ├── llama_highlight_index.zip
│ │ ├── llama_highlight_marked.zip
│ │ ├── llama_label.zip
│ │ ├── llama_taxonomy_filtered.zip
├── model_generator.py
├── README.md # instructions for running model_generator.py
├── annotated_validation.jsonl # results of human validation of model-generated explanations
├── similarity_analysis.py # script for calculating explanation similarity
├── similarity_score
│ ├── livenli_similarity_per_instance_results.jsonl
│ ├── varierr_similarity_per_instance_results.jsonl
├── README.md # instructions for running similarity_analysis.py
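For intuition about the per-instance similarity scores above: the method `similarity_analysis.py` actually uses is documented in its README, but a minimal lexical sketch of one common choice, cosine similarity over bag-of-words vectors, looks like this (an illustrative stand-in, not the repository's implementation, which may instead use sentence embeddings):

```python
import math
from collections import Counter

def cosine_similarity(text_a, text_b):
    """Cosine similarity between bag-of-words vectors of two explanations."""
    va = Counter(text_a.lower().split())
    vb = Counter(text_b.lower().split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = math.sqrt(sum(c * c for c in va.values())) * \
           math.sqrt(sum(c * c for c in vb.values()))
    return dot / norm if norm else 0.0
```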