This is the repository for the KNN project at VUT FIT.
It currently consists of a couple of Python notebooks used for data preprocessing and evaluation.
The final project report is available here: KNN_report.pdf
Notes doc: https://docs.google.com/document/d/1VxYESr0JHd2iB6swp7XlGTmxkmW5QZ8Ofs7JtMrcwYg/edit#heading=h.k8mvfthgb23j
Checkpoint doc: https://docs.google.com/document/d/1ENQ43E5LHeky2Nc-gqrUHDR-GbNUQfpfMnq47S-j0jo/edit#heading=h.wgl2s2eniag8
Using a virtual environment is recommended (note that execution can be noticeably slower inside a venv):

```shell
# create a virtual environment
python3.11 -m venv .venv
# activate the venv
source .venv/bin/activate
# install requirements
pip install -r requirements.txt
```
Tool for creating the dataset:
- create a `.env` file based on `example.env`
- install requirements: `pip install -r requirements.txt`
- extract the SumeCzech dataset into `./sumeczech-1.0/`
- run `python dataset_tool.py <input_json_file> <output_json_file>`
  - `<input_json_file>` is the file with the data in the `.jsonl` format of the SumeCzech dataset
  - `<output_json_file>` is the file where the output will be saved, also in the `.jsonl` format
Running on the cluster:
- ideally set up login via an SSH key
- connect with X11 forwarding: `ssh -X zefron6.cerit-sc.cz`
- submit the job with qsub: `qsub ./Jupyter_Cuda80_KNN.sh`
Evaluation setup:
- used the SumeCzech test dataset, but only the first 150 examples
- model: `Mistral-7B-Instruct-v0.2`, quantized to 8-bit: `mistral-7b-instruct-v0.2.Q8_0.gguf`
Given prompt (zero-shot):
```python
{"role": "user", "content": "Vytvoř stručný český nadpis, který výstižně shrnuje obsah tohoto abstraktu: \n '{abstract}'"},
```
Given prompt (3-shot):
```python
chat_abstract_to_headline_3shot = [
    {"role": "user", "content": "Jste užitečným pomocníkem, který shrne text v českém jazyce pro různé typy sumarizací.\n {abstract}"},
    {"role": "assistant", "content": "{headline}"},
    {"role": "user", "content": "{abstract}"},
    {"role": "assistant", "content": "{headline}"},
    {"role": "user", "content": "{abstract}"},
    {"role": "assistant", "content": "{headline}"},
    {"role": "user", "content": "{abstract}"},
]
```
- the text-to-abstract setting uses the same prompt as above, but with the article text as input and the abstract as target
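A few-shot chat prompt of the shape above can be assembled from example pairs followed by the final query. The helper below is a sketch of that assembly (the function name and argument layout are assumptions, not code from this repository):

```python
def build_few_shot_chat(system_instruction, examples, query):
    """Build a chat message list: the instruction is prepended to the
    first user turn, each (input, output) example becomes a
    user/assistant pair, and the final query is the last user turn."""
    messages = []
    for i, (src, tgt) in enumerate(examples):
        content = f"{system_instruction}\n {src}" if i == 0 else src
        messages.append({"role": "user", "content": content})
        messages.append({"role": "assistant", "content": tgt})
    messages.append({"role": "user", "content": query})
    return messages
```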
ROUGE-RAW:
- a version of the ROUGE metric that uses neither stemming nor stop-word removal
- used in the SumeCzech paper
- NOTE on the high values reported by the rougeraw metric: these refer to the ROUGE-RAW score when the target summary length is set to the 90th percentile of the reference summary lengths
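Since ROUGE-RAW is ROUGE computed on raw tokens, a minimal sketch of the unigram variant (ROUGE-1 precision/recall/F1 with no stemming and no stop-word removal) looks like this — a simplified illustration, not the official implementation:

```python
from collections import Counter

def rouge1_raw(candidate: str, reference: str):
    """ROUGE-1 on raw whitespace tokens: overlap counts are clipped
    per token type; no stemming, no stop-word removal."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    # multiset intersection clips repeated tokens correctly
    overlap = sum((Counter(cand) & Counter(ref)).values())
    p = overlap / len(cand) if cand else 0.0
    r = overlap / len(ref) if ref else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f
```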
Mistral-7b-instruct-v0.2:
```
                RougeRAW-1        RougeRAW-2        RougeRAW-L
                P     R     F     P     R     F     P     R     F
0s-abs to hdln  12.9  22.6  16.4  03.1  06.1  04.0  11.3  19.8  14.1
3s-abs to hdln  13.7  23.6  16.6  03.5  06.8  04.4  11.8  21.2  14.7
3s-txt to abs   14.0  19.4  15.9  01.9  02.8  02.2  09.2  13.2  10.6
```
Llama3-8b-instruct:
```
                RougeRAW-1        RougeRAW-2        RougeRAW-L
                P     R     F     P     R     F     P     R     F
0s-abs to hdln  22.7  18.6  19.6  07.8  06.2  06.7  20.5  17.0  17.7
3s-abs to hdln  22.9  21.8  21.6  07.1  06.6  06.6  20.6  20.0  19.6
3s-txt to abs   14.3  23.7  17.3  02.9  04.6  03.4  09.7  16.3  11.8
```
Llama3-8b-instruct-finetuned:
```
                RougeRAW-1        RougeRAW-2        RougeRAW-L
                P     R     F     P     R     F     P     R     F
0s-abs to hdln  21.4  18.1  18.8  07.0  06.1  06.3  18.8  16.1  16.6
3s-abs to hdln  22.7  21.4  21.1  07.0  07.1  06.7  19.2  18.5  18.0
3s-txt to abs   14.1  23.8  17.1  02.7  04.6  03.3  09.5  16.4  11.6
```
csmpt7b:
```
                RougeRAW-1        RougeRAW-2        RougeRAW-L
                P     R     F     P     R     F     P     R     F
0s-abs to hdln  12.9  17.7  13.6  04.2  05.8  04.6  11.9  15.6  12.2
```
csmpt7b-finetuned:
```
                RougeRAW-1        RougeRAW-2        RougeRAW-L
                P     R     F     P     R     F     P     R     F
0s-abs to hdln  12.1  24.0  15.4  03.9  08.1  05.2  10.8  21.7  13.8
```
Results reported in the SumeCzech paper:
- abstract to headline
```
Method     RougeRAW-1        RougeRAW-2        RougeRAW-L
           P     R     F     P     R     F     P     R     F
first      13.9  23.6  16.5  04.1  07.4  05.0  12.2  20.7  14.5
random     11.0  17.8  12.8  02.6  04.5  03.1  09.6  15.5  11.1
textrank   13.3  22.8  15.9  03.7  06.8  04.6  11.6  19.9  13.8
t2t        20.2  15.9  17.2  06.7  05.1  05.6  18.6  14.7  15.8
```
- text to abstract
```
Method     RougeRAW-1        RougeRAW-2        RougeRAW-L
           P     R     F     P     R     F     P     R     F
first      13.1  17.9  14.4  01.9  02.8  02.1  08.8  12.0  09.6
random     11.7  15.5  12.7  01.2  01.7  01.3  07.7  10.3  08.4
textrank   11.1  20.8  13.8  01.6  03.1  02.0  07.1  13.4  08.9
t2t        13.2  10.5  11.3  01.2  00.9  01.0  10.2  08.1  08.7
```
- ROUGE metric from the Hugging Face library
- computed without a stemmer
- only F1 scores are shown
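Rouge-L in the tables below is based on the longest common subsequence of the token sequences. A minimal LCS-based Rouge-L F1, matching the no-stemmer setting above (a simplified illustration, not the Hugging Face implementation):

```python
def rouge_l_f1(candidate: str, reference: str) -> float:
    """Rouge-L F1 from the longest common subsequence of raw tokens."""
    a, b = candidate.split(), reference.split()
    # classic O(len(a) * len(b)) LCS dynamic programme
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            dp[i + 1][j + 1] = dp[i][j] + 1 if x == y else max(dp[i][j + 1], dp[i + 1][j])
    lcs = dp[len(a)][len(b)]
    p = lcs / len(a) if a else 0.0
    r = lcs / len(b) if b else 0.0
    return 2 * p * r / (p + r) if p + r else 0.0
```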
Mistral-7b-instruct-v0.2:
```
                Rouge-1  Rouge-2  Rouge-L  Rouge-Lsum
                F        F        F        F
0s-abs to hdln  18.36    5.30     15.34    15.35
3s-abs to hdln  17.61    5.39     14.50    14.71
3s-txt to abs   24.53    4.65     13.67    13.87
```
Llama3-8b-instruct:
```
                Rouge-1  Rouge-2  Rouge-L  Rouge-Lsum
                F        F        F        F
0s-abs to hdln  20.78    8.41     18.15    17.99
3s-abs to hdln  23.22    8.94     20.52    20.48
3s-txt to abs   26.34    5.89     14.94    14.98
```
Llama3-8b-instruct-finetuned:
```
                Rouge-1  Rouge-2  Rouge-L  Rouge-Lsum
                F        F        F        F
0s-abs to hdln  19.17    07.83    17.28    17.21
3s-abs to hdln  22.46    08.30    18.85    18.81
3s-txt to abs   25.81    05.75    14.58    14.60
```
csmpt7b:
```
                Rouge-1  Rouge-2  Rouge-L  Rouge-Lsum
                F        F        F        F
0s-abs to hdln  13.49    4.89     11.76    11.75
```
csmpt7b-finetuned:
```
                Rouge-1  Rouge-2  Rouge-L  Rouge-Lsum
                F        F        F        F
0s-abs to hdln  16.15    6.83     13.67    13.73
```
- BERTScore uses the model `google-bert/bert-base-multilingual-cased`
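BERTScore greedily matches each token embedding to its most similar counterpart on the other side and averages the cosine similarities. The toy sketch below shows that greedy matching with plain Python lists as stand-in vectors; real BERTScore uses contextual embeddings from the BERT model named above:

```python
import math

def _cos(u, v):
    """Cosine similarity of two plain-list vectors."""
    num = sum(a * b for a, b in zip(u, v))
    den = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return num / den if den else 0.0

def bertscore_toy(cand_vecs, ref_vecs):
    """Greedy matching: recall averages, over reference tokens, the best
    cosine similarity to any candidate token; precision is symmetric."""
    r = sum(max(_cos(rv, cv) for cv in cand_vecs) for rv in ref_vecs) / len(ref_vecs)
    p = sum(max(_cos(cv, rv) for rv in ref_vecs) for cv in cand_vecs) / len(cand_vecs)
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f
```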
Mistral-7b-instruct-v0.2:
```
                BertScore
                P      R      F
0s-abs to hdln  0.659  0.692  0.675
3s-abs to hdln  0.657  0.693  0.674
3s-txt to abs   0.659  0.674  0.666
```
Llama3-8b-instruct:
```
                BertScore
                P      R      F
0s-abs to hdln  0.714  0.694  0.703
3s-abs to hdln  0.715  0.697  0.705
3s-txt to abs   0.665  0.690  0.677
```
Llama3-8b-instruct-finetuned:
```
                BertScore
                P      R      F
0s-abs to hdln  0.707  0.692  0.699
3s-abs to hdln  0.715  0.698  0.706
3s-txt to abs   0.668  0.690  0.678
```
csmpt7b:
```
                BertScore
                P      R      F
0s-abs to hdln  0.595  0.596  0.595
```
csmpt7b-finetuned:
```
                BertScore
                P      R      F
0s-abs to hdln  0.640  0.683  0.661
```
Mistral-7b-instruct-v0.2:
```
                BLEU
0s-abs to hdln  0.011
3s-abs to hdln  0.008
3s-txt to abs   0.010
```
Llama3-8b-instruct:
```
                BLEU
0s-abs to hdln  0.023
3s-abs to hdln  0.025
3s-txt to abs   0.018
```
Llama3-8b-instruct-finetuned:
```
                BLEU
0s-abs to hdln  0.018
3s-abs to hdln  0.025
3s-txt to abs   0.016
```
csmpt7b:
```
                BLEU
0s-abs to hdln  0.015
```
csmpt7b-finetuned:
```
                BLEU
0s-abs to hdln  0.016
```
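As a rough reference for what the BLEU numbers above measure, here is a minimal single-reference sentence-level BLEU sketch (geometric mean of modified n-gram precisions up to 4-grams with a brevity penalty). This is an unsmoothed simplification; the reported scores presumably come from a standard implementation:

```python
import math
from collections import Counter

def _ngrams(tokens, n):
    """Multiset of n-grams as a Counter of tuples."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(candidate: str, reference: str, max_n: int = 4) -> float:
    cand, ref = candidate.split(), reference.split()
    log_prec = 0.0
    for n in range(1, max_n + 1):
        c_ngrams, r_ngrams = _ngrams(cand, n), _ngrams(ref, n)
        total = sum(c_ngrams.values())
        if total == 0:
            return 0.0  # candidate shorter than n
        clipped = sum((c_ngrams & r_ngrams).values())
        if clipped == 0:
            return 0.0  # unsmoothed: any zero precision zeroes BLEU
        log_prec += math.log(clipped / total) / max_n
    # brevity penalty punishes candidates shorter than the reference
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / max(len(cand), 1))
    return bp * math.exp(log_prec)
```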
Mistral-7b-instruct-v0.2:
```
                METEOR
0s-abs to hdln  0.125
3s-abs to hdln  0.133
3s-txt to abs   0.142
```
Llama3-8b-instruct:
```
                METEOR
0s-abs to hdln  0.117
3s-abs to hdln  0.134
3s-txt to abs   0.168
```
Llama3-8b-instruct-finetuned:
```
                METEOR
0s-abs to hdln  0.113
3s-abs to hdln  0.134
3s-txt to abs   0.170
```
csmpt7b:
```
                METEOR
0s-abs to hdln  0.097
```
csmpt7b-finetuned:
```
                METEOR
0s-abs to hdln  0.128
```