# Language Models in the Biomedical Domain
This repository supports the research paper "From Pre-training to Fine-tuning: an in-depth Analysis of Large Language Models in the Biomedical Domain" and contains Python scripts for fine-tuning, probing, and comparing the attention patterns of pre-trained language models.
Before executing the scripts, it's crucial to navigate to the correct directory based on the task you wish to perform:
- For Natural Language Inference (NLI) tasks, enter the `NLI` directory.
- For Named Entity Recognition (NER) tasks, enter the `NER` directory.
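For example, from the repository root:

```bash
# Move into the task-specific directory before running any script
cd NLI   # for Natural Language Inference tasks
# or
cd NER   # for Named Entity Recognition tasks
```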
Each directory contains scripts tailored for these tasks, using the following pre-trained models:
- BERT
- BioBERT
- GPT-2 Medium
- BioGPT
Below you'll find instructions on how to execute each script from the command line, including the required arguments for the fine-tuning, probing, and attention-comparison tasks. Ensure you are in the correct task-specific directory (`NLI` or `NER`) before running a script.
## Fine-tuning (`fine-tuning.py`)

This script fine-tunes the selected model on the task-specific dataset.
Usage:

```bash
python fine-tuning.py --training_size <PERCENTAGE> --model_name <MODEL>
```
Arguments:
- `--training_size`: Size of the training set as a percentage. Valid choices are 0, 10, 30, 50, and 100.
- `--model_name`: Name of the model to fine-tune. Valid choices are `bert`, `biobert`, `gpt2`, and `biogpt`.
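For example, to fine-tune BioBERT on 50% of the training data:

```bash
python fine-tuning.py --training_size 50 --model_name biobert
```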
## Probing (`probing.py`)

This script runs a probing task to analyze the encoding capabilities of pre-trained and fine-tuned models.
Usage:
```bash
python probing.py --training_size <PERCENTAGE> --model_name <MODEL>
```
Arguments:
- `--training_size`: Size of the training set as a percentage, which allows experimenting with varying amounts of training data to see how it affects model performance. Valid choices are 0, 10, 30, 50, and 100.
- `--model_name`: Model to use for the probing task. Valid choices are `bert`, `biobert`, `gpt2`, and `biogpt`, enabling comparative analysis across different architectures and training regimes.
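For example, to run the probing task with BERT and a 30% training split:

```bash
python probing.py --training_size 30 --model_name bert
```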
## Attention comparison (`attention.py`)

This script compares attention patterns between pre-trained and fine-tuned models, providing insight into how attention mechanisms change as a result of fine-tuning on specific datasets.
Usage:
```bash
python attention.py --training_sizes <PERCENTAGES> --model_name <MODEL>
```
Arguments:
- `--training_sizes`: Sizes of the training sets as percentages, separated by commas with no spaces (e.g., 10,30).
- `--model_name`: Name of the model whose attention is compared. Valid choices are `bert`, `biobert`, `gpt2`, and `biogpt`.
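For example, to compare attention for BioGPT across the 10% and 30% training splits:

```bash
python attention.py --training_sizes 10,30 --model_name biogpt
```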