Master Thesis Repository - Tomáš Mlynář

This repository contains all code, experiments, and documentation related to the master thesis of Tomáš Mlynář. The project focuses on the adaptation of large language models (LLMs), including their training, evaluation, and benchmarking, with a particular emphasis on Czech language resources and evaluation frameworks.

🤗 Hugging Face Collection

All published models and datasets are available on the Hugging Face Hub.

Project Structure

datasets_creation/ # Scripts and notebooks for dataset creation and preprocessing 
evaluation/ # Evaluation scripts, benchmarks, and analysis notebooks
scripts/ # Shell scripts for running experiments and evaluations 
training/ # Training scripts and notebooks (pretraining, finetuning, NLI, etc.)

Installation

Clone the repository:

git clone https://gitlab.fel.cvut.cz/factchecking/master-thesis-repository-tomas-mlynar.git
cd master-thesis-repository-tomas-mlynar

(Recommended) Create and activate a Python virtual environment (three requirements files are available, one for each component):
```
python3 -m venv venv
source venv/bin/activate
```

Install dependencies (from the desired requirements file):

pip install -r master_venv_requirements.txt # main venv for the project

pip install -r unsloth_venv_requirements.txt # for training with Unsloth

pip install -r wildbench_venv_requirements.txt # for evaluation with WildBench
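
Because the three requirements files target different components, it may be convenient to keep each one in its own virtual environment. The following is a minimal sketch under that assumption, not part of the repository: the helper names `requirements_for` and `setup_venv` and the `venv_<component>` directory names are hypothetical, while the requirements file names are the ones listed above.

```shell
# Hypothetical helpers (not shipped with the repository).

# Map a component name to its requirements file.
requirements_for() {
  case "$1" in
    master|unsloth|wildbench) printf '%s_venv_requirements.txt\n' "$1" ;;
    *) echo "unknown component: $1" >&2; return 1 ;;
  esac
}

# Create venv_<component>/ and install that component's dependencies into it.
# The directory naming is an assumption made for this sketch.
setup_venv() {
  local component="$1" req
  req="$(requirements_for "$component")" || return 1
  python3 -m venv "venv_${component}" || return 1
  "venv_${component}/bin/pip" install -r "$req"
}
```

For example, after sourcing these functions from the repository root, `setup_venv unsloth` followed by `source venv_unsloth/bin/activate` would give an environment for Unsloth training only.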

Acknowledgments

WildBench for evaluation scripts and benchmarks.
Supervisors, collaborators, and the Czech Technical University in Prague for support and guidance.

