LLMs for machine translation on medium-to-low resource languages: A comprehensive evaluation for Catalan
Large language models (LLMs) trained on monolingual data, predominantly in English, with no intentionally included parallel text, have demonstrated remarkable potential in multilingual machine translation. In this study, we assess the performance of decoder-only LLMs on translation for medium-to-low resource languages, conducting both qualitative and quantitative error analyses, and investigating the effect of translation fine-tuning strategies on the model's cross-lingual transfer and its performance on other tasks. All experiments are centred on Iberian languages, focusing on the case of Catalan and Spanish.