CymruFluency - A Fusion Technique and a 4D Welsh Dataset for Welsh Fluency Analysis

Arvinder Pal Singh Bali¹ · Gary K.L. Tam¹ · Avishek Siris¹ · Gareth Andrews¹ · Yukun Lai² · Bernie Tiddeman³ · Gwenno Ffrancon⁴

_{¹ Swansea University ·
² Cardiff University ·
³ Aberystwyth University ·
⁴ Academi Hywel Teifi, Swansea University}

Note

Published in Advanced Concepts for Intelligent Vision Systems (ACIVS 2025) / Lecture Notes in Computer Science

Full paper is avaliable HERE.

This project introduces a novel dataset and method for evaluating Welsh language fluency using multimodal fusion techniques.

Abstract

Welsh is a linguistically rich yet under-resourced minority language. Despite its cultural significance, automated fluency assessment remains largely unexplored due to limited datasets and tools. Existing models focus on high-resource languages, leaving Welsh without sufficient multi-modal resources. To address this, we introduce CymruFluency, the first 4D dataset for Welsh fluency assessment, capturing both audio and 3D lip movements with expert-annotated fluency scores. Building on this, we propose a multi-modal fluency classification framework that combines audio features (mel spectrograms) and manually annotated 3D lip landmarks. Our fusion approach significantly improves fluency prediction over unimodal models, emphasizing the critical role of 3D lip dynamics in Welsh learning. This research advances minority language processing by integrating articulatory features into fluency evaluation, offering a powerful tool for Welsh language learning, assessment, and preservation.

Capture steps

1. Sequence capture with 3dMD machine

Fluent Speaker

Non-Fluent Speaker

Note the exaggerated mouth movement in fluent speakers.

2. 3D reconstruction

CymruFluency.V10.mp4

Subject uttering Welsh phrase “Gwybodaeth angenrheidiol” (Tr. EN: Necessary information; IPA: /ˈɡʊɨ̯bɔðaɪθ aŋɛnˈhreɪ̯djɔl/)

3. Landmarking process

3D mesh quality and landmarking in progress.

4. Alignment

Aligning landmarks to mitigate head movement.

Dataset

Note

Full dataset is available on Zenodo.

The dataset is split in four parts and can be accessed through the four versions of the repository. For more information on content and structure of the dataset, please read dataset description.

Installation

Clone this repo:

git clone https://github.com/arvinsingh/CymruFluency.git
cd CymruFluency

Install dependencies:
```
uv sync
```
Launch the notebooks:
```
jupyter notebook
```

Notebooks Overview

Data Exploration and Analysis.ipynb - Visualize and explore dataset stats
Experiment Audio Landmarks.ipynb - Train and eval unimodal models
Experiment Model Training.ipynb - Train and eval multimodal models
Welsh vs English.ipynb - Comparative study of fluency in Welsh vs English dataset

Architecture Pipeline.

License

This dataset is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Research purposes only.

Citation [Pending publication]

Important

If you use our dataset and code, please use the following two bibtex for citation:

@inproceedings{bali_2025_cymrufluency,
  author       = {Arvinder Pal Singh Bali and
                  Gary K. L. Tam and
                  Avishek Siris and
                  Gareth Andrews and
                  Yukun Lai and
                  Bernie Tiddeman and
                  Gwenno Ffrancon},
  title        = {CymruFluency - A fusion technique and a 4D Welsh dataset for Welsh fluency analysis},
  booktitle    = {Advanced Concepts for Intelligent Vision Systems (ACIVS)},
  year         = {2025},
  address      = {Japan},
  publisher    = {Springer (Lecture Notes in Computer Science, LNCS)},
  doi          = {TBD},
  url          = {TBD}
}

@dataset{bali_2025_dataset,
  author       = {Bali, Arvinder Pal Singh and
                  Tam, Gary KL and
                  Siris, Avishek and
                  Andrews, Gareth and
                  Lai, Yukun and
                  Tiddeman, Bernie and
                  Ffrancon, Gwenno},
  title        = {Dataset and code for "CymruFluency - A fusion technique and a 4D Welsh dataset for Welsh fluency analysis"},
  month        = may,
  year         = 2025,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.15397513},
  url          = {https://doi.org/10.5281/zenodo.15397513},
}

Acknowledgement

This is part of a major ongoing project led by Dr Gary K.L. Tam.

This research was supported by Coleg Cymraeg Cenedlaethol Small Grant 2017, Cherish-DE Escalator Fund 2019, 2021(1RR, 52E), Swansea University SPIN fund, Wales Network Innovation Small Grant 2023 and EPSRC IAA Fund 2024. We would like to thank all annotators and anonymized participants for their contributions to this project.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
assets		assets
src		src
.gitignore		.gitignore
DATASET.md		DATASET.md
Data Exploration and Analysis.ipynb		Data Exploration and Analysis.ipynb
Experiment Audio Landmarks.ipynb		Experiment Audio Landmarks.ipynb
Experiment Model Training.ipynb		Experiment Model Training.ipynb
LICENSE		LICENSE
README.md		README.md
Welsh vs English.ipynb		Welsh vs English.ipynb
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CymruFluency - A Fusion Technique and a 4D Welsh Dataset for Welsh Fluency Analysis