Skip to content

arvinsingh/CymruFluency

Repository files navigation

CymruFluency - A Fusion Technique and a 4D Welsh Dataset for Welsh Fluency Analysis

1 Swansea University · 2 Cardiff University · 3 Aberystwyth University · 4 Academi Hywel Teifi, Swansea University

Note

Accepted at Advanced Concepts for Intelligent Vision Systems (ACIVS 2025)

Post-print version is avaliable HERE.

This project introduces a novel dataset and method for evaluating Welsh language fluency using multimodal fusion techniques.

Abstract

Welsh is a linguistically rich yet under-resourced minority language. Despite its cultural significance, automated fluency assessment remains largely unexplored due to limited datasets and tools. Existing models focus on high-resource languages, leaving Welsh without sufficient multi-modal resources. To address this, we introduce CymruFluency, the first 4D dataset for Welsh fluency assessment, capturing both audio and 3D lip movements with expert-annotated fluency scores. Building on this, we propose a multi-modal fluency classification framework that combines audio features (mel spectrograms) and manually annotated 3D lip landmarks. Our fusion approach significantly improves fluency prediction over unimodal models, emphasizing the critical role of 3D lip dynamics in Welsh learning. This research advances minority language processing by integrating articulatory features into fluency evaluation, offering a powerful tool for Welsh language learning, assessment, and preservation.

Capture steps

1. Sequence capture with 3dMD machine

Fluent Speaker
Fluent Speaker
Non-Fluent Speaker
Non-Fluent Speaker

Note the exaggerated mouth movement in fluent speakers.

2. 3D reconstruction

CymruFluency.V10.mp4

Subject uttering Welsh phrase “Gwybodaeth angenrheidiol” (Tr. EN: Necessary information; IPA: /ˈɡʊɨ̯bɔðaɪθ aŋɛnˈhreɪ̯djɔl/)

3. Landmarking process

3D mesh

3D mesh quality and landmarking in progress.

4. Alignment

Aligning landmarks

Aligning landmarks to mitigate head movement.

Dataset

Full dataset is available on Zenodo.

The dataset is split in four parts and can be accessed through the four versions of the repository. For more information on content and structure of the dataset, please read dataset description.

Installation

  1. Clone this repo:

    git clone https://github.com/arvinsingh/CymruFluency.git
    cd CymruFluency
  2. Install dependencies:

    uv sync
  3. Launch the notebooks:

    jupyter notebook

Notebooks Overview

  • Data Exploration and Analysis.ipynb - Visualize and explore dataset stats
  • Experiment Audio Landmarks.ipynb - Train and eval unimodal models
  • Experiment Model Training.ipynb - Train and eval multimodal models
  • Welsh vs English.ipynb - Comparative study of fluency in Welsh vs English dataset

Architecture Pipeline

Architecture Pipeline.

License

This dataset is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Research purposes only.

Citation [Pending publication]

Important

If you use our dataset and code, please use the following two bibtex for citation:

@inproceedings{bali_2025_cymrufluency,
  author       = {Arvinder Pal Singh Bali and
                  Gary K. L. Tam and
                  Avishek Siris and
                  Gareth Andrews and
                  Yukun Lai and
                  Bernie Tiddeman and
                  Gwenno Ffrancon},
  title        = {CymruFluency - A fusion technique and a 4D Welsh dataset for Welsh fluency analysis},
  booktitle    = {Advanced Concepts for Intelligent Vision Systems (ACIVS)},
  year         = {2025},
  address      = {Japan},
  publisher    = {Springer (Lecture Notes in Computer Science, LNCS)},
  doi          = {TBD},
  url          = {TBD}
}
@dataset{bali_2025_dataset,
  author       = {Bali, Arvinder Pal Singh and
                  Tam, Gary KL and
                  Siris, Avishek and
                  Andrews, Gareth and
                  Lai, Yukun and
                  Tiddeman, Bernie and
                  Ffrancon, Gwenno},
  title        = {Dataset and code for "CymruFluency - A fusion technique and a 4D Welsh dataset for Welsh fluency analysis"},
  month        = may,
  year         = 2025,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.15397513},
  url          = {https://doi.org/10.5281/zenodo.15397513},
}

Acknowledgement

This research was supported by Coleg Cymraeg Cenedlaethol Small Grant 2017, Cherish-DE Escalator Fund 2019, 2021(1RR, 52E), Swansea University SPIN fund, Wales Network Innovation Small Grant 2023 and EPSRC IAA Fund 2024. We would like to thank all annotators and anonymized participants for their contributions to this project.

License: CC BY-NC 4.0

About

CymruFluency - A Fusion Technique and a 4D Welsh Dataset for Welsh Fluency Analysis

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published