This project focuses on synthesizing VFSS (Videofluoroscopic Swallowing Study) data using StyleGAN2-ADA. The goal is to create a dataset that can be used for research on swallowing disorders. The project includes scripts and notebooks for data processing, model training, and evaluation, plus two standalone tools:

- `media_player.py`: a simple media player for playing videos.
- `create-image-dataset-from-videos.py`: a script to create an image dataset from videos.

The repository is organized as follows:
```
├── data
│   ├── videos
│   ├── images
│   ├── labels
│   └── ...
├── notebooks
│   ├── applying-stylegan-on-vfss.ipynb
│   └── create-image-dataset.ipynb
├── src
│   ├── video_tool.py
│   ├── image_tool.py
│   ├── utils.py
│   ├── create_dataset.py
│   ├── video_labels.py
│   └── ...
├── README.md
├── create-image-dataset-from-videos.py
├── media_player.py
└── environment.yml
```
The project requires:

- Python 3.12.2 or higher
- OpenCV
- Pandas
- NumPy
To set up the project:

- Clone the repository:

  ```bash
  git clone https://github.com/caioseda/data-synthesis-vfss.git
  ```

- Navigate to the project directory:

  ```bash
  cd data-synthesis-vfss
  ```

- Create the conda environment and activate it:

  ```bash
  conda env create -f environment.yml
  conda activate vfss
  ```
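If the installation succeeded, the core dependencies should import cleanly. The snippet below is a minimal check, not part of the repository:

```python
# Minimal sanity check that the core dependencies are importable.
# Illustrative snippet, not part of the repository.
import cv2
import numpy as np
import pandas as pd

print("OpenCV:", cv2.__version__)
print("NumPy:", np.__version__)
print("Pandas:", pd.__version__)
```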
To play a video, use the `media_player.py` script, specifying the video directory, video ID, start frame, and end frame. Navigate through the video with the arrow keys, pause/play with the spacebar, and press 'q' to quit. You can also toggle the display of time and frame number with 'i' and toggle the autoclose behavior with 'a'. For example (an illustrative key-handling loop is sketched after the command):
```bash
python media_player.py \
    --video_dir data/videos/ \
    --video_id 1
```
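For reference, a playback loop with this kind of keyboard control typically looks like the sketch below. This is an illustrative OpenCV example, not the actual `media_player.py` implementation; the video file name, window name, and arrow-key codes are assumptions.

```python
import cv2

# Illustrative playback loop with keyboard control (not the actual media_player.py code).
cap = cv2.VideoCapture("data/videos/1.mp4")  # hypothetical file name for video ID 1
total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
frame_idx = 0
paused = False

while True:
    cap.set(cv2.CAP_PROP_POS_FRAMES, frame_idx)
    ok, frame = cap.read()
    if not ok:
        break
    cv2.imshow("VFSS player (sketch)", frame)

    key = cv2.waitKey(30) & 0xFF
    if key == ord('q'):            # quit
        break
    elif key == ord(' '):          # toggle pause/play
        paused = not paused
    elif key == 81:                # left arrow (code varies by platform): step back
        frame_idx = max(frame_idx - 1, 0)
        paused = True
    elif key == 83:                # right arrow (code varies by platform): step forward
        frame_idx = min(frame_idx + 1, total - 1)
        paused = True
    elif not paused:               # advance to the next frame while playing
        frame_idx = min(frame_idx + 1, total - 1)

cap.release()
cv2.destroyAllWindows()
```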
To create an image dataset from videos, use the `create-image-dataset-from-videos.py` script, specifying the video directory, labels file, video ID, output directory, frame size, and dataset type (`max_constriction` or `all_frames`). For example (the frame-extraction step is sketched after the command):
```bash
python create-image-dataset-from-videos.py \
    --video-dir data/videos/ \
    --labels "data/rotulos/Frames e PAS.xlsx" \
    --video-id 1 \
    --output-dir data/images/ \
    --frame-size "(512, 512)" \
    --dataset-type all_frames
```
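The core of the `all_frames` mode amounts to reading every frame, resizing it, and writing it as an image. The sketch below is illustrative only, not the actual script; the frame-naming scheme and output layout are assumptions.

```python
import os
import cv2

def extract_frames(video_path, output_dir, frame_size=(512, 512)):
    """Read every frame of a video, resize it, and save it as a PNG (illustrative sketch)."""
    os.makedirs(output_dir, exist_ok=True)
    cap = cv2.VideoCapture(video_path)
    idx = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        frame = cv2.resize(frame, frame_size)
        cv2.imwrite(os.path.join(output_dir, f"frame_{idx:05d}.png"), frame)
        idx += 1
    cap.release()
    return idx

# Hypothetical usage for video ID 1:
n_frames = extract_frames("data/videos/1.mp4", "data/images/1", frame_size=(512, 512))
print(f"Extracted {n_frames} frames")
```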
Planned next steps:

- StyleGAN3 implementation