Windows tool for side-by-side image & text viewing, manual caption/tag editing, and batch dataset operations. Perfect for LoRA & ML dataset prep. 100% offline, private, and easy to use.
Updated Sep 20, 2025 - Python
ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
Various scripts, mostly intended to help with model training and dataset creation
DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.
🚀 A powerful tool to automatically generate descriptive tags for image datasets using both WD Tagger and VLM, with a user-friendly web UI. Perfect for preparing training data for Stable Diffusion and LoRA.
Scripts to help easily create image pair datasets for super resolution models
Artifician is an event-driven framework designed to simplify and accelerate the process of preparing datasets for Artificial Intelligence models.
A freely licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
This repository presents a project focused on image recognition of nuts and screws using object detection techniques. The objective is to develop a model capable of accurately detecting and classifying nuts and screws in images, enabling automation and quality control in industrial settings.
This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) model.
🚀 Use this tool whenever you need to look through a huge pile of images and a file explorer is not up to the task, or when you work on a remote headless machine. It can also move files from one folder to another, creating the destination if it does not exist. Work in progress.
Dataset Forge - Your Dataset helper.
🌾 Wheat Detection using YOLO11n! 📸 Installs Ultralytics, trains on GlobalWheat2020 dataset, and detects wheat heads with bounding boxes. Includes dataset setup, model training, and inference. 🚀
Quantization Aware Training
This repository contains code for training a convolutional neural network (CNN) model to classify images.
A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XML and CoNLL-U files, linguistic processing with the Stanza pipeline, machine translation, and word alignment with the Eflomal tool.
Repo for bachelor thesis on CSGO encounter predictions
An automatic pipeline for generating high-quality datasets for TTS and ASR systems.
A Python library for building local web apps to manually classify images into custom categories - perfect for preparing ML training datasets.
ReFrame-CLI is a Python-based command-line `ImageToolKit` to streamline your image manipulation tasks. Ideal for preparing image datasets for training machine learning models, including generative AI and diffusion models.