This repository is a fork of HiDream-I1
quantized to 4 bits, allowing the full model to run in less than 16GB of VRAM.
The original repository can be found here.
HiDream-I1
is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.
We offer both the full version and distilled models. The parameter size are the same, so they require the same amount of GPU memory to run. However, the distilled models are faster because of reduced number of inference steps.
Name | Min VRAM | Steps | HuggingFace |
---|---|---|---|
HiDream-I1-Full | 16 GB | 50 | 🤗 Original / NF4 |
HiDream-I1-Dev | 16 GB | 28 | 🤗 Original / NF4 |
HiDream-I1-Fast | 16 GB | 16 | 🤗 Original / NF4 |
- GPU Architecture: NVIDIA
>= Ampere
(e.g. A100, H100, A40, RTX 3090, RTX 4090) - GPU RAM:
>= 16 GB
- CPU RAM:
>= 16 GB
Simply run:
pip install hdi1 --no-build-isolation
Note
It's recommended that you start a new python environment for this package to avoid dependency conflicts.
To do that, you can use conda create -n hdi1 python=3.12
and then conda activate hdi1
.
Or you can use python3 -m venv venv
and then source venv/bin/activate
on Linux or venv\Scripts\activate
on Windows.
Then you can run the module to generate images:
python -m hdi1 "A cat holding a sign that says 'hello world'"
# or you can specify the model
python -m hdi1 "A cat holding a sign that says 'hello world'" -m fast
Note
The inference script will try to automatically download meta-llama/Llama-3.1-8B-Instruct
model files. You need to agree to the license of the Llama model on your HuggingFace account and login using huggingface-cli login
in order to use the automatic downloader.
We also provide a web dashboard for interactive image generation. You can start it by running:
python -m hdi1.web
The code in this repository and the HiDream-I1 models are licensed under MIT License.