HiRAD-Gen stands for High-Resolution Atmospheric Downscaling using Generative models. This repository contains the code and configuration required to train and use the model.
To set up the environment for HiRAD-Gen on the Alps supercomputer, follow these steps:

- Start the PyTorch user environment:

  ```
  uenv start pytorch/v2.6.0:v1 --view=default
  ```

- Create a Python virtual environment (replace `{env_name}` with your desired environment name):

  ```
  python -m venv ./{env_name}
  ```

- Activate the virtual environment:

  ```
  source ./{env_name}/bin/activate
  ```

- Install the project dependencies:

  ```
  pip install -e .
  ```

This sets up the environment needed to run HiRAD-Gen on the Alps infrastructure.
- The script for running the training of the regression model is `src/hirad/train_regression.sh`. Inside this script, set the following:

  ```
  ### OUTPUT ###
  #SBATCH --output=your_path_to_output_log
  #SBATCH --error=your_path_to_output_error

  #SBATCH -A your_compute_group

  srun bash -c "
  . ./{your_env_name}/bin/activate
  python src/hirad/training/train.py --config-name=training_era_cosmo_regression.yaml
  "
  ```

- Set up the following config files in `src/hirad/conf`:
  - In `training_era_cosmo_regression.yaml` set:

    ```
    hydra:
      run:
        dir: your_path_to_save_training_output
    ```

  - In `training/era_cosmo_regression.yaml` set:

    ```
    hp:
      training_duration: number_of_samples_to_train_for
    ```

    Set `training_duration` to 4 for debugging; 512 fits into 30 minutes on one GPU with `total_batch_size: 4`.

  - In `dataset/era_cosmo.yaml` set the `dataset_path` if it differs from the default.
- Submit the job with:

  ```
  sbatch src/hirad/train_regression.sh
  ```

Before training the diffusion model, a checkpoint for the regression model must exist.
- The script for running the training of the diffusion model is `src/hirad/train_diffusion.sh`. Inside this script, set the following:

  ```
  ### OUTPUT ###
  #SBATCH --output=your_path_to_output_log
  #SBATCH --error=your_path_to_output_error

  #SBATCH -A your_compute_group

  srun bash -c "
  . ./{your_env_name}/bin/activate
  python src/hirad/training/train.py --config-name=training_era_cosmo_diffusion.yaml
  "
  ```

- Set up the following config files in `src/hirad/conf`:
  - In `training_era_cosmo_diffusion.yaml` set:

    ```
    hydra:
      run:
        dir: your_path_to_save_training_output
    ```

  - In `training/era_cosmo_diffusion.yaml` set:

    ```
    hp:
      training_duration: number_of_samples_to_train_for
    io:
      regression_checkpoint_path: path_to_directory_containing_regression_training_model_checkpoints
    ```

    Set `training_duration` to 4 for debugging; 512 fits into 30 minutes on one GPU with `total_batch_size: 4`.

  - In `dataset/era_cosmo.yaml` set the `dataset_path` if it differs from the default.
- Submit the job with:

  ```
  sbatch src/hirad/train_diffusion.sh
  ```

- The script for running the inference is in
`src/hirad/generate.sh`. Inside this script, set the following:

  ```
  ### OUTPUT ###
  #SBATCH --output=your_path_to_output_log
  #SBATCH --error=your_path_to_output_error

  #SBATCH -A your_compute_group

  srun bash -c "
  . ./{your_env_name}/bin/activate
  python src/hirad/inference/generate.py --config-name=generate_era_cosmo.yaml
  "
  ```

- Set up the following config files in `src/hirad/conf`:
  - In `generate_era_cosmo.yaml` set:

    ```
    hydra:
      run:
        dir: your_path_to_save_inference_output
    ```

  - In `generation/era_cosmo.yaml`, choose the inference mode:

    ```
    inference_mode: all/regression/diffusion
    ```

    By default, `all` runs both regression and diffusion. Depending on the mode, pretrained weights for the regression and/or diffusion model must be provided:

    ```
    io:
      res_ckpt_path: path_to_directory_containing_diffusion_training_model_checkpoints
      reg_ckpt_path: path_to_directory_containing_regression_training_model_checkpoints
    ```

    Finally, a subset of time steps from the dataset can be chosen for inference. One way is to list the steps under `times:` in the format `%Y%m%d-%H%M` (for the era5_cosmo dataset). The other way is to specify `times_range:` with three items: the first time step (`%Y%m%d-%H%M`), the last time step (`%Y%m%d-%H%M`), and an hour shift (int). The hour shift is the distance in hours between adjacent time steps for the specific dataset (6 for era_cosmo). By default, inference is done for the single time step `20160101-0000`.

  - In `dataset/era_cosmo.yaml` set the `dataset_path` if it differs from the default.
- Submit the job with:

  ```
  sbatch src/hirad/generate.sh
  ```
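The `times_range:` expansion described above can be sketched in Python. This is a hypothetical helper for illustration only (`expand_times_range` is not part of the repository); it shows which explicit `times:` list a given `times_range:` triple corresponds to:

```python
from datetime import datetime, timedelta

def expand_times_range(first, last, hour_shift):
    """Expand a times_range triple (first, last, hour_shift) into the
    explicit list of time steps that could equivalently be listed
    under times:, using the %Y%m%d-%H%M format."""
    fmt = "%Y%m%d-%H%M"
    t = datetime.strptime(first, fmt)
    end = datetime.strptime(last, fmt)
    steps = []
    while t <= end:
        steps.append(t.strftime(fmt))
        t += timedelta(hours=hour_shift)
    return steps

# era_cosmo has 6-hourly time steps:
print(expand_times_range("20160101-0000", "20160101-1800", 6))
# ['20160101-0000', '20160101-0600', '20160101-1200', '20160101-1800']
```

With an hour shift of 6, `times_range: [20160101-0000, 20160101-1800, 6]` therefore covers four time steps.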