This page holds tips and tricks for efficient cluster usage
Let's say you want to compress /scratch/$USER/mydataset into /scratch/$USER/mydata.zip
zip -r /scratch/$USER/mydata.zip /scratch/$USER/mydatasetYou can also use 7z:
7z a /scratch/$USER/mydata.zip /scratch/$USER/mydatasetIf this is for a large dataset, you can either run this inside tmux, or schedule a compute job to do it, for example save the following file:
#!/bin/bash
#SBATCH --account=ACCOUNT
#SBATCH --mem=32G
#SBATCH --cpus-per-task=4
#SBATCH --time=18:00:00
#SBATCH --mail-user=EMAIL
#SBATCH --mail-type=BEGIN
#SBATCH --mail-type=END
#SBATCH --mail-type=FAIL
#SBATCH --mail-type=REQUEUE
#SBATCH --mail-type=ALL
set -euo pipefail
NOW=$(date +"%m_%d_%Y_HH%I_%M")
echo "Starting compression at $NOW"
# Change what needs to be zipped
zip -r /scratch/$USER/mydata.zip /scratch/$USER/mydataset
NOW=$(date +"%m_%d_%Y_HH%I_%M")
echo "DONE at ${NOW}"         - Modify the ACCOUNT, EMAIL, and dataset fields (marked by line 'change')
- Save in zip.sh Schedule
sbatch zip.shYou'll be notified by email when it's completed.
You can use unzip or 7z
unzip myzipfile.zipor to extract to a different directory
unzip myzipfile.zip -d /scratch/$USER/outputfolderUsing 7z as an alternative:
7z x myzipfile.zip