🛰️ Beta-VAE for Hyperspectral Unmixing

Hyperspectral unmixing using Variational Autoencoders with Dirichlet latent distributions, achieving state-of-the-art performance on benchmark datasets.

📘 Overview

Hyperspectral unmixing decomposes each pixel spectrum into pure material spectra (endmembers) and their fractional abundances.
This work introduces a Beta-VAE architecture leveraging the Dirichlet distribution’s natural simplex constraint to model abundance maps probabilistically and physically consistently.

🌈 Hyperspectral Unmixing Problem

Linear Mixing Model

We assume a linear mixture of $M$ endmembers across $N$ pixels and $L$ spectral bands:

$$ \mathbf{Y} = \mathbf{E}\mathbf{A} + \mathbf{N} $$

where:

$\mathbf{Y} \in \mathbb{R}^{L\times N}$ — observed hyperspectral data
$\mathbf{E} \in \mathbb{R}^{L\times M}$ — endmember matrix
$\mathbf{A} \in \mathbb{R}^{M\times N}$ — abundance matrix
$\mathbf{N} \in \mathbb{R}^{L\times N}$ — additive noise

Constraints

Each pixel’s abundances must satisfy:

$$ a_{m,n} \ge 0 \quad \forall m,n $$

$$ \sum_{m=1}^{M} a_{m,n} = 1 \quad \forall n $$

so that $\mathbf{A}$ lies on a unit simplex.

State-of-the-Art Methods

Most existing approaches fail to jointly:

enforce sum-to-one naturally,
estimate endmembers and abundances together,
handle spectral variability robustly.

Proposed Method: Dirichlet-VAE

Key Idea — Dirichlet Latent Space

We adopt a Dirichlet latent prior:

$$ \text{Dir}(\boldsymbol{\alpha}) = \frac{\Gamma\left(\sum_i \alpha_i\right)}{\prod_i \Gamma(\alpha_i)} \prod_i x_i^{\alpha_i - 1} $$

where $\boldsymbol{\alpha} = [\alpha_1, \dots, \alpha_M]$ are concentration parameters.
This prior ensures that sampled abundances naturally satisfy the simplex constraints.

Architecture

Encoder

$$ \mathbf{h} = \text{Encoder}(\mathbf{x}), \qquad \boldsymbol{\alpha} = \text{Softplus}(\text{MLP}(\mathbf{h})) + 1 $$

Latent Sampling

$$ \mathbf{a} \sim \text{Dir}(\boldsymbol{\alpha}) \quad \text{(training)}, \qquad \mathbf{a} = \frac{\boldsymbol{\alpha}}{\sum_i \alpha_i} \quad \text{(inference)} $$

Decoder

$$ \hat{\mathbf{x}} = \mathbf{E}^\top \mathbf{a} $$

where $\mathbf{E}$ is a learnable endmember matrix.

Loss Function

The overall loss combines spectral reconstruction and Dirichlet regularization:

1️⃣ Reconstruction Loss

$$ \mathcal{L}_{\text{recon}} = \text{SAD}(\mathbf{x}, \hat{\mathbf{x}}) + 0.1\text{MSE}(\mathbf{x}, \hat{\mathbf{x}}) $$

where

$$ \text{SAD}(\mathbf{x}, \hat{\mathbf{x}}) = \frac{\arccos\left(\frac{\langle \mathbf{x}, \hat{\mathbf{x}} \rangle}{|\mathbf{x}|_2 |\hat{\mathbf{x}}|_2}\right)}{\pi} $$

2️⃣ KL Divergence

Regularization towards a uniform Dirichlet $\text{Dir}(\mathbf{1})$:

$$ \mathcal{L}_{\text{KL}} = \log \Gamma\left(\sum_i \alpha_i\right) - \sum_i \log \Gamma(\alpha_i) - \log \Gamma(M) + \sum_i (\alpha_i - 1)\left[\psi(\alpha_i) - \psi\left(\sum_j \alpha_j\right)\right] $$

3️⃣ Total Objective

$$ \mathcal{L} = \lambda_{\text{recon}} , \mathcal{L}_{\text{recon}} + \lambda_{\text{KL}} \mathcal{L}_{\text{KL}} $$

🧮 Training Details

Optimizer: AdamW (weight decay)
Regularization: KL annealing
Stability: Gradient clipping
Scheduling: ReduceLROnPlateau
Early stopping for convergence

📊 Evaluation Metrics

Metric	Purpose
SAM (°)	Endmember similarity
RMSE / MAE	Abundance accuracy
SAD	Spectral reconstruction quality

📈 Results

Dataset	Method	MSE Reconstruction	Endmembers similarity: Mean cosine	Spatial coherence of abundance maps (lower is smoother): TV mean
Samson	Dirichlet-VAE	4.85e-3	0.99	4.69e-2

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
__pycache__		__pycache__
dataset		dataset
README.md		README.md
abundances.png		abundances.png
algorithms.py		algorithms.py
endmembers.png		endmembers.png
loss.py		loss.py
model.py		model.py
script.py		script.py
trainer.py		trainer.py
training.ipynb		training.ipynb
utils.py		utils.py
vae_weights.pth		vae_weights.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🛰️ Beta-VAE for Hyperspectral Unmixing

📘 Overview

🌈 Hyperspectral Unmixing Problem

Linear Mixing Model

Constraints

State-of-the-Art Methods

Proposed Method: Dirichlet-VAE

Key Idea — Dirichlet Latent Space

Architecture

Loss Function

1️⃣ Reconstruction Loss

2️⃣ KL Divergence

3️⃣ Total Objective

🧮 Training Details

📊 Evaluation Metrics

📈 Results

About

Uh oh!

Releases

Packages

Languages

smsk-01/Hyperspectral-Source-Unmixing-with--VAE

Folders and files

Latest commit

History

Repository files navigation

🛰️ Beta-VAE for Hyperspectral Unmixing

📘 Overview

🌈 Hyperspectral Unmixing Problem

Linear Mixing Model

Constraints

State-of-the-Art Methods

Proposed Method: Dirichlet-VAE

Key Idea — Dirichlet Latent Space

Architecture

Loss Function

1️⃣ Reconstruction Loss

2️⃣ KL Divergence

3️⃣ Total Objective

🧮 Training Details

📊 Evaluation Metrics

📈 Results

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages