Hyperspectral unmixing using Variational Autoencoders with Dirichlet latent distributions, achieving state-of-the-art performance on benchmark datasets.
Hyperspectral unmixing decomposes each pixel spectrum into pure material spectra (endmembers) and their fractional abundances.
This work introduces a Beta-VAE architecture leveraging the Dirichlet distribution’s natural simplex constraint to model abundance maps probabilistically and physically consistently.
We assume a linear mixture of
where:
-
$\mathbf{Y} \in \mathbb{R}^{L\times N}$ — observed hyperspectral data -
$\mathbf{E} \in \mathbb{R}^{L\times M}$ — endmember matrix -
$\mathbf{A} \in \mathbb{R}^{M\times N}$ — abundance matrix -
$\mathbf{N} \in \mathbb{R}^{L\times N}$ — additive noise
Each pixel’s abundances must satisfy:
so that
Most existing approaches fail to jointly:
- enforce sum-to-one naturally,
- estimate endmembers and abundances together,
- handle spectral variability robustly.
We adopt a Dirichlet latent prior:
where
This prior ensures that sampled abundances naturally satisfy the simplex constraints.
Encoder
Latent Sampling
Decoder
where
The overall loss combines spectral reconstruction and Dirichlet regularization:
where
Regularization towards a uniform Dirichlet
- Optimizer: AdamW (weight decay)
- Regularization: KL annealing
- Stability: Gradient clipping
- Scheduling: ReduceLROnPlateau
- Early stopping for convergence
| Metric | Purpose |
|---|---|
| SAM (°) | Endmember similarity |
| RMSE / MAE | Abundance accuracy |
| SAD | Spectral reconstruction quality |
| Dataset | Method | MSE Reconstruction | Endmembers similarity: Mean cosine | Spatial coherence of abundance maps (lower is smoother): TV mean |
|---|---|---|---|---|
| Samson | Dirichlet-VAE | 4.85e-3 | 0.99 | 4.69e-2 |


