StructuredGaussianMixtures.jl

Overview

This package implements fitting and conditional prediction for Gaussian Mixture Models (GMMs). Given a data matrix $X$ of shape $(n_{features}, n_{samples})$, the method `fit(::GMMFitMethod, X)` returns a fitted GMM as a `Distributions.MixtureModel`. Given a fitted `gmm`, the method `predict(gmm, x; input_indices=1:length(x), output_indices=length(x)+1:length(gmm))` returns a posterior GMM over the output dimensions, conditioned on observing the vector `x` at the input indices.
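For example, a minimal end-to-end sketch (the `EM(3)` constructor call is an illustrative assumption standing in for "a 3-component EM fit method"; only the `fit` and `predict` signatures above come from the package):

```julia
using StructuredGaussianMixtures, Distributions

# Illustrative data: 2 features × 1000 samples
X = randn(2, 1000)

# Fit a GMM; `EM(3)` is an assumed constructor used here for illustration
gmm = fit(EM(3), X)    # returns a Distributions.MixtureModel

# Observe feature 1 = 0.5 and obtain a posterior GMM over feature 2
posterior = predict(gmm, [0.5]; input_indices=1:1, output_indices=2:2)
```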

This package implements three `GMMFitMethod`s:

  • EM: Standard Expectation-Maximization for fitting Gaussian Mixture Models with full covariance matrices. Uses the GaussianMixtures.jl implementation, with configurable initialization methods and covariance structures.

  • PCAEM: Fits a GMM in PCA-reduced space and transforms it back to the original space, effectively learning low-rank covariance structure through dimensionality reduction.

  • FactorEM: Fits a mixture of factor analyzers [under construction]. Directly fits GMMs with covariance matrices constrained to the form $Σ = FF' + D$, where $F$ is a low-rank factor matrix and $D$ is diagonal, relying on an inner EM step for factor analysis. This method is particularly effective for high-dimensional data where the number of features exceeds the number of samples.

To perform fitting and prediction efficiently, PCAEM and FactorEM rely on an `LRDMvNormal <: Distributions.AbstractMvNormal` type that this package introduces to represent structured covariance matrices $Σ = FF' + D$ by their factor loadings and diagonal vectors.
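As a rough sketch of this representation (the `LRDMvNormal(μ, F, d)` constructor signature is an assumption for illustration, not a documented API):

```julia
using StructuredGaussianMixtures

m, r = 100, 5
μ = zeros(m)
F = randn(m, r)    # factor loadings (low-rank part)
d = ones(m)        # diagonal noise variances

# Assumed constructor: stores F and d rather than the dense m×m covariance,
# so the distribution represents Σ = F*F' + Diagonal(d) implicitly
dist = LRDMvNormal(μ, F, d)
```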

Usage

See the examples folder for example usage (note: run `] dev ..` in the examples environment to add StructuredGaussianMixtures in development mode).
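That is, from a Julia REPL started in the examples folder:

```julia
julia> ]                        # enter Pkg mode
(examples) pkg> activate .
(examples) pkg> dev ..          # add StructuredGaussianMixtures in development mode
```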

Old: Expectation-Maximization for Gaussian Mixture Models with Low-Rank Plus Diagonal Covariance

This package implements the Expectation-Maximization (EM) algorithm for fitting Gaussian Mixture Models (GMMs) under the assumption that covariance matrices are low-rank plus diagonal. This structure acts as a form of regularization and is particularly useful in high-dimensional settings where the number of features is large compared to the number of data points. By constraining the covariance matrices in this way, we balance flexibility and computational efficiency while avoiding overfitting.

We consider a dataset represented as $X \in \mathbb{R}^{n \times m}$ with $n$ data points and $m$ features, potentially with $m \gg n$. A Gaussian Mixture Model with $K$ components over elements $x^{(i)} \in \mathbb{R}^m$ is defined as:

$$p(x) = \sum_{k=1}^{K} \pi_k \mathcal{N}(x \mid \mu_k, \Sigma_k)\text{,}$$

with non-negative mixture weights $\pi_k$ that sum to one. Each Gaussian component has mean $\mu_k \in \mathbb{R}^m$ and covariance matrix of the form:

$$ \Sigma_k = \text{diag}(d_k) + L_k L_k^T\text{,} $$

where:

  • $d_k \in \mathbb{R}^m$ is the diagonal vector capturing independent feature noise,
  • $L_k \in \mathbb{R}^{m \times r}$ (with rank $r \ll m$) captures the dominant covariance structure via a low-rank factor (see the sketch just below).
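The storage saving is the point of the factored form; a generic illustration (not package code):

```julia
using LinearAlgebra

m, r = 1000, 10
d = rand(m) .+ 0.1     # positive diagonal entries (independent feature noise)
L = randn(m, r)        # low-rank factor loadings

Σ = Diagonal(d) + L * L'   # dense m×m materialization: O(m²) storage
# Keeping only d and L instead costs O(m(r+1)) storage and, via the
# Woodbury identity shown later, allows O(r³) rather than O(m³) inversion.
```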

The EM algorithm for fitting GMMs alternates between Expectation and Maximization:

  1. E-step: Compute responsibilities based on current parameters:

$$\gamma_{ik} = \frac{\pi_k \mathcal{N}(x^{(i)} \mid \mu_k, \Sigma_k)}{\sum_j \pi_j \mathcal{N}(x^{(i)} \mid \mu_j, \Sigma_j)}\text{.}$$

  2. M-step: Update parameters by maximizing the expected complete-data log-likelihood:

$$\pi_k = \frac{n_k}{n}, \quad \text{where } n_k = \sum_{i=1}^{n} \gamma_{ik}\text{,}$$

$$\mu_k = \frac{1}{n_k} \sum_{i=1}^{n} \gamma_{ik} x^{(i)}\text{,}$$

$$\Sigma_k = \frac{1}{n_k} \sum_{i=1}^{n} \gamma_{ik} (x^{(i)} - \mu_k)(x^{(i)} - \mu_k)^\top\text{.}$$
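The updates above translate directly into code; here is a compact, generic sketch of one EM iteration with full covariances (the package's structured variant replaces the dense covariance update, as described next):

```julia
using LinearAlgebra, Distributions

# One EM iteration for a K-component GMM on X (m × n).
# π: weights, μ: vector of mean vectors, Σ: vector of covariance matrices.
function em_step(X, π, μ, Σ)
    m, n = size(X)
    K = length(π)
    # E-step: responsibilities γ[i, k] (in practice, use logpdf and
    # logsumexp for numerical stability)
    γ = [π[k] * pdf(MvNormal(μ[k], Σ[k]), X[:, i]) for i in 1:n, k in 1:K]
    γ ./= sum(γ; dims=2)
    # M-step: weights, means, covariances
    nk = vec(sum(γ; dims=1))
    πnew = nk ./ n
    μnew = [X * γ[:, k] ./ nk[k] for k in 1:K]
    Σnew = map(1:K) do k
        Xc = X .- μnew[k]                         # center columns at μ_k
        Symmetric((Xc .* γ[:, k]') * Xc' ./ nk[k])
    end
    return πnew, μnew, Σnew
end
```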

Computing responsibilities requires inverting $\Sigma_k$. Since this can be prohibitively expensive in high dimensions, we exploit the structure of the covariance matrices to compute responsibilities with an $\mathcal{O}(r^3)$ matrix inversion rather than a full $\mathcal{O}(m^3)$ one. Additionally, since updating $\Sigma_k$ directly is intractable in high dimensions, the maximization step runs an inner EM loop based on factor analysis, in which $L_k$ and $d_k$ are iteratively re-estimated to approximate the full covariance structure.
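This structured inversion is standard: writing $D_k = \text{diag}(d_k)$, the Woodbury identity gives

$$\Sigma_k^{-1} = D_k^{-1} - D_k^{-1} L_k \left(I_r + L_k^\top D_k^{-1} L_k\right)^{-1} L_k^\top D_k^{-1}\text{,}$$

and the matrix determinant lemma supplies the normalizing constant:

$$\log\det \Sigma_k = \log\det\left(I_r + L_k^\top D_k^{-1} L_k\right) + \sum_{j=1}^{m} \log d_{k,j}\text{.}$$

Only the $r \times r$ matrix $I_r + L_k^\top D_k^{-1} L_k$ needs to be factorized, while $D_k^{-1}$ is an elementwise reciprocal.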

This low-rank plus diagonal structure is particularly advantageous in settings such as time-series analysis (e.g., finance), text modeling, gene expression analysis, and compressed sensing, where $m \gg n$ leads to singular or poorly conditioned full covariance estimates. Our package leverages efficient matrix decompositions and batched computations to ensure scalability, making it well-suited for large-scale, high-dimensional datasets.
