Expected Free Energy Minefield Demo

Left: posterior sharpening over time. Right: hidden ground-truth minefield

A robot with a noisy sensor explores minefield using Bayesian inference and expected free energy (EFE). The posterior over mines sharpens step by step as the agent balances risk and information gain.

This repo contains a simple demo of a robot exploring a noisy minefield using expected free energy (EFE) and Bayesian statistics.

The robot holds a generative model of the world (beta-bernoulli over mine presence and a noisy mine sensor). At each stage we compute the posterior mine probabilities (the risk) and take off the difference between the prior entropy and the evidence wieghted entropy of each outcome of a given move. The move with the lowest expected free energy is then selected.

The robot balances:

Epistemic gain: information gain from probing new cells.
Extrinsic risk: risk of stepping on a mine.

We consider two models:

naieve_EFE_no_revisit.ipynb — the robot samples each tile at most once. Revisits confer no additional information.
EFE_revisit.ipynb — the robot can re-probe tiles. Revisits confer additional information and yield a sharper posterior.

Written in haste.

Generative model of the minefield

Each cell (s_i \in {0,1}) (mine or no mine) has a bernoulli parameter of 0.1

The joint distribution of the grid is therefore:

Posterior update

Let a be probability of detecting a mine when there is a mine present (0.95):

Let 1-a be the probability of not detecting a mine when there is a mine present 0.05:

Let b be the probability of detecting a mine when there is no mine present (0.3)

Let 1-b be the probability of not detecting a mine when there is no mine present (0.7)

In summary the sensor has a confusion matrix:

Now we can define the likelihood of performing n mine detections with k detects and n-k non detects for a single cell given there is a mine there:

and define the likelihood of performing n mine detections with k detects and n-k non detects for a single cell given there is no mine there:

We can now use Bayes' rule with those likelihoods to form a posterior:

Substituting the likelihoods:

Expected Free Energy (EFE)

For an action (a) (moving in a particular direction compass direction N,S,W,E,NE,NW,SE,SW):

: current posterior belief
: imagined posterior after seeing (o) at cell (a)
: entropy

Entropy of a Bernoulli with parameter (p):

The agent picks the action (a^*) with lowest EFE, trading off exploration (epistemic gain) against exploitation (low risk).

Results

Brier scores for Single-visit vs. revisit vs. random

Inferred posteriors and their associated ground truths

Posterior PDFs

Left: single-visit posterior stays diffuse. Right: revisits drive probabilities sharply toward 0/1.

Boards

Ground truths of the boards.

Running the notebooks

You can view the notebooks on GitHub or run them in Colab:

naieve_EFE_no_revisit.ipynb
EFE_revisit.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
images		images
EFE_revisit.ipynb		EFE_revisit.ipynb
README.md		README.md
naieve_EFE_no_revisit.ipynb		naieve_EFE_no_revisit.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Expected Free Energy Minefield Demo

Generative model of the minefield

Posterior update

Expected Free Energy (EFE)

Results

Brier scores for Single-visit vs. revisit vs. random

Inferred posteriors and their associated ground truths

Running the notebooks

About

Uh oh!

Releases

Packages

Languages

FaizSayyid/bayesian-mine-clearance-via-free-energy

Folders and files

Latest commit

History

Repository files navigation

Expected Free Energy Minefield Demo

Generative model of the minefield

Posterior update

Expected Free Energy (EFE)

Results

Brier scores for Single-visit vs. revisit vs. random

Inferred posteriors and their associated ground truths

Running the notebooks

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages