Skip to content

Probabilistic extensions of Allen’s Interval Algebra for temporal reasoning under uncertainty

License

Notifications You must be signed in to change notification settings

vxrdis/allen-interval-probabilities

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Probabilities of Allen Interval Relations

MIT LicensePython 3.9Last CommitRepo SizeCode Style: Black

Trinity College Dublin Logo

A Final Year Project by Cillín Forrester
under the supervision of Dr Tim Fernando
submitted to the School of Computer Science and Statistics
in partial fulfilment of the requirements for the degree of
B.A. (Mod.) in Computer Science, Linguistics and a Language

Contents


This repository presents a Final Year Project investigating probabilistic extensions of Allen's Interval Algebra (a framework that defines 13 possible relationships between time intervals). It evaluates key hypotheses about interval relations using empirical simulation, statistical testing, and interactive visualisation.

The project integrates formal temporal logic (a system for reasoning about time using logical rules) with stochastic modelling (mathematical techniques for analysing random processes), offering an extensible framework for reasoning about time under uncertainty.

Table A     Basic Interval Composition
p before m meets o overlaps F finished by D contains s starts e equals S started by d during f finishes O overlapped by M met by P after
p before (p) (p) (p) (p) (p) (p) (p) (p) (pmosd) (pmosd) (pmosd) (pmosd) full
m meets (p) (p) (p) (p) (p) (m) (m) (m) (osd) (osd) (osd) (Fef) (DSOMP)
o overlaps (p) (p) (pmo) (pmo) (pmoFD) (o) (o) (oFD) (osd) (osd) concur (DSO) (DSOMP)
F finished by (p) (m) (o) (F) (D) (o) (F) (D) (osd) (Fef) (DSO) (DSO) (DSOMP)
D contains (pmoFD) (oFD) (oFD) (D) (D) (oFD) (D) (D) concur (DSO) (DSO) (DSO) (DSOMP)
s starts (p) (p) (pmo) (pmo) (pmoFD) (s) (s) (seS) (d) (d) (dfO) (M) (P)
e equals (p) (m) (o) (F) (D) (s) (e) (S) (d) (f) (O) (M) (P)
S started by (pmoFD) (oFD) (oFD) (D) (D) (seS) (S) (S) (dfO) (O) (O) (M) (P)
d during (p) (p) (pmosd) (pmosd) full (d) (d) (dfOMP) (d) (d) (dfOMP) (P) (P)
f finishes (p) (m) (osd) (Fef) (DSOMP) (d) (f) (OMP) (d) (f) (OMP) (P) (P)
O overlapped by (pmoFD) (oFD) concur (DSO) (DSOMP) (dfO) (O) (OMP) (dfO) (O) (OMP) (P) (P)
M met by (pmoFD) (seS) (dfO) (M) (P) (dfO) (M) (P) (dfO) (M) (P) (P) (P)
P after full (dfOMP) (dfOMP) (P) (P) (dfOMP) (P) (P) (dfOMP) (P) (P) (P) (P)

full = (pmoFDseSdfOMP) and concur = (oFDseSdfO)

Table B     Composition Frequencies
Frequency
22 (p) (P)
9 (d) (D)
7 (oFD) (osd) (DSO) (dfO)
6 (pmoFD) (pmosd) (m) (DSOMP) (dfOMP) (M)
5 (o) (O)
4 (pmo) (OMP)
3 full concur (F) (Fef) (seS) (s) (S) (f)
1 (e)

Project Overview

Allen's Interval Algebra classifies temporal relations between intervals into 13 mutually exclusive categories. While this framework captures qualitative structure, it makes no assumptions about the relative likelihood of each relation.

This project addresses that gap by simulating interval dynamics under various probabilistic assumptions (birth–death models, where intervals randomly begin and end according to specified probabilities). It tests the null hypothesis of a uniform distribution across relations and explores how composition tables behave in practice — including entropy, skew, and hypothesis alignment.

Key aims include:

  • Simulating interval systems to empirically estimate the probability of each of Allen's relations
  • Testing the Principle of Indifference and comparing to predictions from Suliman and Fernando–Vogel
  • Mapping the structure of the composition table through probabilistic chaining
  • Building an interactive dashboard for exploring simulation results
  • Visualizing entropy distributions across parameter spaces
  • Providing comprehensive statistical analysis of relation probabilities

Theoretical Motivation and Context

Allen's Interval Algebra provides a deterministic framework for reasoning about temporal relations between intervals. However, this classical approach faces fundamental limitations when applied to real-world scenarios where uncertainty is inevitable.

Deterministic temporal reasoning cannot adequately address:

  1. Inherent uncertainty in establishing precise temporal boundaries
  2. Measurement imprecision when determining interval endpoints
  3. Incomplete knowledge about temporal sequences
  4. Varying degrees of confidence in temporal assertions
  5. Decision-making under uncertainty requiring probabilistic reasoning

When modelling cognitive processes, natural language understanding, or real-time systems, these limitations become critical barriers.

A probabilistic extension of Allen's algebra allows for reasoning that accommodates uncertainty, enables robust inference from noisy data, and better reflects how humans actually reason about temporal relations.

This project specifically addresses this gap by developing an empirical foundation for probabilistic temporal reasoning that preserves the logical structure of Allen's framework while embracing uncertainty.

Research Objectives

This project seeks to answer the following questions:

  • Determine how the probabilities of Allen's interval relations vary under different stochastic assumptions (e.g., birth/death rates)
  • Assess whether empirical distributions significantly differ from a uniform distribution (Principle of Indifference)
  • Investigate how varying simulation parameters impact the composition and resultant probability distributions of interval chains
  • Develop clear, interactive visualisations and statistical analyses to facilitate deeper insights into temporal structure

Project Structure and Contents

This repository is structured to clearly delineate between core simulations, analysis tools, and documentation:

allen-interval-probabilities/
├── docs/
├── COMP_RESULTS.md
├── SIM_RESULTS.md
├── constants.py
├── intervals.py
├── relations.py
├── simulations.py
├── stats.py
├── batch_runner.py
├── sim_results.py
├── comp_runner.py
├── comp_results.py
├── requirements.txt
└── report/

Core Components

  • Simulation Engines: batch_runner.py, composition_runner.py, and simulations.py implement the stochastic interval simulation logic
  • Relation Analysis: intervals.py and relations.py define the Allen interval relations and operations
  • Statistical Analysis: stats.py provides comprehensive statistical measures including entropy, chi-square tests, KL divergence, and more
  • Report Generation: comp_results.py and sim_results.py generate formatted reports of simulation results
  • Thesis Report: The report/ directory contains LaTeX source for the formal thesis document, including chapter files for Introduction, State of the Art, and Implementation

Usage Instructions

Installation

Clone the repository and install dependencies:

git clone https://github.com/vxrdis/allen-interval-probabilities
cd allen-interval-probabilities
pip install -r requirements.txt

Running Simulations

Simulate basic interval relations empirically by running:

python batch_runner.py --trials 10000 --pValues 0.1,0.5,0.9

For advanced interval composition simulations (exploring how Allen relations chain together), use:

python composition_runner.py --trials 10000 --pBorn 0.1 --pDie 0.1

This will systematically generate all 169 possible relation pairs (13×13) and collect statistics on their compositions.

Interactive Web Dashboard

To explore the simulation results through an interactive web interface:

python app.py

The dashboard features:

  • Relation distribution visualization with comparison to theoretical models
  • Parameter space exploration with real-time simulation
  • Composition analysis for any pair of Allen relations
  • Full 13×13 composition matrix visualization
  • Statistical metrics including entropy, Gini coefficient, and JS divergence
  • Export functionality for charts, data tables, and simulation results

Local Development

  1. Create a virtual environment: python -m venv venv
  2. Activate it: source venv/bin/activate (Unix) or venv\Scripts\activate (Windows)
  3. Install dependencies: pip install -r requirements.txt
  4. Run the application: python live_simulator.py

Deployment on Render

The application is configured to deploy on Render.com with the following settings:

  • Name: allen-interval
  • Build Command: pip install -r requirements.txt
  • Start Command: gunicorn app:server
  • Instance Type: Starter or Free
  • Auto-Deploy: Enabled
  • Health Check Path: /

After deployment, the app will be available at: https://allen-interval.onrender.com/


Literature Review and Related Work

Allen's Interval Algebra, first formalised by James F. Allen (1983), provides a framework of 13 qualitative relations — such as before, meets, overlaps, and their inverses — to describe how time intervals relate.

Although widely adopted in fields like computational linguistics, artificial intelligence, and temporal logic, the original deterministic formulation does not inherently handle uncertainty or probabilistic scenarios that occur naturally in real-world temporal data.

Probabilistic Extensions

Recent research has explored probabilistic extensions of Allen's relations:

  • Fernando and Vogel (2019) challenged the assumption of uniform distribution among Allen's relations
  • They demonstrated analytically that certain temporal relations (particularly those without endpoint coincidences) inherently occur more frequently under random conditions
  • Their finite-order model predicts that as timelines become denser, exact endpoint coincidences approach zero probability
  • Specifically, under asymptotic conditions where both birth and death probabilities approach zero (p → 0, q → 0), their model predicts domination by six "long-range" relations

Suliman's Prediction

Suliman proposed that seven relations (equals, before, after, starts, started_by, meets, met_by) occur with probability 1/9, and the remaining six with 1/27.

At precisely balanced birth-death probabilities (p = 0.5 and q = 0.5), the empirical distribution closely matches this model (χ² p ≈ 0.76), offering strong support for Suliman's prediction. This balanced condition (equal probability of intervals starting and ending) creates the specific stochastic environment where Suliman's 1/9 vs 1/27 distribution holds.

Outside this balanced region, Suliman's model is rejected — suggesting its validity depends on specific temporal density regimes.

Fernando–Vogel Asymptotic Model

Fernando and Vogel predicted that as p → 0 and q → 0 (extremely low probabilities of birth and death), six long-range relations (before, after, during, contains, overlaps, overlapped_by) should dominate, each converging to a probability of ~1/6.

This pattern emerges clearly in simulations at extreme low p and q, supporting their asymptotic argument for sparse timelines.

Complementary Frameworks

Complementary approaches have appeared in temporal reasoning frameworks and NLP:

Statistical Validation

Beyond theoretical and simulation frameworks, statistical analyses have been critical for validating these probabilistic models:

  • Metrics such as entropy quantify the uncertainty inherent in relation distributions
  • Chi-square goodness-of-fit tests have evaluated model accuracy
  • Real-world datasets confirm theoretical biases — relations like "before" dominate naturally occurring timelines

Practical Applications

Practical applications of probabilistic Allen algebra are increasingly significant:

  • Temporal annotation standards like TimeML leverage these relations
  • In archaeology, tools like ArchaeoPhases use Allen's relations probabilistically to quantify uncertainty in chronological intervals

However, despite the depth of theoretical exploration, practical tools and visualisation dashboards for probabilistic Allen relations have remained limited.

This project addresses this gap directly through interactive visualisations, allowing intuitive exploration of relation probabilities, parameter sensitivity, entropy dynamics, and three-interval composition scenarios.

This literature review highlights important research gaps that the current project aims to address: dynamically derived relation probabilities, comprehensive probabilistic composition tables, deeper empirical validation, and integrated interactive tools.

Documentation and Project Reports

Detailed documentation relevant to this project can be found in the docs/ directory, structured to clearly separate project-specific documents from foundational literature. The formal thesis is being developed in the report/ directory using LaTeX.

Project Documents (docs/)

Related Work and Background Literature (docs/related-work/)

Formal Thesis (report/)

The LaTeX source for the thesis document follows Trinity College Dublin's style guidelines using the tcdthesis.sty template. The report includes:

  • Academic writing style guidance (Introduction.tex)
  • Literature review framework (StateOfTheArt.tex)
  • Implementation details (Implementation.tex)
  • Full bibliography and citations

Hypotheses and Evaluations

This project empirically tests two central hypotheses through controlled stochastic simulations.

1. Probability Distribution of Interval Relations

  • Null Hypothesis: Each of Allen's 13 interval relations is equally likely to occur (uniform distribution: 1/13).

  • Method:

    • Thousands of simulations were run with varying birth (pBorn) and death (pDie) probabilities
    • Relation counts were aggregated and normalised to estimate empirical distributions
  • Result: The null hypothesis is consistently refuted across all parameter sets.

    • In every case, certain relations (e.g. before, meets) dominate while others (e.g. equal) are rare
    • This demonstrates significant and robust deviation from the theoretical uniform baseline
  • Statistical Evidence:

    • Entropy varies substantially across parameter sets
    • Chi-squared tests consistently return p-values < 0.05
    • These confirm strong non-uniformity with high confidence

Suliman's Prediction

Suliman proposed that seven relations (equals, before, after, starts, started_by, meets, met_by) occur with probability 1/9, and the remaining six with 1/27.

At balanced probabilities (p = 0.5, q = 0.5), repeated stochastic simulations (50 runs, 5000 trials each) indicate that Suliman’s predictions hold approximately 70% of the time (mean χ² p ≈ 0.4, range: 0.15–0.8), highlighting inherent stochastic variability. This balanced condition (equal probability of intervals starting and ending) creates the specific stochastic environment where Suliman's 1/9 vs 1/27 distribution holds.

Outside this region, Suliman's model is rejected — suggesting its validity depends on specific temporal density regimes.

Fernando–Vogel Asymptotic Model

Fernando and Vogel predicted that as p → 0 and q → 0, six long-range relations (before, after, during, contains, overlaps, overlapped_by) should dominate, each converging to a probability of ~1/6.

This pattern emerges clearly in simulations at extreme low p and q, supporting their asymptotic argument for sparse timelines.

Entropy and Skew

  • Entropy is maximised when p and q are moderate (e.g. 0.5)
  • Entropy is minimised when intervals are short and rare
  • The PMS/OD ratio (precedes, meets, starts vs. overlaps, during) grows extremely large in sparse regimes — e.g. over 2000×
  • This confirms strong structural skew in relation types when intervals are brief or rarely instantiated

2. Composition Table Probabilities

  • Null Hypothesis: All entries in Allen's interval composition table are equally probable under random chains (e.g. xRy ∧ yRz ⇒ x?z).

  • Method:

    • Simulations of triple-interval chains estimate the frequency of inferred compositions
    • For each pair of input relations (r1, r2), we track the empirical distribution over resulting compositions r3
    • Results are collected in COMP_RESULTS.md showing the probability distribution for each composition
  • Result: The composition table exhibits significant structural bias:

    • Certain compositions consistently result in specific relations with high probability
    • Other compositions show high entropy with diverse possible outcomes
    • The empirical distributions often deviate from theoretical predictions
  • Statistical Evidence:

    • Normalized entropy values quantify uncertainty in composition outcomes
    • Chi-squared tests confirm non-uniformity of relation distributions
    • Composition frequencies show clear patterns (as visualized in Table B)

Visualisation and Interaction

To effectively communicate findings, interactive visualisations and comprehensive reports are generated, allowing intuitive exploration of interval relation probabilities and compositional logic.

Interactive Dashboard

The project includes a web-based dashboard with three main components:

  1. Relation Simulation: Visualizes the probability distribution of Allen relations under specific birth/death parameters

    • Real-time statistical analysis with entropy, Gini coefficient, and JS divergence measures
    • Comparison with theoretical models (Uniform, Fernando-Vogel, and Suliman)
    • Parameter space exploration through an entropy heatmap
  2. Composition Analysis: Explores the probabilities of resulting relations when two Allen relations are composed

    • Visual representation of composition outcomes with percentage and fraction approximations
    • Statistical breakdown of composition frequency and distribution
    • Reference tables for Allen relation compositions
  3. Matrix Analysis: Provides a comprehensive view of all 169 possible relation compositions

    • Interactive heatmap visualization of the full composition matrix
    • Cell-level detailed breakdown of composition results
    • Global statistics across all compositions

Report Types

  • SIM_RESULTS.md: Detailed statistical breakdown of relation probabilities under different parameter settings

    • Includes entropy measures, Gini coefficients, and Jensen-Shannon divergence comparisons
    • Provides best-fit theoretical model comparisons (uniform, Suliman, Fernando-Vogel)
    • Presents distribution visualisations in table format with normalized frequency values
    • Includes parameter sensitivity analysis across different birth/death probability combinations
  • COMP_RESULTS.md: Comprehensive analysis of composition probabilities

    • Shows empirical distributions for all 169 possible relation compositions
    • Includes approximate fraction representations for easier interpretation
    • Highlights high-entropy vs. deterministic compositions
    • Provides sorted tables of most common composition patterns

Future Work

Current and upcoming developments include:

Entropy and Information-Theoretic Analysis ✓

  • Integration of entropy measures to quantify the uncertainty in observed distributions
  • Comparisons against baseline distributions (e.g., uniform, theoretical priors)
  • KL divergence and JS divergence measures for distribution comparison

Probabilistic Composition Mining ✓

  • Extending simulations to estimate the empirical probabilities of inferred relations
  • Analysing how compositions vary under different stochastic assumptions
  • Mapping probability flows through composition chains

Visualisation Enhancements ✓

  • Interactive dashboards with parameter space exploration ✓
  • Entropy heatmaps across birth-death probability parameters ✓
  • Comparative visualisations of theoretical vs. empirical distributions ✓
  • Matrix visualization for all 169 composition pairs ✓
  • Enhanced styling and UI improvements for better user experience ✓

Statistical Evaluation ✓

  • Chi-squared tests to assess goodness-of-fit to theoretical distributions
  • Jensen-Shannon and KL divergence measures for distribution comparisons
  • Sensitivity analyses to identify critical parameters

Applications in Natural Language Processing

  • Integrating with temporal expression extraction systems
  • Applying probabilistic reasoning to ambiguous temporal references in text
  • Developing hybrid neural-symbolic approaches for temporal understanding
  • Creating annotation tools for temporal relation labeling

Cloud Deployment and API Services

  • Providing cloud-hosted interactive dashboards for wider accessibility
  • Developing an API for programmatic access to simulation results
  • Creating a public dataset of simulation results across parameter spaces

Citation

If you reference or build upon this work, please cite:

Cillín Forrester, Probabilities of Allen Interval Relations. Final Year Project, B.A. (Mod.) in Computer Science, Linguistics and a Language. Trinity College Dublin, The University of Dublin (2025). https://github.com/vxrdis/allen-interval-probabilities

@misc{forrester2025allen,
  author       = {Cillín Forrester},
  title        = {Probabilities of Allen Interval Relations},
  year         = {2025},
  institution  = {Trinity College Dublin, The University of Dublin},
  howpublished = {\url{https://github.com/vxrdis/allen-interval-probabilities}},
  note         = {Final Year Project (B.A. (Mod.) in Computer Science, Linguistics and a Language)}
}

References

  • Allen, J.F. (1983). Maintaining Knowledge about Temporal Intervals. Communications of the ACM, 26(11), 832–843. DOI:10.1145/182.358434.

  • Alspaugh, T.A. (2019). Allen's Interval Algebra. Retrieved from thomasalspaugh.org.

  • Badaloni, S., & Giacomin, M. (2006). A fuzzy extension of Allen's interval algebra. Artificial Intelligence, 170(10), 872–908. DOI:10.1016/j.artint.2006.04.001.

  • Fernando, T., & Vogel, C. (2019). Prior Probabilities of Allen Interval Relations over Finite Orders. In Proceedings of the NLPinAI Workshop, part of the 11th International Conference on Agents and Artificial Intelligence (ICAART 2019), Prague, Czech Republic. PDF.

  • Jurafsky, D., & Martin, J.H. (2025). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition with Language Models (3rd ed., draft). Chapter 20: Information Extraction — Relations, Events, and Time. Retrieved from web.stanford.edu/~jurafsky/slp3.

  • Mahowald, K., Ivanova, A., Kean, H., Thompson, B., Gibson, E., & Fedorenko, E. (2024). Dissociating language and thought in large language models. Trends in Cognitive Sciences, 28(4), 319–334. DOI:10.1016/j.tics.2024.01.011.

  • Petridis, S., Paliouras, G., & Perantonis, S.J. (2010). Allen's hourglass: Probabilistic treatment of interval relations. NCSR "Demokritos", Greece. PDF.

  • Petrukhin, P. (2024). FSMs and Timeline Granularity in Interval Reasoning. MCS Dissertation, Trinity College Dublin. PDF.

  • Santos, P., & Young, R. (1999). Probabilistic temporal networks: A unified framework for reasoning with time and uncertainty. International Journal of Approximate Reasoning, 20(3), 263–291. DOI:10.1016/S0888-613X(99)00009-2.

  • Suliman, M. (2021). Finite Temporality: A Probabilistic Approach to Interval Relations. Undergraduate Thesis, Trinity College Dublin. PDF.