This repository provides a fully reproducible pipeline for analyzing epistemic disruption signals in scientific preprints using a triplet classification model based on Thomas Kuhn's theory of scientific revolutions. The tool was developed as part of the Preprint Watch, a KNOWDYN sponsored project, and is designed to detect, classify, and visualize paradigm-shifting contributions within preprint datasets. Visit the project website to scan preprints for paradigm shifts in real time: https://preprintwatch.com
Each preprint is classified according to an epistemic triplet:
- N: Novelty stage
- M: Method stage
- P: Paradigm stage
Each stage is mapped onto ordinal Kuhnian cycle values:
1
: Normal Science2
: Model Drift3
: Crisis4
: Model Revolution5
: Paradigm Shift
Each preprint is represented as a vector in a hybrid semantic-metric space:
Where:
-
$$D_k$$ is the assigned discipline -
$$k$$ in$${N, M, P}$$ $$N, M, P \in \{1, 2, 3, 4, 5\}$$
The triplet combinations are validated against 48 epistemic scenarios introduced in the 2020 arXiv paper "A Novel Kuhnian Ontology for Epistemic Classification of STM Scholarly Articles ". URL: https://arxiv.org/abs/2002.03531
These scenarios are generated from the full cross-product:
However, only 48 of these are considered epistemically valid. Each triplet encodes a coherent relationship among:
- The epistemic function of a method (e.g., new vs. refined)
- The novelty claim of empirical results
- The Kuhnian stage of the underlying conceptual model or paradigm
For example:
- Scenario 1: (M1, N1, P1) → "Existing method confirms known result under normal science"
- Scenario 48: (M3, N3, P5) → "Novel method produces new observation triggering a paradigm shift"
The epistemic signal intensity is computed from the Paradigm stage:
To capture disciplinary nonlinearity (multdisciplinary and transdisciplinary research), the cross-entropy of the triplet disciplines is computed:
Where
A composite epistemic signal strength is defined as:
This metric accounts for both disruptive potential and cross-disciplinary entropy.
The final visualization is a spectrogram-style heatmap where:
- X-axis: Discipline × Role (e.g.,
Biology:M
,AI:P
) - Y-axis: Preprints (sorted by signal intensity + entropy)
- Color: Kuhnian stage from Normal (P1) to Shift (P5), using a Baby Blue → Cool Orange palette
- Load and run the script
- Upload your Preprint Watch signal file (CSV format, freely available upon request from ipcontrol@knowdyn.co.uk)
- Let the script analyze and visualize the data
- Extract final plots and metrics for reporting or publication
This project is maintained by KNOWDYN for the Preprint Watch project. For scientific use only.