DeepSpike

Deep-Spike: Foundation Model-based Pipeline for Large-Scale Spike Sorting of Neural Activity

Overview

Spike sorting of high-resolution neural recordings is essential for understanding brain activity, but it remains challenging when multiple units are recorded, owing to overlapping spike timing, low signal-to-noise ratios, and overlapping clusters. Here, we introduce DeepSpike, a self-supervised deep learning model that automates spike sorting and overcomes key limitations of conventional methods. DeepSpike is pretrained as a general foundation model on large-scale unlabelled spiking events obtained from electrophysiological data, enabling it to generalize to new recordings without dataset-specific retraining. It uses a self-supervised autoencoder to learn robust low-dimensional spike embeddings that facilitate accurate clustering and effective noise filtering. The model is trained on a new large-scale dataset, SpikeVault-255M, consisting of 255M spiking events derived from real in vivo recordings totalling about 4560 minutes. The dataset includes 15M ground-truth spikes that were manually verified by an expert user. In our experiments on SpikeVault-255M and two public benchmark datasets, DeepSpike outperformed state-of-the-art spike sorting algorithms in both accuracy and robustness. Our results demonstrate that DeepSpike provides a scalable and generalizable solution for large-scale neural spike sorting. The SpikeVault-255M dataset and the pretrained DeepSpike model are provided for further use and development.
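
As a rough illustration of the embedding idea described above, the following sketch trains a tiny linear autoencoder on synthetic spike waveforms using NumPy only. It is not the DeepSpike architecture or training code: the function names (`make_synthetic_spikes`, `train_linear_autoencoder`), the Gaussian-shaped templates, the noise level, and all hyperparameters are assumptions made for this example.

```python
import numpy as np


def make_synthetic_spikes(n_per_unit=200, n_samples=32, seed=0):
    """Generate toy waveforms from two hypothetical units plus Gaussian noise."""
    rng = np.random.default_rng(seed)
    t = np.linspace(-1.0, 1.0, n_samples)
    # Two illustrative templates that differ in width and amplitude.
    templates = np.stack([
        -1.0 * np.exp(-(t / 0.15) ** 2),
        -0.6 * np.exp(-(t / 0.40) ** 2),
    ])
    labels = np.repeat(np.arange(2), n_per_unit)
    noise = 0.05 * rng.standard_normal((labels.size, n_samples))
    return templates[labels] + noise, labels


def train_linear_autoencoder(x, latent_dim=2, lr=0.5, epochs=300, seed=0):
    """Fit linear encoder/decoder matrices by gradient descent on reconstruction error."""
    rng = np.random.default_rng(seed)
    n, d = x.shape
    mean = x.mean(axis=0)
    xc = x - mean  # centre the waveforms before encoding
    w_enc = 0.1 * rng.standard_normal((d, latent_dim))
    w_dec = 0.1 * rng.standard_normal((latent_dim, d))
    losses = []
    for _ in range(epochs):
        z = xc @ w_enc        # low-dimensional embeddings
        err = z @ w_dec - xc  # reconstruction error
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the squared error, up to a constant factor.
        grad_dec = z.T @ err / n
        grad_enc = xc.T @ (err @ w_dec.T) / n
        w_dec -= lr * grad_dec
        w_enc -= lr * grad_enc
    return mean, w_enc, w_dec, losses


if __name__ == "__main__":
    waveforms, unit_labels = make_synthetic_spikes()
    mean, w_enc, w_dec, losses = train_linear_autoencoder(waveforms)
    embeddings = (waveforms - mean) @ w_enc
    print("embedding shape:", embeddings.shape)
    print("first and last loss:", losses[0], losses[-1])
```

The two-dimensional `embeddings` array is the kind of input a clustering step would consume. For real recordings, a nonlinear autoencoder such as the pretrained model shipped in this repository would take the place of the linear maps used here.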

Features

End-to-end spike sorting workflow
Deep learning-based feature extraction (Autoencoder, VAE)
Multiple clustering methods (GMM, DPGMM, HDBSCAN, KMeans)
Integration with SpikeInterface for standardized spike sorting and evaluation
Visualization tools for embeddings and clustering results
Support for large public datasets
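
To make the clustering options listed above more concrete, here is a minimal sketch that applies GMM, DPGMM, and KMeans to a toy set of 2-D embeddings with scikit-learn. It does not use this repository's `clustering.py`; the helper names (`make_toy_embeddings`, `cluster_embeddings`), the blob centres, and the parameter values are assumptions for illustration. HDBSCAN is left out because its import path depends on the installed scikit-learn version or on the separate `hdbscan` package.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.mixture import BayesianGaussianMixture, GaussianMixture


def make_toy_embeddings(n_per_cluster=100, seed=0):
    """Create three well-separated 2-D blobs standing in for spike embeddings."""
    rng = np.random.default_rng(seed)
    centers = np.array([[0.0, 0.0], [5.0, 5.0], [-5.0, 5.0]])
    true_labels = np.repeat(np.arange(len(centers)), n_per_cluster)
    noise = 0.3 * rng.standard_normal((true_labels.size, 2))
    return centers[true_labels] + noise, true_labels


def cluster_embeddings(embeddings, method="gmm", n_clusters=3, seed=0):
    """Assign a cluster label to every embedding with the chosen method."""
    if method == "gmm":
        model = GaussianMixture(n_components=n_clusters, random_state=seed)
    elif method == "dpgmm":
        # For the Dirichlet-process mixture, n_components is an upper bound.
        model = BayesianGaussianMixture(
            n_components=n_clusters,
            weight_concentration_prior_type="dirichlet_process",
            random_state=seed,
        )
    elif method == "kmeans":
        model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    else:
        raise ValueError(f"unknown clustering method: {method}")
    return model.fit_predict(embeddings)


if __name__ == "__main__":
    points, _ = make_toy_embeddings()
    for name in ("gmm", "dpgmm", "kmeans"):
        predicted = cluster_embeddings(points, method=name)
        print(name, "cluster sizes:", np.bincount(predicted))
```

GMM and KMeans need the number of units up front, whereas the Dirichlet-process variant can leave some components nearly empty, which may be convenient when the unit count is unknown.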

Repository Structure

clustering.py: Clustering algorithms and utilities
dataset.py: Dataset loading and preprocessing
models.py: Deep learning models (VAE)
preprocess.py: Data preprocessing functions
utils.py: Utility functions
models/: Pretrained model weights
notebooks/: Example Jupyter notebooks and analysis pipelines
tables/: Metrics and recording details for SpikeVault-255M and the public datasets

Getting Started

Clone the repository

```
git clone https://github.com/HughYau/DeepSpike.git
cd DeepSpike
```

Install dependencies

Make sure you have Python 3.8 or later, then install the required packages:
```
pip install -r requirements.txt
```

Run example notebooks

Open notebooks/deep_spike_guideline.ipynb for a step-by-step demonstration.
