
Distributed DNN Inference on Raspberry Pi Cluster

Project: Optimizing Distributed Deep Neural Network (DNN) Inference for Low Latency and Energy Efficiency on Resource-Constrained Edge Devices.


Project Description

This research project aims to develop and evaluate novel methods for optimizing distributed deep neural network (DNN) inference to achieve low latency and energy efficiency specifically on resource-constrained Raspberry Pi 4B (4GB RAM) devices.

We start by manually splitting neural network models (layer-wise model parallelism) across multiple Raspberry Pis. Eventually, our goal is to dynamically partition these models based on real-time metrics such as device CPU load, memory usage, and network latency.
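As a concrete illustration of the manual split, the sketch below partitions a torchvision MobileNetV2 into two sequential stages. The split index and the two-device setup are illustrative assumptions, not the project's fixed configuration:

# Minimal sketch of a manual layer-wise split (the split index and
# two-stage layout are assumptions, not the project's configuration).
import torch
import torch.nn as nn
from torchvision import models

model = models.mobilenet_v2(weights="IMAGENET1K_V1")
layers = list(model.features)               # MobileNetV2 feature blocks

SPLIT_AT = 9                                # hypothetical stage boundary
stage0 = nn.Sequential(*layers[:SPLIT_AT])  # would run on Pi #0
stage1 = nn.Sequential(*layers[SPLIT_AT:],  # would run on Pi #1
                       nn.AdaptiveAvgPool2d(1),
                       nn.Flatten(),
                       model.classifier)

with torch.no_grad():
    x = torch.randn(1, 3, 224, 224)         # dummy input image
    activation = stage0(x)                  # sent over the network in practice
    logits = stage1(activation)
print(logits.shape)                         # torch.Size([1, 1000])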


Goals & Objectives

  • Develop manual and dynamic model partitioning methods.
  • Measure performance metrics per device:
    • Throughput
    • Network latency & delay
    • End-to-end inference time
    • Inference time per image/batch
    • CPU usage, frequency, memory usage
    • Accuracy
  • Optimize for low latency and minimal energy consumption.
  • Implement adaptive logic for dynamic layer assignment.

Models and Datasets

We currently use the following pretrained models and datasets:

Models:

  • MobileNetV2
  • MobileNetV3 (Small)
  • SqueezeNet (v1.1)
  • ShuffleNetV2 (x0.5)
  • EfficientNet-B0

Datasets:

  • MNIST / FashionMNIST
  • CIFAR-10 / CIFAR-100
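All of the models and datasets listed above are available through torchvision, so loading one of each is a short sketch (weight identifiers follow torchvision's current API; the resize to 224x224 for feeding CIFAR images to ImageNet-pretrained models is our assumption here):

# Sketch: load one listed model and one listed dataset via torchvision.
from torchvision import datasets, models, transforms

model = models.squeezenet1_1(weights="IMAGENET1K_V1")
model.eval()

transform = transforms.Compose([
    transforms.Resize(224),        # upscale 32x32 CIFAR images
    transforms.ToTensor(),
])
testset = datasets.CIFAR10(root="data/cifar10", train=False,
                           download=True, transform=transform)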

Project Structure

distributed-dnn-inference-pi/
├── README.md
├── requirements.txt
├── rpc_layer_split.py           # Main script for distributed inference using PyTorch RPC
├── models/                      # Pretrained models and related scripts
│   ├── mobilenetv2/
│   ├── squeezenet/
│   └── efficientnet_b0/
├── data/                        # Datasets used for inference
│   ├── cifar10/
│   └── mnist/
├── scripts/                     # Utility scripts for performance metrics and setup
│   ├── metrics_collection.py
│   └── model_evaluation/           # Evaluation scripts
│       ├── model_evaluation.py     # Main script
│       ├── custom_model_example.py # Extension example
│       ├── batch_run.sh            # Batch execution script
│       └── README.md
└── docs/                        # Documentation and instructions
    ├── SETUP.md                 # How to set up Raspberry Pi devices and environment
    └── USAGE.md                 # How to run inference and collect metrics

Setup & Installation

Detailed instructions in: docs/SETUP.md

Quick overview:

  1. Install Ubuntu 22.04 on all Raspberry Pi 4Bs.
  2. Install PyTorch and dependencies.
  3. Configure network connectivity between Pis.
  4. Clone this repository and install Python dependencies:
git clone <repo-url>
cd distributed-dnn-inference-pi
pip install -r requirements.txt

Usage

Detailed instructions in: docs/USAGE.md

Quick overview:

  • To start distributed inference (manual partition):
python rpc_layer_split.py
  • To collect performance metrics:
python scripts/metrics_collection.py
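
For context, a script like rpc_layer_split.py typically bootstraps PyTorch RPC as sketched below; the master address, worker names, and rank/world-size environment variables are illustrative assumptions, not the script's actual interface:

# Sketch of PyTorch RPC bootstrapping on each Pi (addresses, worker
# names, and env-var conventions here are illustrative).
import os
import torch.distributed.rpc as rpc

os.environ.setdefault("MASTER_ADDR", "192.168.1.10")  # hypothetical head Pi
os.environ.setdefault("MASTER_PORT", "29500")

rank = int(os.environ.get("RANK", "0"))
world_size = int(os.environ.get("WORLD_SIZE", "2"))

rpc.init_rpc(f"worker{rank}", rank=rank, world_size=world_size)
# ... issue rpc.rpc_sync()/rpc.remote() calls to run remote stages ...
rpc.shutdown()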

Performance Metrics Collected

Metric                        Description
Model                         Name of the DNN model used
Device                        Hostname or identifier of each Pi
Throughput                    Images processed per second
Network Latency               Latency between devices
End-to-End Time               Total inference pipeline time
Model Parameters              Total parameters in the model
Average Accuracy              Accuracy over the test set
Inference Time (per image)    Avg. inference time per image
Inference Time (per batch)    Avg. inference time per batch
CPU Usage & Frequency         Avg. CPU load and frequency (MHz)
Memory Usage                  Avg. RAM used during inference (%)
Batch Size                    Number of images processed per batch
Layers per Device             Layers assigned to each Pi
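
As a sketch of how the per-device CPU and memory figures above can be sampled, the snippet below uses psutil; the helper name and sampling interval are hypothetical:

# Sample per-device CPU and memory metrics with psutil (helper name
# and 1-second sampling interval are our assumptions).
import psutil

def sample_device_metrics(interval: float = 1.0) -> dict:
    freq = psutil.cpu_freq()  # may be None on some platforms
    return {
        "cpu_percent": psutil.cpu_percent(interval=interval),
        "cpu_freq_mhz": freq.current if freq else None,
        "memory_percent": psutil.virtual_memory().percent,
    }

print(sample_device_metrics())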

Developer Workflow

We use GitHub Projects to manage tasks, code reviews, and feature tracking. Contributions should follow this workflow:

  1. Create a feature branch:
git checkout -b feature/your-feature-name
  2. Commit your changes with a clear message:
git commit -m "Add feature X for Y"
  3. Push your branch and open a Pull Request (PR) on GitHub:
git push origin feature/your-feature-name
  4. Once reviewed, merge into main.

Future Work

  • Implement dynamic model partitioning based on runtime metrics.
  • Evaluate and integrate additional models and datasets.
  • Explore quantization and other optimization methods to enhance performance.
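
As one possible starting point for the dynamic-partitioning item above (purely illustrative, not the project's chosen method), contiguous layer ranges could be assigned in proportion to each Pi's spare CPU capacity:

# Heuristic sketch: split num_layers across Pis in proportion to spare
# CPU capacity (illustrative only; not the project's chosen method).
def partition_layers(num_layers: int, cpu_loads: list[float]) -> list[range]:
    spare = [max(1e-6, 1.0 - load) for load in cpu_loads]  # spare capacity
    total = sum(spare)
    counts = [round(num_layers * s / total) for s in spare]
    counts[-1] = num_layers - sum(counts[:-1])             # fix rounding drift
    bounds, start = [], 0
    for c in counts:
        bounds.append(range(start, start + c))
        start += c
    return bounds

# e.g. 19 MobileNetV2 blocks over Pis at 20% and 60% CPU load
print(partition_layers(19, [0.2, 0.6]))  # [range(0, 13), range(13, 19)]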

Contributing

  • Keep your code clean and readable.
  • Document your changes thoroughly.
  • Be sure your updates pass all tests before merging.

Contact

  • Project Lead: Name
  • Lab: CloudSys Lab at University of Texas at San Antonio

For questions or contributions, open an issue or contact the maintainers.

