Here is a curated list of Split Computing & Early Exit papers and resources.
🚧 This is an active repository; you can watch for the latest advances.
The ✨ indicates that this is an article that I co-authored.
If you find it useful, please kindly star ⭐ this repository.
[2025-05-18]: Made this repository public.
This repository is maintained by Luigi Capogrosso (luigi.capogrosso@univr.it). I welcome feedback, suggestions, and contributions that can help improve this repository to make it a valuable resource that benefits the entire community.
I will actively maintain this repository by incorporating new research as it emerges. If you have any suggestions regarding the taxonomy, find any missed papers, or update any pre-print arXiv paper that has been accepted to some venue, please email me or submit a pull request using the following markdown format.
Paper Title, <ins>Conference/Journal/Preprint</ins>, [[article](link)], [[other resources](link)].
Articles are sorted by year, from most recent to oldest.
2025 | 2024 | 2023 | 2022 | 2021 | 2020 | 2019 | 2018 | 2017 | 2016
-
✨ LO-SC: Local-Only Split Computing for Accurate Deep Learning on Edge Devices, International Conference on VLSI Design (VLSID), [article], [website].
-
A Multi-task Supervised Compression Model for Split Computing, Winter Conference on Applications of Computer Vision (WACV), [article].
-
✨ Controllers for Edge-Cloud Cyber-Physical Systems, International Conference on COMmunication Systems and NETworks (COMSNETS), [article], [website].
-
LVMScissor: Split and Schedule Large Vision Model Inference on Mobile Edges via Salp Swarm Algorithm, IEEE Transactions on Mobile Computing, [article].
-
Request Deadline Split and Interference-Aware Request Migration in Edge Cloud, Concurrency and Computation: Practice and Experience, [article].
-
✨ MTL-Split: Multi-Task Learning for Edge Devices using Split Computing, Design Automation Conference (DAC), [article], [website].
-
Neuromorphic Split Computing With Wake-Up Radios: Architecture and Design via Digital Twinning, IEEE Transactions on Signal Processing, [article].
-
✨ Enhancing Split Computing and Early Exit Applications through Predefined Sparsity, Forum on Specification & Design Languages (FDL), [article], [website].
-
Enhancing the Reliability of Split Computing Deep Neural Networks, International Symposium on On-Line Testing and Robust System Design (IOLTS), [article].
-
✨ Learning-Enabled CPS for Edge-Cloud Computing, International Symposium on Industrial Embedded Systems (SIES), [article], [website].
-
Smart Split: Leveraging TinyML and Split Computing for Efficient Edge AI, Symposium on Edge Computing (SEC), [article].
-
Split DNN Inference for Exploiting Near-Edge Accelerators, International Conference on Edge Computing and Communications (EDGE), [article].
-
A novel middleware for adaptive and efficient split computing for real-time object detection, Pervasive and Mobile Computing, [article].
-
Fast and fair split computing for accelerating deep neural network (DNN) inference, ICT Express, [article].
-
✨ Split-Et-Impera: A Framework for the Design of Distributed Deep Learning Applications, International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS), [article], [website].
-
Dynamic Split Computing for Efficient Deep EDGE Intelligence, International Conference on Acoustics, Speech and Signal Processing (ICASSP), [article].
-
PrivyNAS: Privacy-Aware Neural Architecture Search for Split Computing in Edge–Cloud Systems, IEEE Internet of Things Journal, [article].
-
SplitEE: Early Exit in Deep Neural Networks with Split Computing, Third International Conference on AI-ML Systems (AIMLSystems), [article].
-
SplitBeam: Effective and Efficient Beamforming in Wi-Fi Networks Through Split Computing, International Conference on Distributed Computing Systems (ICDCS), [article].
-
Distributed Split Computing System in Cooperative Internet of Things (IoT), IEEE Access, [article].
-
DNN Split Computing: Quantization and Run-Length Coding are Enough, Global Communications Conference, [article].
-
Efficient Communication-Computation Tradeoff for Split Computing: A Multi-Tier Deep Reinforcement Learning Approach, Global Communications Conference, [article].
-
SC2 Benchmark: Supervised Compression for Split Computing, Transactions on Machine Learning Research, [article].
-
✨ I-SPLIT: Deep Network Interpretability for Split Computing, International Conference on Pattern Recognition (ICPR), [article], [website].
-
BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split Computing, International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), [article].
-
Supervised Compression for Resource-Constrained Edge Computing Systems, Winter Conference on Applications of Computer Vision (WACV), [article].
-
Packet-Loss-Tolerant Split Inference for Delay-Sensitive Deep Learning in Lossy Wireless Networks, Conference on Computer Vision and Pattern Recognition (CVPR), [article].
-
A Splittable DNN-Based Object Detector for Edge-Cloud Collaborative Real-Time Video Inference, International Conference on Advanced Video and Signal Based Surveillance (AVSS), [article].
-
Neural Compression and Filtering for Edge-assisted Real-time Object Detection in Challenged Networks, International Conference on Pattern Recognition (ICPR), [article].
-
Cut, Distil and Encode (CDE): Split Cloud-Edge Deep Inference, International Conference on Sensing, Communication, and Networking (SECON), [article].
-
Optimal Branch Location for Cost-effective Inference on Branchynet, International Conference on Big Data (Big Data), [article].
-
Packet-Loss-Tolerant Split Inference for Delay-Sensitive Deep Learning in Lossy Wireless Networks, Global Communications Conference (GLOBECOM), [article].
-
A Probabilistic Re-Intepretation of Confidence Scores in Multi-Exit Models, Entropy, [article].
-
Zero Time Waste: Recycling Predictions in Early Exit Neural Networks, A dvances in Neural Information Processing Systems (NeurIPS), [article].
-
Head Network Distillation: Splitting Distilled Deep Neural Networks for Resource-Constrained Edge Computing Systems, IEEE Access, [article].
-
Split Computing for Complex Object Detectors: Challenges and Preliminary Results, International Workshop on Embedded and Mobile Deep Learning, article].
-
BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems, International Conference on Communications Workshops (ICC Workshops), [article].
-
Resolution Adaptive Networks for Efficient Inference, Conference on Computer Vision and Pattern Recognition (CVPR), [article].
-
Fast and Accurate Streaming CNN Inference via Communication Compression on the Edge, International Conference on Internet-of-Things Design and Implementation (IoTDI), [article].
-
Joint Device-Edge Inference over Wireless Links with Pruning, International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), [article].
-
SPINN: synergistic progressive inference of neural networks over device and cloud, Annual International Conference on Mobile Computing and Networking (MobiCom), [article].
-
Deep compressive offloading: speeding up neural network inference by trading edge computation for network latency, Conference on Embedded Networked Sensor Systems (SenSys), [article].
-
Lightweight Compression Of Neural Network Feature Tensors For Collaborative Intelligence, International Conference on Multimedia and Expo (ICME), [article].
-
CRIME: Input-Dependent Collaborative Inference for Recurrent Neural Networks, IEEE Transactions on Computers, [article].
-
Back-And-Forth Prediction for Deep Tensor Compression, International Conference on Acoustics, Speech and Signal Processing (ICASSP), [article].
-
Depth-Adaptive Transformer, International Conference on Learning Representations (ICLR), [article].
-
Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference, IEEE Journal of Selected Topics in Signal Processing, [article].
-
Early Exit or Not: Resource-Efficient Blind Quality Enhancement for Compressed Images, European Conference on Computer Vision(ECCV), [article].
-
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services, IEEE Transactions on Mobile Computing, [article].
-
BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services, International Symposium on Low Power Electronics and Design (ISLPED), [article].
-
Distilled Split Deep Neural Networks for Edge-Assisted Real-Time Systems, Workshop on Hot Topics in Video Analytics and Intelligent Edges, [article].
-
DynExit: A Dynamic Early-Exit Strategy for Deep Residual Networks, International Workshop on Signal Processing Systems (SiPS), [article].
-
Boomerang: On-Demand Cooperative Deep Neural Network Inference for Edge Intelligence on the Industrial Internet of Things, IEEE Network, [article].
-
Exploiting Energy-Accuracy Trade-off through Contextual Awareness in Multi-Stage Convolutional Neural Networks, International Symposium on Quality Electronic Design (ISQED), [article].
-
Distillation-Based Training for Multi-Exit Architectures, International Conference on Computer Vision (ICCV), [article].
-
Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge, Artificial Neural Networks and Machine Learning (ICANN), [article].
-
Learning IoT in Edge: Deep Learning for the Internet of Things with Edge Computing, IEEE Network, [article].
-
Computation Offloading for Machine Learning Web Apps in the Edge Server Environment, International Conference on Distributed Computing Systems (ICDCS), [article].
-
Deep Feature Compression for Collaborative Object Detection, International Conference on Image Processing (ICIP), [article].
-
Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge, ACM SIGARCH Computer Architecture News, [article].
-
A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing, International Conference on Computer Design (ICCD), [article].
-
Distributed Deep Neural Networks Over the Cloud, the Edge and End Devices, International Conference on Distributed Computing Systems (ICDCS), [article].
- BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks, International Conference on Pattern Recognition (ICPR), [article].