Video anomaly detection (VAD) aims to identify, understand and describe anomalous events in videos. This repository collects the latest research papers, code, datasets, seminars, utilities and related resources for VAD, updated every Friday. Like this repoπ? β it and feel free to open an issue for feedback.
Note
Welcome to nominate VAD papers or related resources via pull request! Please refer to CONTRIBUTING.md for detailed guidelines.
- π Contents
- π₯ Recent Updates
- π§ͺ New Setting Papers
- π Weakly-supervised VAD Papers
- π Semi-supervised VAD Papers
- π Skeleton-based Papers
- π Fully-supervised VAD Papers
- π Surveys
- ποΈ Benchmarks
- π½ Datasets
- π§π»βπ« Seminars
- π·π»ββοΈ Evaluation Metrics
- π οΈ Utilities
- π Related Repositories
- ππ» Acknowledgements
- π Citation
Last Update: June, 2025
- ICCV '25
- CVPR '25
- ICML '25
- ICLR '25
Related Repositories:
uws4vad-wiki π₯π₯π₯
Video-Anomaly-Detection
Papers:
ADSM
SeeKer
ADRM
MA-PDM
TAO
LEC-VAD
Ο-VAD
SFN-VAD
SVTA
VAU-R1
VERA \
-
[TAO] Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline Β Β Β
CVPR '25 [paper][project][code] -
[SVTA] Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark Β Β Β
arXiv '25 [paper][project] -
[VAU-R1] Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning Β Β Β
arXiv '25 [paper][code] -
[VANE-Bench] VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs
NAACL '25 [paper][code][dataset][project] -
[Sherlock] Sherlock: Towards Multi-scene Video Abnormal Event Extraction and Localization via a Global-local Spatial-sensitive LLM
WWW '25 [paper][OpenReview] -
[SurveillanceVQA-589K] SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models
arXiv '25 [paper][annotation] -
[VERA] VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models Β Β Β
CVPR '25 [paper][code][project] -
[Holmes-VAU] Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
CVPR '25 [paper][code & annotation] -
[HAWK] HAWK: Learning to Understand Open-World Video Anomalies
NeurIPS '24 [paper][code][annotation] -
[VAR] Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model
TIP '24 [paper][dataset] -
[AnomalyRuler] Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models
ECCV '24 [paper][code] -
[UCA] Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges
CVPR '24 [paper][code & annotation][project] -
[GlanceVAD] GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection
ICME '25 [paper][code & annotation] -
[OVVAD] Open-Vocabulary Video Anomaly Detection
CVPR '24 [paper][supp] -
[LAVAD] Harnessing Large Language Models for Training-free Video Anomaly Detection
CVPR '24 [paper][code][supp] -
[CUVA] Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
CVPR '24 [paper][code & dataset][supp] -
[UCF-Crime-DVS] UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural
[paper][code & dataset] -
[TDSD] TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection
ACM MM '24 [paper][code][OpenReview]
-
[ADRM] DualβDetector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Adaptive Dynamic Recursive Mapping Β Β Β
TIIΒ '25 [paper] [code] [supp] -
[CDL] Cross-Domain Learning for Video Anomaly Detection with Limited Supervision
ECCV '24 [paper] -
[HLGAtt] Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection
CVPR '24 Workshop [paper] -
[ECU] Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection
CVPR '23 [paper][code][supp] -
[CoMo] Look Around for Anomalies: Weakly-supervised Anomaly Detection via Context-Motion Relational Learning
CVPR '23 [paper][supp] -
[ADGCN] Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
CVPR 19' [paper][code]
-
[Ο-VAD] Just Dance with
$\pi$ ! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection Β Β Β
CVPR '25 [paper] -
[LEC-VAD] Learning Event Completeness for Weakly Supervised Video Anomaly Detection Β Β Β
ICML '25 [paper] -
[Fed-WSVAD] Federated Weakly Supervised video Anomaly Detection with Multimodal Prompt
AAAI '25 [paper][code] -
[STPrompt] Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts
ACM MM '24 [paper][OpenReview] -
[Vadclip] Vadclip: Adapting vision-language models for weakly supervised video anomaly detection
AAAI '24 [paper][code] -
[PE-MIL] Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
CVPR '24 [paper][code][supp] -
[TPWNG] Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection
CVPR '24 [paper][supp] -
[PEL4VAD] Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomaly Detection
TIP '24 [paper][code]
-
[ADSM] Autoregressive Denoising Score Matching is a Good Video Anomaly Detector Β Β Β
ICCV '25 [paper][code] -
[MA-PDM] Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model Β Β Β
AAAI '25 [paper][code] -
[SFN-VAD] MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection Β Β Β
arXiv '25 [paper][project] -
[LPGB] Local Patterns Generalize Better for Novel Anomalies
ICLR '25 [paper][code][OpenReview] -
[LANP] Learning Anomalies with Normality Prior for Unsupervised Video Anomaly Detection
ECCV '24 [paper] -
[Joint-VAD] Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
ECCV '24 [paper][code] -
[SSAE] Scene-Dependent Prediction in Latent Space for Video Anomaly Detection and Anticipation
T-PAMI '24[paper][project][code][dataset] -
[DoTA] DoTA: Unsupervised Detection of Traffic Anomaly in Driving Videos
T-PAMI '23 [paper][code][dataset] -
[AED-MAE] Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors
CVPR '24 [paper][code][supp] -
[MSTL] Multi-Scale Video Anomaly Detection by Multi-Grained Spatio-Temporal Representation Learning
CVPR '24 [paper][supp] -
[MULDE] MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly Detection
CVPR '24 [paper][code][supp] -
[CLAP] Collaborative Learning of Anomalies with Privacy (CLAP) for Unsupervised Video Anomaly Detection: A New Baseline
CVPR '24 [paper][code][supp] -
[MGENet] A Multilevel Guidance-Exploration Network and Behavior-Scene Matching Method for Human Behavior Anomaly Detection
ACM MM '24 [paper][code][OpenReview] -
[MPT] Video Anomaly Detection via Progressive Learning of Multiple Proxy Tasks
ACM MM '24 [paper][OpenReview]
-
[SeeKer] Sequential keypoint density estimator:an overlooked baseline of skeleton-based video anomaly detectionΒ Β Β
ICCV '25 [paper][code] -
[GiCiSAD] Graph-Jigsaw Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
WACV '25 [paper][code] -
[PoseWatch] PoseWatch: A Transformer-based Architecture for Human-centric Video Anomaly Detection Using Spatio-temporal Pose Tokenization
arXiv '25 [paper][code] -
[DA-Flow] DA-Flow: Dual Attention Normalizing Flow for Skeleton-based Video Anomaly Detection
arXiv '24 [paper] -
[MoCoDAD] Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
ICCV '23 [paper][code][supp]
-
[EBB] Exploring Background-bias for Anomaly Detection in Surveillance Videos
ACM MM 19' [paper][annotation] -
[ALVS] ANOMALY LOCALITY IN VIDEO SURVEILLANCE
arXiv 19' [paper][project][annotation]
-
[MSAD] Advancing Video Anomaly Detection: A Concise Review and a New Dataset
NeurIPS '24 [paper][project] -
[BenchRev] Unveiling the performance of video anomaly detection models β A benchmark-based review
Intelligent Systems with Applications '23 [paper] -
[WVAD-Review] Weakly Supervised Anomaly Detection: A Survey
arXiv '23 [paper][repo] -
[VAD-10] Video Anomaly Detection in 10 Years: A Survey and Outlook
arXiv '24 [paper] -
[GNN4TS] A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection
T-PAMI '24 [paper][repo] -
[GTCNN] Graph-Time Convolutional Neural Networks: Architecture and Theoretical Analysis
T-PAMI '23 [paper]
Supervision | Method | Publication | Visual Features | STC (AUC) | UCF (AUC) | XDV (AP) | Ave (AUC) | Cor (AUC) | UBnormal (AUC) | Ped2 (AUC) | Campus (AUC) | NWPU (AUC) | NPDI (AUC) | TAD (AUC) | Audio Features | Text Prompt | LLM Involved | Data Augumentation |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Semi-supervised | LANP | ECCV '24 | I3D | 88.32 | 80.02 | - | - | - | - | - | - | - | - | - | - | - | - | - |
ResNext | 86.46 | 76.64 | - | - | - | - | - | - | - | - | - | - | - | - | - | |||
SSAE | T-PAMI '24 | - | 80.5 | - | - | 90.2 | 75.8 | - | - | - | 75.6 | - | - | - | - | - | - | |
AED-MAE | CVPR '24 | - | 79.1 | - | - | 91.3 | - | 58.5 | 95.4 | - | - | - | - | - | - | - | - | |
MSTL | CVPR '24 | I3D | 87.5 | 80.6 | - | 94.3 | - | - | - | 70.1 | - | - | - | - | - | - | - | |
MULDE | CVPR '24 | CLIP+Hiera | 81.3 | 78.5 | - | - | - | 72.8 | - | - | - | - | - | - | - | - | - | |
MGEnet | MM '24 | Video Swin | 86.9 | - | - | - | - | 74.3 | - | - | - | - | - | - | - | - | - | |
MPT | MM '24 | - | 88.6 | 83.2 | - | 94.5 | - | - | - | - | - | - | - | - | - | - | - | |
Weakly-supervised | HLGAtt | CVPR '24 Workshop | I3D | - | - | 86.34 | - | - | - | - | - | - | 99.45 | - | VGGish | - | - | - |
STPrompt | MM '24 | CLIP | 97.81 | 88.08 | - | - | - | 63.98 | - | - | - | - | - | - | β | - | - | |
ECU | CVPR '23 | I3D | - | 86.22 | 81.43 | - | - | - | - | - | - | - | 91.66 | VGGish | - | - | - | |
CoMo | CVPR '23 | I3D | 97.6 | 86.1 | 81.3 | 89.8 | - | - | - | - | - | - | - | - | - | - | - | |
Vadclip | AAAI '24 | CLIP | - | 88.02 | 84.51 | - | - | - | - | - | - | - | - | - | β | - | - | |
PE-MIL | CVPR '24 | I3D | 98.35 | 86.83 | 88.21 | - | - | - | - | - | - | - | - | VGGish | β | - | - | |
ADRM | TII '25 | MAEv2 | 97.91/97.86(Fed) | - | - | - | - | 70.91/76.51(Fed) | - | - | - | - | - | - | - | - | - |
Dataset | Download Links | Features | Additional Annotations | Publication |
---|---|---|---|---|
ShanghaiTech Campus | BaiduYun | I3D | - | CVPR 18' |
UCF-Crime | Dropbox | I3D | Full Annotation & Bounding Box Annotation | CVPR 18' |
XD-Violence | OneDrive | I3D & VGGish | - | ECCV 20' |
NWPU Campus | BaiduYun & Google Drive | - | - | CVPR '23 & T-PAMI '24 |
UBnormal | GoogleDrive | - | β | CVPR '22 |
ECVA | ModelScope | - | - | CVPR '24 |
MSAD | Request Form | Swin & I3D | - | NeurIPS '24 |
TAD | GoogleDrive | - | - | TIP '21 |
NPDI | Contact Email | - | - | CVIU 13' |
Street Scene | Page | - | - | WACV '20 |
Pornography-2k | Contact Email | - | - | FSI 16' |
* : ShanghaiTech was initially proposed as a semi-supervised VAD dataset, and Zhong etal. later introduced its weakly supervised split.
β : UCF-Crime was originally introduced as a weakly-supervised VAD dataset, and the dataset split under semi-supervision are sourced from MULDE.
β‘: NWPU Campus was initially proposed as a semi-supervised VAD dataset, and TDSD later introduced its weakly supervised split.
- Recent advances in anomaly detection.
CVPR '23 Tutorial [link]
1. AUC
Area Under Curve (AUC) of Receiver-Operating Characteristic curve (ROC) is a primary evaluation metric for VAD, measuring classification performance across all thresholds. The ROC curve plots the True Positive Rate (TPR) on the y-axis against the False Positive Rate (FPR) on the x-axis at various thresholds. AUC is computed as the integral of the ROC curve, ranging from 0 to 1, where 0.5 indicates performance approximating random guessing, and 1 denotes perfect discrimination. A higher AUC value indicates superior performance. AUC is a commonly used evaluation metric for the ShanghaiTech Campus and UCF-Crime datasets .2. AP
Average precision (AP) summarizes a Precision-Recall (PR) curve into a single value representing the average of all precisions. It is generally understood as the approximation of the area under the PR curve. AP ranges between 0 and 1, where a perfect model has precision, recall, and AP scores of 1. The larger the metric, the better a model performs across different thresholds[*].3. FAR
False Alarm Rate (FAR) with a threshold of 0.5 is evaluated to measure the reliability of detection results, proposed in UCF-Crime.[Video & Audio Feature Extraction]: video_features allows you to extract features from video clips, supporting a variety of modalities and extractors, i.e., S3D, R(2+1)d RGB, I3D-Net RGB + Flow, VGGish, CLIP.
uws4vad-wiki: a comprehensive benchmark table for VAD datasets and methods, frequently updated. Β Β Β
Video-Anomaly-Detection: a curated list of video anomaly detection papers. Β Β Β
awesome-video-anomaly-detection: an awesome collection of papers and codes for video anomaly detection, updated to CVPR '22.
WSAD: a comprehensive collection and categorization of weakly supervised anomaly detection papers.
awesome anomaly detection: a curated list of awesome anomaly detection resources, including time-series anomaly detection, video-level anomaly detection, image-level anomaly detection, last updated in November 2021.
anomaly detection resources: a comprehensive resource for anomaly detection, featuring a wide range of papers on various domains, e.g., image, time-series, financial, and social media anomaly detection. It contains only a subset of materials specifically related to video anomaly detection.
Special thanks to [@Zuble] for his valuable help and contributions to this project.
Thanks to [@inaomIIsfarell] | [sky836] | [rekkles2] for their nomination of VAD papers.
If you find this repository useful, please consider citing it:
@misc{awesome-vad,
author = {Junxi Chen},
title = {Awesome Video Anomaly Detection.},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/Junxi-Chen/Awesome-Video-Anomaly-Detection}},
year = {2024},
}