Event-AHU/SlowFast_Event_Track

Official PyTorch implementation of "Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach"

A Slow-Fast framework for Event Stream-based Visual Object Tracking.

[Framework figure]

  • Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach, Shiao Wang, Xiao Wang*, Liye Jin, Bo Jiang*, Lin Zhu, Lan Chen, Yonghong Tian, Bin Luo, arXiv:2505.12903 [Paper]

🎯 Abstract

Existing tracking algorithms typically rely on low-frame-rate RGB cameras coupled with computationally intensive deep neural network architectures to achieve effective tracking. However, such frame-based methods inherently face challenges in achieving low-latency performance and often fail in resource-constrained environments. Visual object tracking using bio-inspired event cameras has emerged as a promising research direction in recent years, offering distinct advantages for low-latency applications. In this paper, we propose a novel Slow-Fast Tracking paradigm that flexibly adapts to different operational requirements, termed SFTrack. The proposed framework supports two complementary modes, i.e., a high-precision slow tracker for scenarios with sufficient computational resources, and an efficient fast tracker tailored for latency-aware, resource-constrained environments. Specifically, our framework first performs graph-based representation learning from high-temporal-resolution event streams, and then integrates the learned graph-structured information into two FlashAttention-based vision backbones, yielding the slow and fast trackers, respectively. The fast tracker achieves low latency through a lightweight network design and by producing multiple bounding box outputs in a single forward pass. Finally, we seamlessly combine both trackers via supervised fine-tuning and further enhance the fast tracker’s performance through a knowledge distillation strategy. Extensive experiments on public benchmarks, including FE240, COESOT, and EventVOT, demonstrate the effectiveness and efficiency of our proposed method across different real-world scenarios.

🔨 Environment

Install the environment

# Create and activate the conda environment (Python 3.10)
conda create -n sftrack python=3.10
conda activate sftrack
bash install.sh

# FlashAttention 2.7.3 wheel built for Python 3.10, PyTorch 2.3, and CUDA 11.x
wget https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.3/flash_attn-2.7.3+cu11torch2.3cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install flash_attn-2.7.3+cu11torch2.3cxx11abiFALSE-cp310-cp310-linux_x86_64.whl

# PyTorch Geometric extensions matching torch 2.3.0 + CUDA 11.8
pip install torch-scatter torch-sparse torch-cluster torch-spline-conv -f https://data.pyg.org/whl/torch-2.3.0+cu118.html
pip install torch-geometric==1.7.2
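
After installation, a quick sanity check can confirm that the key dependencies import correctly (a minimal sketch; the expected version strings follow the wheels above):

# sanity_check.py -- verify the core dependencies of the sftrack environment
import torch
import flash_attn
import torch_geometric

print("torch:", torch.__version__)                       # expected 2.3.x
print("CUDA available:", torch.cuda.is_available())
print("flash-attn:", flash_attn.__version__)             # expected 2.7.3
print("torch-geometric:", torch_geometric.__version__)   # expected 1.7.2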

Dataset and project paths can be modified by editing the following files:

Stage1/lib/train/admin/local.py  # paths about training
Stage1/lib/test/evaluation/local.py  # paths about testing

Stage2/lib/train/admin/local.py  # paths about training
Stage2/lib/test/evaluation/local.py  # paths about testing
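
For orientation, here is a minimal sketch of what such a local.py typically looks like in OSTrack-style tracking codebases; the attribute names below are illustrative, not necessarily the repository's exact fields:

# Stage1/lib/train/admin/local.py (illustrative sketch, not the exact file)
class EnvironmentSettings:
    def __init__(self):
        # Where checkpoints, logs, and tensorboard files are written.
        self.workspace_dir = '/path/to/SlowFast_Event_Track/Stage1'
        # Directory holding the pre-trained MAE ViT-Base weights.
        self.pretrained_networks = self.workspace_dir + '/pretrained_models'
        # Root of the training split of the event dataset.
        self.eventvot_dir = '/path/to/EventVOT/train'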

Train & Test

# Stage1 (run from the Stage1 directory)
bash train.sh
bash test.sh

# Stage2 (run from the Stage2 directory)
bash train.sh
bash test.sh

Download the pre-trained MAE ViT-Base weights and place them under Stage1/pretrained_models and Stage2/pretrained_models.

In the first training stage, Slow_ep0050.pth.tar and Fast_ep0050.pth.tar are obtained separately. (You can choose to train either the Slow Tracker or the Fast Tracker via the configs in "Stage1/experiments/sftrack/**.yaml".)

Then place Slow_ep0050.pth.tar and Fast_ep0050.pth.tar under Stage2/pretrained_models for supervised fine-tuning in the second training stage.
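
If you need to inspect these checkpoints before fine-tuning, a minimal sketch follows (assuming the OSTrack-style convention of storing weights under a 'net' key; verify against the actual files):

import torch

# Load a Stage1 checkpoint on the CPU and inspect its contents.
ckpt = torch.load('Stage2/pretrained_models/Slow_ep0050.pth.tar', map_location='cpu')
print(ckpt.keys())                  # e.g. dict_keys(['net', 'epoch', ...]) in OSTrack-style repos
state_dict = ckpt.get('net', ckpt)  # fall back to the raw dict if there is no 'net' key
print(len(state_dict), 'parameter tensors')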

Test FLOPs and Speed

Note: The speeds reported in our paper were tested on a single RTX 4090 GPU.
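
As a reference for reproducing such speed numbers, here is a minimal, generic timing sketch (the model and input shape are placeholders, not the repository's actual entry point):

import time
import torch

@torch.no_grad()
def measure_fps(model, x, warmup=20, iters=100):
    """Average forward-pass throughput on the current CUDA device."""
    for _ in range(warmup):      # warm up kernels / cuDNN autotuning
        model(x)
    torch.cuda.synchronize()
    start = time.time()
    for _ in range(iters):
        model(x)
    torch.cuda.synchronize()     # wait for all queued GPU work to finish
    return iters / (time.time() - start)

# Example with a placeholder module; replace with the actual tracker.
model = torch.nn.Conv2d(3, 64, 3).cuda().eval()
x = torch.randn(1, 3, 256, 256, device='cuda')
print(f'{measure_fps(model, x):.1f} FPS')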

📀 Dataset

The dataset directory (using the EventVOT dataset as an example) should be organized in the following structure:

├── EventVOT
    ├── train
        ├── recording_2022-10-10_17-28-38
            ├── img
            ├── recording_2022-10-10_17-28-38_bin
            ├── groundtruth.txt
        ├── ... 
    ├── test
        ├── recording_2022-10-10_17-28-24
            ├── img
            ├── recording_2022-10-10_17-28-24_bin
            ├── groundtruth.txt
        ├── ...
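
For orientation, a minimal sketch of iterating over this layout (assuming groundtruth.txt holds one comma-separated x,y,w,h box per frame, as in most tracking benchmarks; verify the delimiter against the actual files):

import os
import numpy as np

root = '/path/to/EventVOT/train'
for seq in sorted(os.listdir(root)):
    seq_dir = os.path.join(root, seq)
    # One bounding box (x, y, w, h) per frame; delimiter may differ per dataset.
    gt = np.loadtxt(os.path.join(seq_dir, 'groundtruth.txt'), delimiter=',')
    frames = sorted(os.listdir(os.path.join(seq_dir, 'img')))
    print(seq, gt.shape, len(frames))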

💕 Citation

If you have any questions about this work, please open an issue. If this paper helps your research, please consider giving us a star.

@misc{wang2025SFTrack,
      title={Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach}, 
      author={Shiao Wang and Xiao Wang and Liye Jin and Bo Jiang and Lin Zhu and Lan Chen and Yonghong Tian and Bin Luo},
      year={2025},
      eprint={2505.12903},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2505.12903}, 
}
