Boost UAV-based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning
Fan Liu (刘凡) ✉, Liang Yao (姚亮), Chuanyi Zhang (张传一) ✉, Ting Wu (吴婷), Xinlei Zhang, Xiruo Jiang, Jun Zhou
- 2025/04/23: Our paper has been accepted by IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS)!
- 2024/01/17: We propose a Scale-Invariant Feature Disentanglement and Adversarial Learning method for UAV-based object detection (UAV-OD). Code and models will be open-sourced in this repository.
- Clone this repo:
    git clone https://github.com/1e12Leon/SIFDAL.git
- Create a conda virtual environment and activate it:
    conda create -n SIFDAL python=3.8 -y
    conda activate SIFDAL
- Install CUDA Toolkit 11.3 and cudnn==8.2.1, then install PyTorch==1.10.1:
    conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch -y
    # if you prefer other cuda versions, please choose suitable pytorch versions
    # see: https://pytorch.org/get-started/locally/
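Once the environment is set up, a quick check can confirm that the installed versions match the ones listed above. The snippet below is a minimal sketch and not part of this repository: the helper name `report_environment` is hypothetical, and it only reads version attributes that PyTorch itself exposes.

```python
import sys


def report_environment():
    """Collect the Python, PyTorch, CUDA and cuDNN versions seen by this interpreter.

    Hypothetical helper for illustration; it is not part of the SIFDAL code base.
    """
    info = {
        "python": "%d.%d" % sys.version_info[:2],
        "torch": None,
        "cuda": None,
        "cudnn": None,
        "gpu_available": False,
    }
    try:
        import torch
    except ImportError:
        # PyTorch is not installed in the active environment.
        return info
    info["torch"] = torch.__version__
    info["cuda"] = torch.version.cuda  # None for CPU-only builds
    info["cudnn"] = torch.backends.cudnn.version()  # None if cuDNN is unavailable
    info["gpu_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in report_environment().items():
        print(f"{key}: {value}")
```

If the versions named above were installed, `torch` should report 1.10.1 and `cuda` should report 11.3. A `gpu_available` value of `False` usually points to a driver or CUDA version mismatch.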
This repository expects object detection data in VOC format, organized as follows:
VOCdevkit
└───VOC2007
    ├───ImageSets
    │   └───Main
    │           test.txt
    │           train.txt
    │           trainval.txt
    │           val.txt
    ├───JPEGImages
    │       xxx1.jpg
    │       xxx2.jpg
    │       ...
    └───Annotations
            xxx1.xml
            xxx2.xml
            ...
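To make the layout above concrete, the following sketch checks that every image ID listed in a split file has a matching image and annotation, and reads the bounding boxes from one annotation file. It is an illustration under assumptions, not code from this repository: the helper names `find_missing_files` and `read_boxes` are hypothetical, and the XML fields (`object`, `name`, `bndbox`, `xmin`, `ymin`, `xmax`, `ymax`) are assumed to follow the standard Pascal VOC annotation schema.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def find_missing_files(voc_root, split="train"):
    """Return IDs from ImageSets/Main/<split>.txt that lack a .jpg or .xml file."""
    root = Path(voc_root)
    split_file = root / "ImageSets" / "Main" / f"{split}.txt"
    missing = []
    for line in split_file.read_text().splitlines():
        image_id = line.strip()
        if not image_id:
            continue  # skip blank lines
        has_image = (root / "JPEGImages" / f"{image_id}.jpg").is_file()
        has_xml = (root / "Annotations" / f"{image_id}.xml").is_file()
        if not (has_image and has_xml):
            missing.append(image_id)
    return missing


def read_boxes(xml_path):
    """Parse one VOC annotation into (class_name, xmin, ymin, xmax, ymax) tuples."""
    tree = ET.parse(xml_path)
    boxes = []
    for obj in tree.getroot().iter("object"):
        bndbox = obj.find("bndbox")
        # Coordinates are sometimes stored as floats, so convert via float first.
        coords = [int(float(bndbox.findtext(tag)))
                  for tag in ("xmin", "ymin", "xmax", "ymax")]
        boxes.append((obj.findtext("name"), *coords))
    return boxes
```

For example, `find_missing_files("VOCdevkit/VOC2007", "trainval")` returns an empty list when every ID in `trainval.txt` has both files in place.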
We constructed State-Air, a multi-scene, multi-modal UAV-based object detection dataset. It was captured in real-world outdoor settings covering a wide variety of scenes and weather conditions. We are committed to further expanding the scope and scale of State-Air.
By downloading or using the Dataset, as a Licensee I/we understand, acknowledge, and hereby agree to all the terms of use. This dataset is provided "as is" and without any warranty of any kind, express or implied. The authors and their affiliated institutions are not responsible for any errors or omissions in the dataset, or for the results obtained from the use of the dataset. The dataset is intended for academic research purposes only, and not for any commercial or other purposes. The users of the dataset agree to acknowledge the source of the dataset and cite the relevant papers in any publications or presentations that use the dataset. The users of the dataset also agree to respect the intellectual property rights of the original data owners.
@article{liu2024boost,
title={Boost UAV-based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning},
author={Liu, Fan and Yao, Liang and Zhang, Chuanyi and Wu, Ting and Zhang, Xinlei and Jiang, Xiruo and Zhou, Jun},
journal={arXiv preprint arXiv:2405.15465},
year={2024}
}
Please contact yaoliang@hhu.edu.cn.