G2A-VReID is a large-scale video-based pedestrian Re-Identification dataset constructed by the National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean (ASGO) Big Data Application Technology. It consists of 2,788 person IDs and 185,907 images, corresponding to 5,576 tracklets. The number of identities is significantly higher than in most existing datasets. To the best of our knowledge, G2A-VReID is the first dataset for video ReID under Ground-to-Aerial scenarios. The G2A-VReID dataset has the following characteristics:
- Drastic view changes;
- Large number of annotated identities;
- Rich outdoor scenarios;
- Huge difference in resolution.
G2A-VReID.v2 is an extended large-scale video-based pedestrian Re-Identification dataset. Compared to its predecessor, G2A-VReID.v2 significantly expands the dataset scale and complexity, containing 5,605 person IDs, 2.54 million images, and 11,282 tracklets, making it one of the largest publicly available video ReID datasets to date. The dataset possesses the following key characteristics:
- Extreme cross-viewpoint variations, including top-down and oblique angles from UAVs and conventional ground-level views;
- Large-scale identity and video tracklet annotations, more than doubling the size of the original G2A-VReID;
- Rich environmental diversity, covering 13 outdoor scenes such as construction sites, plazas, flyovers, and grasslands;
- Wide resolution disparity, especially under high-altitude UAV views;
- Multi-season and day-night coverage, capturing diverse lighting, weather, and appearance conditions across time.
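As a quick sanity check on the scale figures quoted above, the following sketch derives the average number of tracklets per identity and images per tracklet for both versions. The dictionary and function names are illustrative only, and the v2 image count uses the rounded "2.54 million" figure, so the v2 ratios are approximate.

```python
# Figures copied from the descriptions above.
V1 = {"ids": 2788, "images": 185_907, "tracklets": 5576}
# The v2 image count is the rounded "2.54 million", so derived values are approximate.
V2 = {"ids": 5605, "images": 2_540_000, "tracklets": 11_282}


def summarize(stats):
    """Return average tracklets per identity and images per tracklet."""
    return {
        "tracklets_per_id": stats["tracklets"] / stats["ids"],
        "images_per_tracklet": stats["images"] / stats["tracklets"],
    }


for name, stats in (("G2A-VReID", V1), ("G2A-VReID.v2", V2)):
    s = summarize(stats)
    print(
        f"{name}: {s['tracklets_per_id']:.2f} tracklets/ID, "
        f"{s['images_per_tracklet']:.1f} images/tracklet"
    )
```

For G2A-VReID the ratio is exactly two tracklets per identity. The v2 identity count is roughly twice that of the original, in line with the "more than doubling" statement above.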
We make every effort to protect the privacy of pedestrians in the following ways:
- We make our best effort to inform pedestrians about the data collection: cordons are used to mark the data collection areas, and notifications (including ) are posted near the sites during the collection process.
- To further minimize privacy risks, we mask clearly visible faces with a mosaic. Specifically, we build a face detection model (FDM) based on YOLOv5, trained on both the widerface-m and darkface-m datasets. The FDM is then employed to mark clearly visible faces of pedestrians, which are masked with a mosaic (RGB: [96, 96, 96]). In addition, 10 experienced annotators check the images for missed detections and mask them manually.
- Finally, the dataset will be licensed for non-profit academic research only; any researcher who downloads the G2A-VReID dataset must agree to observe these restrictions.
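The face-masking step described above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes the face detector has already produced bounding boxes as `(x1, y1, x2, y2)` pixel coordinates, that frames are NumPy arrays in RGB channel order, and it reads "mosaic (RGB: [96, 96, 96])" as a flat fill with that colour. The function name `mask_faces` is hypothetical.

```python
import numpy as np

# Fill colour quoted in the text above (RGB order).
MOSAIC_RGB = (96, 96, 96)


def mask_faces(image, boxes, color=MOSAIC_RGB):
    """Return a copy of `image` with every (x1, y1, x2, y2) box filled with `color`.

    `image` is an H x W x 3 uint8 array in RGB order. `boxes` holds pixel
    coordinates from a detector (assumed, not provided here); x2 and y2 are
    exclusive and all coordinates are clipped to the image bounds.
    """
    masked = image.copy()
    height, width = masked.shape[:2]
    for x1, y1, x2, y2 in boxes:
        x1, x2 = max(0, int(x1)), min(width, int(x2))
        y1, y2 = max(0, int(y1)), min(height, int(y2))
        if x2 > x1 and y2 > y1:  # skip boxes that are empty after clipping
            masked[y1:y2, x1:x2] = color
    return masked


# Usage: mask one hypothetical detection in a blank 10 x 10 frame.
frame = np.zeros((10, 10, 3), dtype=np.uint8)
result = mask_faces(frame, [(2, 3, 5, 6)])
```

The function copies the frame rather than editing it in place, so the unmasked original remains available for the manual check of missed detections.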
G2A-VReID is available at Link. G2A-VReID.v2 is available at Link. If the link becomes invalid, please feel free to contact luowenlong@mail.nwpu.edu.cn.
@inproceedings{zhang2024cross,
title={Cross-platform video person reid: A new benchmark dataset and adaptation approach},
author={Zhang, Shizhou and Luo, Wenlong and Cheng, De and Yang, Qingchun and Ran, Lingyan and Xing, Yinghui and Zhang, Yanning},
booktitle={European Conference on Computer Vision},
pages={270--287},
year={2024},
organization={Springer}
}