VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild
A large-scale vehicle ReID dataset in the wild (VERI-Wild) is captured from a large CCTV surveillance system consisting of 174 cameras across one month (30× 24h) under unconstrained scenarios. The cameras are distributed in a large urban district of more than 200km2. The YOLO-v2 [2] is used to detect the bounding box of vehicles. The raw vehicle image set contains 12 million vehicle images, and 11 volunteers are invited to clean the dataset for 1 month. After data cleaning and annotation, 416,314 vehicle images of 40,671 identities are collected. The statistics of VERI-Wild is illustrated in Figure. For privacy issues, the license plates are masked in the dataset. The distinctive features of VERI-Wild are summarized into the following aspects:
Unconstrained capture conditions in the wild
The VERI-Wild dataset is collected from a real CCTV camera system consisting of 174 surveillance cameras, in which the unconstrained image capture conditions pose a variety of challenges.
Complex capture conditions
The 174 surveillance cameras are distributed in an urban district over 200km2, presenting various backgrounds, resolutions, viewpoints, and occlusion in the wild. In extreme cases, one vehicle appears in more than 40 different cameras, which would be challenging for ReID algorithms.
Large time span involving severe illumination and weather changes
The VERI-Wild is collected from a duration of 125, 280 (174x24x30) video hours. Figure (b) gives the vehicle distributions in 4 time slots of 24h, i.e., morning, noon, afternoon, evening across 30 days. VERI-Wild also contains poor weather conditions, such as rainy, foggy, etc, which are not provided in previous datasets.
Rich Context Information
We provide rich context information such as camera IDs, timestamp, tracks relation across cameras, which are potential to facilitate the research on behavior analysis in camera networks, like vehicle behavior modeling, cross-camera tracking and graph-based retrieval.
Note that, for CVPR 2019 test set, given a query image, we remove the images with the same camera id and same vehicle id as query image in the gallery set. They are not considered when computing the mAP and CMC.
Baidu cloud disk: Link: https://pan.baidu.com/s/1ib_gj5IKI2hSv1RbGsQmFw Password: fbm1
Google driven: https://drive.google.com/drive/folders/1q9kClC0gy0bz2i06sHlKUh2Pqk6RxKan?usp=sharing
@inproceedings{lou2019large,
title={VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild},
author={Lou, Yihang and Bai, Yan and Liu, Jun and Wang, Shiqi and Duan, Ling-Yu},
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
pages = {3235--3243},
year={2019}
}
Yan Bai, Email: [email protected]
Methods | Small | Medium | Large | ||||||
mAP | Top1 | Top5 | mAP | Top1 | Top5 | mAP | Top1 | Top5 | |
GoogleNet[1] | 24.27 | 57.16 | 75.13 | 24.15 | 53.16 | 71.1 | 21.53 | 44.61 | 63.55 |
FDA-Net(VGGM)[2] | 35.11 | 64.03 | 82.80 | 29.80 | 57.82 | 78.34 | 22.78 | 49.43 | 70.48 |
MLSL[3] | 46.32 | - | - | 42.37 | - | - | 36.61 | - | - |
Triplet(Resnet50) | 58.43 | 65.76 | 86.98 | 49.72 | 57.76 | 80.86 | 38.57 | 47.65 | 71.66 |
FDA-Net(Resnet50)[2] | 61.57 | 73.62 | 91.23 | 52.69 | 64.29 | 85.39 | 45.78 | 58.76 | 80.97 |
AAVER(Resnet50)[4] | 62.23 | 75.80 | 92.70 | 53.66 | 68.24 | 88.88 | 41.68 | 58.69 | 81.59 |
DFLNet(Resnet50)[5] | 68.21 | 80.68 | 93.24 | 60.07 | 70.67 | 89.25 | 49.02 | 61.60 | 82.73 |
BS(mobilenet)[6] | 70.54 | 84.17 | 95.30 | 62.83 | 78.22 | 93.06 | 51.63 | 69.99 | 88.45 |
UMTS(Resnet50)[7] | 72.7 | 84.5 | - | 66.1 | 79.3 | - | 54.2 | 72.8 | - |
Strong baesline(Resnet50)[8] | 76.61 | 90.83 | 97.29 | 70.11 | 87.45 | 95.24 | 61.3 | 82.58 | 92.73 |
HPGN(Resnet50+PGN)[9] | 80.42 | 91.37 | - | 75.17 | 88.21 | - | 65.04 | 82.68 | - |
GLAMOR(Resnet50+PGN)[10] | 77.15 | 92.13 | 97.43 | - | - | - | - | - | - |
SAVER(Resnet50)[11] | 80.9 | 93.78 | 97.93 | 75.3 | 92.7 | 97.48 | 67.7 | 89.5 | 95.8 |
[1] Yang, L., Luo, P., Change Loy, C., Tang, X.: A large-scale car dataset for fine-grained categorization and verification. In: IEEE Conference on Computer Visionand Pattern Recognition. pp. 3973–3981 (2015)
[2] Lou, Y., Bai, Y., Liu, J., Wang, S., Duan, L.: Veri-wild: A large dataset and a newmethod for vehicle re-identification in the wild. In: IEEE Conference on ComputerVision and Pattern Recognition. pp. 3235–3243 (2019)
[3] Alfasly, S., Hu, Y., Li, H., Liang, T., Jin, X., Liu, B., Zhao, Q.: Multi-label-basedsimilarity learning for vehicle re-identification. IEEE Access7, 162605–162616(2019)
[4] Pirazh, K., Kumar, A., Peri, N., et al: A dual path modelwith adaptive attentionfor vehicle re-identification. In: IEEE International Conference on Computer Vision(2019)
[5] Yan Bai, Yihang Lou, Yongxing Dai, et al: Disentangled Feature Learning Network for Vehicle Re-Identification. In: IJCAI 2020
[6] Kuma Ratnesh and Weill Edwin and et al: Vehicle re-identification: an efficient baseline using triplet embedding. IN IJCNN 2019
[7] Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen: Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification. In: AAAI 2020
[8] Luo Hao and Gu Youzhi and et al:Bag of Tricks and a Strong Baseline for Deep Person Re-Identification. In CVPR workshop 2019.
[9] Shen Fei, Zhu Jianqing and et al: Exploring Spatial Significance via Hybrid Pyramidal Graph Network for Vehicle Re-identification. In arXiv preprint arXiv:2005.14684
[10] Abhijit Suprem and Calton Pu: Looking GLAMORous: Vehicle Re-Id in Heterogeneous Cameras Networks with Global and Local Attention. In arXiv preprint arXiv:2002.02256
[11] Khorramshahi Pirazh, Peri Neehar, Chen Jun-cheng, Chellappa Rama: The Devil is in the Details: Self-Supervised Attention for Vehicle Re-Identification. In ECCV 2020