This repo contains our car crash dataset (CCD) for traffic accident anticipation.
Car Crash Dataset (CCD) is collected for traffic accident analysis. It contains real traffic accident videos captured by dashcam mounted on driving vehicles, which is critical to developing safety-guaranteed self-driving systems. CCD is distinguished from existing datasets for diversified accident annotations, including environmental attributes (day/night, snowy/rainy/good weather conditions), whether ego-vehicles involved, accident participants, and accident reason descriptions. An overview of our accident annotations is dipicted in the figure above.
Download CCD from Google Drive.
Please make sure your downloaded files are structured as following:
CarCrash
├── codes # useful codes for analyzing the dataset
├── vgg16_features
│ ├── positive # feature files of possitive (accident) videos
│ │ ├── 000001.npz
│ │ ├── ...
│ │ └── 001500.npz
│ ├── negative # feature files of negative (normal) videos
│ │ ├── 000001.npz
│ │ ├── ...
│ │ └── 003000.npz
│ ├── train.txt # list file of training split
│ └── test.txt # list file of testing split
├── videos
│ ├── Normal # normal driving videos
│ │ ├── 000001.mp4
│ │ ├── ...
│ │ └── 003000.mp4
│ ├── Crash-1500 # crash accident videos
│ │ ├── 000001.mp4
│ │ ├── ...
│ │ └── 001500.mp4
│ └── Crash-1500.txt # annotation file for crash accident
└── README.md
Videos: We collected traffic accident videos from YouTube channels and split them to get 1,500 trimmed videos. Each video contains 50 frames with 10 frames per second. The 3,000 normal videos are randomly sampled from BDD100K dataset.
Features: Feature files in folders vgg16_features/
are extracted VGG-16 features of all frames and all detected bounding boxes. All of bounding boxes are saved in these files. We use the public detection codebase MMDetection to train Cascade R-CNN with ResNeXt-101 backbone and FPN neck as our object detector on KITTI 2D detection dataset. We use Cascade R-CNN to obtain top-19 bounding boxes and then they are used for feature extraction with pre-trained ImageNet VGG-16 model. For each feature file (i.e., 000001.npz
), the following data are provided:
- data: Extracted 4096-dim features with shape (50, 20, 4096). It contains frame-level feature with shape (50, 1, 4096) and 19 box-level features with shape (50, 19, 4096).
- det: Detected bounding boxes with shape (50, 19, 6), where the last dim denotes (x1, y1, x2, y2, prob, cls).
- labels: One-hot video labels to indicate whether the video contains an accident, i.e.,
[0, 1]
denotes positive (accident) and[1, 0]
denotes negative (normal).- ID: The video name for current feature file.
The annotations of 1,500 accident videos are saved in the file CarCrash/videos/Crash-1500.txt
. For each line, the followings are provided:
- vidname: Video name of each accident video, i.e.,
000001
- binlabels: Binary labels of all 50 frames for each video. Label
1
indicates accident frame.- startframe: The zero-padded starting frame of each video in original untrimmed YouTube video.
- youtubeID: The numeric ID of original YouTube video identifier.
- timing: Timing conditions, i.e.,
Day
andNight
.- weather: Weather conditions, i.e.,
Normal
,Snowy
, andRainy
.- egoinvolve: Boolean identifier to indicate wheather the ego-vehicle is involved in the accident of this video.
Note: Current version of CCD only provides the temporal annotations and environmental attributes listed above. For more detailed annotations such as traffic accident reasons and tracklets, we will release them soon.
Please cite our paper if you find our dataset useful.
@InProceedings{BaoMM2020,
author = {Bao, Wentao and Yu, Qi and Kong, Yu},
title = {Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning},
booktitle = {ACM Multimedia Conference},
month = {May},
year = {2020}
}
We thank Haiting Hao a lot for data collection. If you have any questions about the dataset, please feel free to contact Wentao Bao.