A Large-scale Video Dataset of Diverse Traffic Scenarios in China
The D²-City dataset is a comprehensive collection of dashcam videos collected by vehicles on DiDi’s platform in 5 Chinese cities. All videos are 30-second clips in 720p or 1080p resolution at a frame rate of 25fps. The dataset will be divided into training, validation, and testing subsets. The data files and statistics will then be released in stages.For around 1000 of the videos, we have annotated the bounding boxes and tracking IDs of road objects into 12 different categories, shown in the following table. For some of the remainder of the videos, we annotate the bounding boxes in key frames.