KITTI Vision Benchmark Suite

The KITTI Vision Benchmark Suite is a widely used computer vision benchmark released in 2012. To record the data, a Volkswagen station wagon was fitted with grayscale and color cameras, a Velodyne 3D laser scanner, and a GPS/IMU system.

All datasets and benchmarks in this suite are published under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. This means that you must attribute the work in the manner specified by the authors; you may not use this work for commercial purposes; and if you alter, transform, or build upon this work, you may distribute the resulting work only under the same license.

Additional Info

Field Value
Source Karlsruhe Institute of Technology
Maintainer Prof. Andreas Geiger
Associated Tasks Image Recognition, Image Parsing
Geographical Area Karlsruhe, Germany
Number of Instances Unknown
Package Description The dataset comprises the following information, captured and synchronized at 10 Hz: raw (unsynced and unrectified) and processed (synced and rectified) grayscale stereo sequences (0.5 megapixels, stored in PNG format); raw (unsynced and unrectified) and processed (synced and rectified) color stereo sequences (0.5 megapixels, stored in PNG format); 3D Velodyne point clouds (100k points per frame, stored as a binary float matrix); 3D GPS/IMU data (location, speed, acceleration, and meta information, stored as a text file); calibration data (camera, camera-to-GPS/IMU, and camera-to-Velodyne, stored as a text file); and 3D object tracklet labels (cars, trucks, trams, pedestrians, and cyclists, stored as an XML file).
Dataset has missing values False
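
The package description says the Velodyne point clouds are stored as a binary float matrix. The following is a minimal sketch of how such a file might be read with NumPy. It assumes 32-bit floats with four values per point (x, y, z, reflectance), which is the layout commonly reported for KITTI scans but is not stated on this page. The function name load_velodyne_points is hypothetical.

```python
import numpy as np


def load_velodyne_points(path):
    """Read one Velodyne scan into an (N, 4) float32 array.

    Assumption: the file is a flat sequence of float32 values,
    four per point (x, y, z, reflectance). Adjust the dtype or
    column count if the files you have use a different layout.
    """
    raw = np.fromfile(path, dtype=np.float32)
    if raw.size % 4 != 0:
        raise ValueError(
            f"{path}: {raw.size} floats is not a multiple of 4; "
            "the assumed (x, y, z, reflectance) layout does not fit"
        )
    # One row per point, one column per value.
    return raw.reshape(-1, 4)
```

For example, points = load_velodyne_points("scan.bin") (the path is a placeholder) would give an array whose first three columns are point coordinates and whose fourth column is reflectance, under the layout assumed above.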