[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Vision meets robotics: The KITTI dataset

Published: 01 September 2013 Publication History

Abstract

We present a novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research. In total, we recorded 6 hours of traffic scenarios at 10-100 Hz using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras, a Velodyne 3D laser scanner and a high-precision GPS/IMU inertial navigation system. The scenarios are diverse, capturing real-world traffic situations, and range from freeways over rural areas to inner-city scenes with many static and dynamic objects. Our data is calibrated, synchronized and timestamped, and we provide the rectified and raw image sequences. Our dataset also contains object labels in the form of 3D tracklets, and we provide online benchmarks for stereo, optical flow, object detection and other tasks. This paper describes our recording platform, the data format and the utilities that we provide.

References

[1]
Brubaker MA,Geiger A,Urtasun R.Lost! leveraging the crowd for probabilistic visual self-localization.Conference on computer vision and pattern recognition (CVPR); 2013; 2013. .
[2]
Geiger A,Lauer M,Urtasun R.A generative model for 3D urban scene understanding from movable platforms.Conference on computer vision and pattern recognition (CVPR); 2011 a; 2011 a. .
[3]
Geiger A,Wojek C,Urtasun R.Joint 3D estimation of objects and scene layout.Conference on neural information processing systems (NIPS); 2011 b; 2011 b. .
[4]
Geiger A,Lenz P,Urtasun R.Are we ready for autonomous driving? The KITTI vision benchmark suite.Conference on computer vision and pattern recognition (CVPR); 2012 a; 2012 a. .
[5]
Geiger A,Moosmann F,Car O,Schuster B.A toolbox for automatic calibration of range and camera sensors using a single shot.International conference on robotics and automation (ICRA); 2012 b; 2012 b. .
[6]
Goebl M,Faerber G.A real-time-capable hard- and software architecture for joint image and knowledge processing in cognitive automobiles.Proceedings of the Intelligent Vehicles Symposium (IV); 2007; 2007. .
[7]
Horaud R,Dornaika F.Hand-eye calibration.International Journal of Robotics Research. 1995;14 (3): 195-210
[8]
OsborneP (2008) The mercator projections. Available at: http://mercator.myzen.co.uk/mercator.pdf.
[9]
Paul R,Newman P.FAB-MAP 3D: Topological mapping with spatial and visual appearance.International conference on robotics and automation (ICRA); 2010; 2010. .
[10]
Pfeiffer D,Franke U.Efficient representation of traffic scenes by means of dynamic stixels.Proceedings of the Intelligent Vehicles Symposium (IV); 2010; 2010. .
[11]
Singh G,Kosecka J.Acquiring semantics induced topology in urban environments.International conference on robotics and automation (ICRA); 2012; 2012. .
[12]
Wojek C,Walk S,Roth S,Schindler K,Schiele B.Monocular visual scene understanding: Understanding multi-object traffic scenes.Proceedings of the Intelligent Vehicles Symposium (IV); 2012; 2012. .

Cited By

View all
  • (2025)Advancing 3D point cloud understanding through deep transfer learningInformation Fusion10.1016/j.inffus.2024.102601113:COnline publication date: 1-Jan-2025
  • (2025)Graph-based robust 3D point cloud map merging approach for large scaleCluster Computing10.1007/s10586-024-04797-628:1Online publication date: 1-Feb-2025
  • (2024)Evaluation of Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric StudyProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663119(2237-2239)Online publication date: 6-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image International Journal of Robotics Research
International Journal of Robotics Research  Volume 32, Issue 11
September 2013
127 pages

Publisher

Sage Publications, Inc.

United States

Publication History

Published: 01 September 2013

Author Tags

  1. Dataset
  2. GPS
  3. KITTI
  4. SLAM
  5. autonomous driving
  6. benchmarks
  7. cameras
  8. computer vision
  9. field robotics
  10. laser
  11. mobile robotics
  12. object detection
  13. optical flow
  14. stereo
  15. tracking

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Advancing 3D point cloud understanding through deep transfer learningInformation Fusion10.1016/j.inffus.2024.102601113:COnline publication date: 1-Jan-2025
  • (2025)Graph-based robust 3D point cloud map merging approach for large scaleCluster Computing10.1007/s10586-024-04797-628:1Online publication date: 1-Feb-2025
  • (2024)Evaluation of Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric StudyProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663119(2237-2239)Online publication date: 6-May-2024
  • (2024)Multiple moving object classification and tracking using DenCNN classifierJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23484046:5-6(11311-11329)Online publication date: 24-Oct-2024
  • (2024)Nighttime vehicle detection algorithm based on image translation technologyJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23389946:2(5377-5389)Online publication date: 14-Feb-2024
  • (2024)Research on GDR Obstacle Detection Method Based on Stereo VisionAutomatic Control and Computer Sciences10.3103/S014641162401006158:1(90-100)Online publication date: 1-Feb-2024
  • (2024)Semi-automated computer vision-based tracking of multiple industrial entities: a framework and dataset creation approachJournal on Image and Video Processing10.1186/s13640-024-00623-62024:1Online publication date: 22-Mar-2024
  • (2024)MUN-FRLInternational Journal of Robotics Research10.1177/0278364924123835843:12(1853-1866)Online publication date: 1-Oct-2024
  • (2024)The INSANE datasetInternational Journal of Robotics Research10.1177/0278364924122724543:8(1083-1113)Online publication date: 1-Jul-2024
  • (2024)Formalizing and evaluating requirements of perception systems for automated vehicles using spatio-temporal perception logicInternational Journal of Robotics Research10.1177/0278364923122354643:2(203-238)Online publication date: 1-Feb-2024
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media