Object Detection on KITTI Autonomous Driving Scenes Based on YOLOv5 and PointNet

Chinese version

YOLOv5_and_pointnet_for_object_detection_on_kitti

Object Detection Pipeline in KITTI Autonomous Driving Scenes Using Images and Lidar Point Clouds

Abstract

In recent years, rapid progress in artificial intelligence has significantly advanced autonomous driving, drawing growing attention from researchers and the public. In this project, we built an object detection system for images and lidar point clouds on the public KITTI autonomous driving dataset. For image object detection, we used YOLOv5 as the detection network. Running the YOLOv5x model (the largest and most accurate, but slowest, variant) on the Kaggle platform with a Tesla P100-PCIE-16GB GPU, the average detection time was 0.044 s per image, or about 22.73 FPS, with excellent detection results. For lidar point clouds, because of the particular nature of point cloud data, we took a different approach: we segmented the ground with a non-deep-learning algorithm, clustered the remaining points into candidate objects, classified each cluster with the PointNet point cloud classification network, and converted the classified clusters into detection boxes. This approach performed well under the official KITTI evaluation code. We also visualized the results, including projecting image colors onto the point cloud and the point cloud onto the image. Finally, we discuss some limitations and directions for future improvement.

Complete code available on GitHub (click to enter)

Code with partial datasets available on Baidu Netdisk (click to enter)

Link: https://pan.baidu.com/s/1tjJuhY47BHEms3uokNnvIg

Password: sda6

2D Object Detection on KITTI Dataset Based on YOLOv5

Code: Reproduction on Kaggle

For images, we used YOLOv5 as the 2D object detection model, applying the official model pre-trained on the COCO dataset to the public KITTI dataset. Our experiments confirmed its accuracy and recall, and it produced good detection results on KITTI images.
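
The official KITTI evaluation code reads detections from plain-text label files, so 2D detections from a COCO-pretrained model have to be renamed and serialized before they can be scored. The sketch below shows one way this could be done. The class mapping `COCO_TO_KITTI` and the helper `to_kitti_line` are hypothetical names introduced for illustration, not taken from the project code, and the placeholder values for the 3D fields follow the common convention for 2D-only results.

```python
# Illustrative COCO -> KITTI class mapping (an assumption, not the project's own).
COCO_TO_KITTI = {"car": "Car", "truck": "Truck", "person": "Pedestrian"}


def to_kitti_line(coco_name, box_xyxy, score):
    """Format one 2D detection as a KITTI result line, or None if unmapped.

    Field order: type, truncated, occluded, alpha, bbox (left, top, right,
    bottom), dimensions (h, w, l), location (x, y, z), rotation_y, score.
    Fields unknown to a 2D detector are filled with placeholder values.
    """
    kitti_name = COCO_TO_KITTI.get(coco_name)
    if kitti_name is None:
        return None  # class has no KITTI counterpart in this mapping
    x1, y1, x2, y2 = box_xyxy
    return (
        f"{kitti_name} -1 -1 -10 "
        f"{x1:.2f} {y1:.2f} {x2:.2f} {y2:.2f} "
        "-1 -1 -1 -1000 -1000 -1000 -10 "
        f"{score:.4f}"
    )
```

Each image would get one text file with one such line per detection. Which COCO classes should count as which KITTI classes is a judgment call that affects the reported scores.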

Point Cloud Object Detection Based on the PointNet Classification Network

Code: GitHub

Main ideas:

  1. Segment the ground from the point cloud using non-deep learning methods.
  2. Cluster the remaining point cloud points above the ground.
  3. Classify the clustered point cloud using PointNet.
  4. Convert the classified point cloud data into 3D detection boxes.
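
The steps above can be sketched in a few functions. This is a minimal illustration under stated assumptions, not the project's implementation: RANSAC plane fitting stands in for the unspecified ground segmentation method, a simple region-growing Euclidean clustering stands in for the clustering step, and the boxes are axis-aligned. The PointNet classification step is omitted, and all function names and thresholds are hypothetical. Plain Python is used to keep the sketch dependency-free; real code would likely vectorize this with NumPy.

```python
import math
import random
from collections import deque


def fit_ground_plane(points, n_iters=100, dist_thresh=0.2, seed=0):
    """RANSAC plane fit. Returns (a, b, c, d) with unit normal, ax+by+cz+d=0."""
    rng = random.Random(seed)
    best_plane, best_count = None, -1
    for _ in range(n_iters):
        p1, p2, p3 = rng.sample(points, 3)
        u = [p2[i] - p1[i] for i in range(3)]
        v = [p3[i] - p1[i] for i in range(3)]
        # The plane normal is the cross product of two in-plane vectors.
        n = (u[1] * v[2] - u[2] * v[1],
             u[2] * v[0] - u[0] * v[2],
             u[0] * v[1] - u[1] * v[0])
        norm = math.sqrt(sum(c * c for c in n))
        if norm < 1e-9:
            continue  # the three samples are (nearly) collinear
        a, b, c = (x / norm for x in n)
        d = -(a * p1[0] + b * p1[1] + c * p1[2])
        count = sum(abs(a * x + b * y + c * z + d) < dist_thresh
                    for x, y, z in points)
        if count > best_count:
            best_plane, best_count = (a, b, c, d), count
    return best_plane


def remove_ground(points, plane, dist_thresh=0.2):
    """Keep only the points farther than dist_thresh from the plane."""
    a, b, c, d = plane
    return [p for p in points
            if abs(a * p[0] + b * p[1] + c * p[2] + d) >= dist_thresh]


def euclidean_clusters(points, radius=0.5, min_size=5):
    """Group points by region growing: neighbors within radius join a cluster."""
    unvisited = set(range(len(points)))
    clusters = []
    while unvisited:
        start = unvisited.pop()
        queue, members = deque([start]), [start]
        while queue:
            i = queue.popleft()
            near = [j for j in unvisited
                    if math.dist(points[i], points[j]) <= radius]
            unvisited.difference_update(near)
            queue.extend(near)
            members.extend(near)
        if len(members) >= min_size:  # drop tiny clusters as noise
            clusters.append([points[k] for k in members])
    return clusters


def axis_aligned_box(cluster):
    """Return (min_corner, max_corner) of the cluster's bounding box."""
    xs, ys, zs = zip(*cluster)
    return (min(xs), min(ys), min(zs)), (max(xs), max(ys), max(zs))
```

In a full pipeline, each cluster would be passed to the PointNet classifier between `euclidean_clusters` and `axis_aligned_box`, and only clusters assigned to an object class would become detection boxes. The neighbor search here is quadratic in the number of points, so a spatial index such as a k-d tree would be needed for full lidar scans.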

Data Visualization of Object Detection

In the final part, to facilitate the work and future data integration, we visualized the data, including but not limited to:

  1. Visualization of the original image and point cloud data.
  2. Visualization of image + 2D detection box.
  3. Visualization of image + 3D detection box.
  4. Visualization of point cloud + 3D detection box.
  5. Visualization of image RGB values projected onto the point cloud.
  6. Visualization of point cloud projected onto the image.
  7. Visualization of point cloud ground segmentation results.
  8. Visualization of point cloud segmentation results.
  9. Visualization of point cloud consecutive frame detection GIF (two examples).
  10. Video detection GIF visualization (two examples).
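
Projecting the point cloud onto the image, and coloring points with image RGB values, both rest on the same KITTI calibration chain: a lidar point in homogeneous coordinates is mapped to pixels by `P2 · R0_rect · Tr_velo_to_cam`. The sketch below illustrates that chain in plain Python. The function names are hypothetical, and the matrices are assumed to be already parsed from the KITTI calibration file into nested lists (`P2` and `Tr_velo_to_cam` as 3x4, `R0_rect` as 3x3).

```python
def pad4(m):
    """Extend a 3x3 or 3x4 matrix to a 4x4 homogeneous matrix."""
    rows = [list(r) + [0.0] * (4 - len(r)) for r in m]
    rows.append([0.0, 0.0, 0.0, 1.0])
    return rows


def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def project_velo_to_image(points, P2, R0_rect, Tr_velo_to_cam, width, height):
    """Project lidar points to pixels. Returns a list of (u, v, w) tuples.

    w is the homogeneous scale, which is close to the depth along the
    camera's optical axis. Points behind the camera or outside the image
    are dropped.
    """
    # Combined 3x4 projection from lidar coordinates to the image plane.
    T = matmul(P2, matmul(pad4(R0_rect), pad4(Tr_velo_to_cam)))
    pixels = []
    for x, y, z in points:
        u_h, v_h, w = (r[0] * x + r[1] * y + r[2] * z + r[3] for r in T)
        if w <= 0:
            continue  # point lies behind the camera
        u, v = u_h / w, v_h / w
        if 0 <= u < width and 0 <= v < height:
            pixels.append((u, v, w))
    return pixels
```

To draw the point cloud on the image, each `(u, v)` can be plotted with a color derived from `w`. To color the point cloud instead, the image pixel at `(int(v), int(u))` would be looked up and attached to the corresponding lidar point.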