This is Project 4: Advanced Lane Finding in the Self-Driving Car Nanodegree course by Udacity. The goal is to write a software pipeline to identify the lane boundaries in a video.
The original assignment can be found in the project repository.
Advanced Lane Finding Project
The goals / steps of this project are the following:
- Compute the camera calibration matrix and distortion coefficients given a set of chessboard images.
- Apply a distortion correction to raw images.
- Use color transforms, gradients, etc., to create a thresholded binary image.
- Apply a perspective transform to rectify binary image ("birds-eye view").
- Detect lane pixels and fit to find the lane boundary.
- Determine the curvature of the lane and vehicle position with respect to center.
- Warp the detected lane boundaries back onto the original image.
- Output visual display of the lane boundaries and numerical estimation of lane curvature and vehicle position.
My project includes the following files:
- Advanced_Lane_Finding.ipynb, containing all the steps performed in the project.
- writeup_report.md, containing the explanation for each step.
- utils, a folder for Python helper files such as thresholds.py.
- output_images, a folder for all the output images produced from the images in the camera_cal and test_images folders.
- project_video_result.mp4, showing the final lane-finding result.
- report.pdf, the PDF export of Advanced_Lane_Finding.ipynb.
- Other default files in the project repository
1. Compute the camera calibration matrix and distortion coefficients given a set of chessboard images
The code for this step is contained in the code cells of the IPython notebook located in "./Advanced_Lane_Finding.ipynb". The main process contains the following steps:
Step 1: Preparing "object points", which will be the (x, y, z) coordinates of the chessboard corners in the world. Here I am assuming the chessboard is fixed on the (x, y) plane at z=0, such that the object points are the same for each calibration image. Thus, `objp` is just a replicated array of coordinates, and `objpoints` will be appended with a copy of it every time I successfully detect all chessboard corners in a test image. `imgpoints` will be appended with the (x, y) pixel position of each of the corners in the image plane with each successful chessboard detection.
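A minimal sketch of this corner-detection loop, assuming the standard 9x6 board and the calibration file names from the camera_cal folder:

```python
import glob
import cv2
import numpy as np

# One (9x6) grid of object points: (0,0,0), (1,0,0), ..., (8,5,0)
objp = np.zeros((9 * 6, 3), np.float32)
objp[:, :2] = np.mgrid[0:9, 0:6].T.reshape(-1, 2)

objpoints = []  # 3D points in world space
imgpoints = []  # 2D points in the image plane

for fname in glob.glob('camera_cal/calibration*.jpg'):  # assumed file pattern
    img = cv2.imread(fname)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    ret, corners = cv2.findChessboardCorners(gray, (9, 6), None)
    if ret:  # only keep images where all corners were found
        objpoints.append(objp)
        imgpoints.append(corners)
```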
Step 2: Computing the camera calibration and distortion coefficients by using the output `objpoints` and `imgpoints` and the `cv2.calibrateCamera()` function.
```python
ret, mtx, dist, rvecs, tvecs = cv2.calibrateCamera(objpoints, imgpoints, gray.shape[::-1], None, None)
```
Step 3: Applying the distortion correction with the `cv2.undistort()` function to the test images. Here is an example of the result:

```python
def undistort_image(img):
    # Undistort a test image using the calibration results
    return cv2.undistort(img, mtx, dist, None, mtx)
```
In the first part, I also draw and display the detected chessboard corners. Please refer to the Jupyter notebook for the display.
2. Apply a distortion correction to raw images
I applied the `undistort_image()` function to the image samples in the camera_cal and test_images folders and saved the results into the ./output_images/ folder. Please check the folders on GitHub to verify.
3. Use color transforms, gradients, etc., to create a thresholded binary image
I used a combination of color and gradient thresholds to generate a binary image. The `toBinary()` function is defined in the ./utils/thresholds.py file.
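A minimal sketch of what a combined threshold like `toBinary()` might look like (the HLS S-channel and Sobel-x threshold values here are assumptions, not necessarily the ones used in thresholds.py):

```python
import cv2
import numpy as np

def to_binary(img, s_thresh=(170, 255), sx_thresh=(20, 100)):
    # The HLS saturation channel picks up lane lines under varying color and light
    s_channel = cv2.cvtColor(img, cv2.COLOR_RGB2HLS)[:, :, 2]

    # Sobel in x emphasizes near-vertical edges such as lane lines
    gray = cv2.cvtColor(img, cv2.COLOR_RGB2GRAY)
    abs_sobelx = np.absolute(cv2.Sobel(gray, cv2.CV_64F, 1, 0))
    scaled = np.uint8(255 * abs_sobelx / np.max(abs_sobelx))

    # A pixel is "on" if either threshold fires
    binary = np.zeros_like(s_channel)
    binary[((s_channel >= s_thresh[0]) & (s_channel <= s_thresh[1])) |
           ((scaled >= sx_thresh[0]) & (scaled <= sx_thresh[1]))] = 1
    return binary
```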
The resulting binary image is shown below.
4. Apply a perspective transform to rectify binary image ("birds-eye view")
The code for my perspective transform includes a function called `warper()`, which appears in code cell [8] of the IPython notebook. The `warper()` function takes as inputs an image (`img`), as well as source (`src`) and destination (`dst`) points. I chose to hardcode the source and destination points in the following manner:
```python
# Offset for dst points
offset = 350

# Source points
src = np.float32([[610, 450],
                  [680, 450],
                  [img_size[0] - 300, 680],
                  [380, 680]])

# Destination points
dst = np.float32([[offset, 0],
                  [img_size[0] - offset, 0],
                  [img_size[0] - offset, img_size[1]],
                  [offset, img_size[1]]])
```
This resulted in the following source and destination points:
| Source | Destination |
|---|---|
| 610, 450 | offset, 0 |
| 680, 450 | img_size[0]-offset, 0 |
| img_size[0]-300, 680 | img_size[0]-offset, img_size[1] |
| 380, 680 | offset, img_size[1] |
I verified that my perspective transform was working as expected by drawing the `src` and `dst` points onto a test image and its warped counterpart, checking that the lines appear parallel in the warped image.
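For reference, a `warper()` along these lines is a thin wrapper over OpenCV (a minimal sketch, not necessarily the exact notebook cell):

```python
import cv2

def warper(img, src, dst):
    # Compute the perspective transform and warp to a top-down view
    M = cv2.getPerspectiveTransform(src, dst)
    img_size = (img.shape[1], img.shape[0])
    return cv2.warpPerspective(img, M, img_size, flags=cv2.INTER_LINEAR)
```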
5. Detect lane pixels and fit to find the lane boundary
I used a convolution function, `find_window_centroids`, which maximizes the number of pixels in each window. A convolution is the summation of the product of two separate signals; in this case, they are the window template and a vertical slice of the pixel image. By sliding the window template across the image from left to right, the overlapping values are summed together, creating the convolved signal. The peak of the convolved signal is where the overlap of pixels was highest, and therefore the most likely position for the lane marker.
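A condensed sketch of the convolution step for the bottom slice of the image (here `binary_warped` is the thresholded bird's-eye image from the previous steps, the window width is an assumption, and the full `find_window_centroids` repeats this for each vertical layer):

```python
import numpy as np

window_width = 50
window = np.ones(window_width)  # flat template that sums pixels under it

# Column-wise sum of the bottom quarter of the image, split into left/right halves
image_slice = np.sum(binary_warped[int(3 * binary_warped.shape[0] / 4):, :], axis=0)
midpoint = binary_warped.shape[1] // 2

# The peak of the convolved signal marks the likeliest lane-line center
left_center = np.argmax(np.convolve(window, image_slice[:midpoint])) - window_width / 2
right_center = np.argmax(np.convolve(window, image_slice[midpoint:])) - window_width / 2 + midpoint
```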
Finally, I combine the left and right points over the original image. Here is an example:
6. Determine the curvature of the lane and vehicle position with respect to center
For a second-order polynomial fit x = Ay^2 + By + C, the radius of curvature can be calculated with the following equation:

R = (1 + (2Ay + B)^2)^(3/2) / |2A|
By using the `leftx` and `rightx` from `find_window_centroids`, I can define the `curvature()` function as in code cell [11].
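A sketch of such a `curvature()` (the meters-per-pixel conversion factors are the values commonly used for this project and are assumptions here):

```python
import numpy as np

ym_per_pix = 30 / 720   # meters per pixel in the y dimension (assumed)
xm_per_pix = 3.7 / 700  # meters per pixel in the x dimension (assumed)

def curvature(leftx, rightx, ploty):
    # Refit the polynomials in world (meter) space
    left_fit = np.polyfit(ploty * ym_per_pix, leftx * xm_per_pix, 2)
    right_fit = np.polyfit(ploty * ym_per_pix, rightx * xm_per_pix, 2)

    # Evaluate R = (1 + (2Ay + B)^2)^(3/2) / |2A| at the bottom of the image
    y_eval = np.max(ploty) * ym_per_pix
    left_rad = (1 + (2 * left_fit[0] * y_eval + left_fit[1]) ** 2) ** 1.5 / abs(2 * left_fit[0])
    right_rad = (1 + (2 * right_fit[0] * y_eval + right_fit[1]) ** 2) ** 1.5 / abs(2 * right_fit[0])
    return left_rad, right_rad
```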
7. Warp the detected lane boundaries back onto the original image
I implemented this step in code cell [12]. Here is an example of my result on a test image:
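The unwarping step might look like the following sketch, assuming `Minv` is the inverse perspective matrix (from `cv2.getPerspectiveTransform(dst, src)`) and `left_fitx`/`right_fitx` are the fitted lane x-values along `ploty`:

```python
import cv2
import numpy as np

def draw_lane(undist, binary_warped, left_fitx, right_fitx, ploty, Minv):
    # Draw the lane polygon on a blank bird's-eye canvas
    color_warp = np.zeros((binary_warped.shape[0], binary_warped.shape[1], 3), np.uint8)
    pts = np.vstack((np.column_stack([left_fitx, ploty]),
                     np.flipud(np.column_stack([right_fitx, ploty])))).astype(np.int32)
    cv2.fillPoly(color_warp, [pts], (0, 255, 0))

    # Warp back to the original perspective and overlay on the undistorted image
    newwarp = cv2.warpPerspective(color_warp, Minv, (undist.shape[1], undist.shape[0]))
    return cv2.addWeighted(undist, 1, newwarp, 0.3, 0)
```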
8. Output visual display of the lane boundaries and numerical estimation of lane curvature and vehicle position
I used `text()` from matplotlib to write the numerical estimation of lane curvature and vehicle position on top of the image. Here is one example:
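For instance (the variable names `result`, `curve_rad`, and `offset_m` are illustrative assumptions):

```python
import matplotlib.pyplot as plt

plt.imshow(result)
plt.text(50, 60, 'Radius of Curvature = %.0f m' % curve_rad, color='white', fontsize=14)
plt.text(50, 110, 'Vehicle is %.2f m from center' % offset_m, color='white', fontsize=14)
plt.axis('off')
```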
Combining all the steps above, I define the `lane_finding()` function as in code cell [14]. Then I applied the function to the project video in step #10, and it gets a very good result.
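Applying the pipeline to the video might look like the following sketch with moviepy (`fl_image` applies `lane_finding()` to every frame; the output file name matches the one in the repository):

```python
from moviepy.editor import VideoFileClip

clip = VideoFileClip('project_video.mp4')
result_clip = clip.fl_image(lane_finding)  # process the video frame by frame
result_clip.write_videofile('project_video_result.mp4', audio=False)
```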
Here's a link to my video result, or you can watch it by clicking the thumbnail below.
The final `lane_finding()` function works pretty well on the project_video but shows limited results on the challenge_video and harder_challenge_video. This may be due to different lighting conditions, sharper curvature, or higher speed, so I need to find more advanced techniques to overcome these issues.
There are many things I could do to improve this project, such as:
- Using different transform techniques (colors, gradients, etc.) to convert an image to its binary image and comparing their performance in the algorithm.
- Experimenting with different threshold values, color spaces, and their combinations.
- Improving the perspective transform to rectify the binary image by applying different techniques.
- Finding solutions for the challenge videos.