Visual Perception for Self-Driving Cars

Ends in 4 days! Save 40% on your access to 10,000+ programs and make a real impact in your career. Save now.

Visual Perception for Self-Driving Cars

Name: Visual Perception for Self-Driving Cars
Rating: 4.6763202725724025 (587 reviews)

This course is part of Self-Driving Cars Specialization

Instructors: Steven Waslander

45,841 already enrolled

Included with Learn more

Ask Coursera

7 modules

Gain insight into a topic and learn the fundamentals.

587 reviews

Advanced level

Recommended experience

Flexible schedule

3 weeks at 10 hours a week

Learn at your own pace

95%

Most learners liked this course

7 modules

Gain insight into a topic and learn the fundamentals.

587 reviews

Advanced level

Recommended experience

Flexible schedule

3 weeks at 10 hours a week

Learn at your own pace

95%

Most learners liked this course

What you'll learn

Work with the pinhole camera model, and perform intrinsic and extrinsic camera calibration
Detect, describe and match image features and design your own convolutional neural networks
Apply these methods to visual odometry, object detection and tracking
Apply semantic segmentation for drivable surface estimation

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

4 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Self-Driving Cars Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 7 modules in this course

Welcome to Visual Perception for Self-Driving Cars, the third course in University of Toronto’s Self-Driving Cars Specialization.

This course will introduce you to the main perception tasks in autonomous driving, static and dynamic object detection, and will survey common computer vision methods for robotic perception. By the end of this course, you will be able to work with the pinhole camera model, perform intrinsic and extrinsic camera calibration, detect, describe and match image features and design your own convolutional neural networks. You'll apply these methods to visual odometry, object detection and tracking, and semantic segmentation for drivable surface estimation. These techniques represent the main building blocks of the perception system for self-driving cars. For the final project in this course, you will develop algorithms that identify bounding boxes for objects in the scene, and define the boundaries of the drivable surface. You'll work with synthetic and real image data, and evaluate your performance on a realistic dataset. This is an advanced course, intended for learners with a background in computer vision and deep learning. To succeed in this course, you should have programming experience in Python 3.0, and familiarity with Linear Algebra (matrices, vectors, matrix multiplication, rank, Eigenvalues and vectors and inverses).

This module introduces the main concepts from the broad and exciting field of computer vision needed to progress through perception methods for self-driving vehicles. The main components include camera models and their calibration, monocular and stereo vision, projective geometry, and convolution operations.

What's included

4 videos4 readings1 discussion prompt

4 videosTotal 18 minutes

Welcome to the Self-Driving Cars Specialization!6 minutes
Welcome to the course5 minutes
Meet the Instructor, Steven Waslander6 minutes
Meet the Instructor, Jonathan Kelly2 minutes

4 readingsTotal 60 minutes

Course Prerequisites15 minutes
How to Use Discussion Forums15 minutes
How to Use Supplementary Readings in This Course15 minutes
Recommended Textbooks15 minutes

1 discussion promptTotal 30 minutes

Get to Know Your Classmates30 minutes

This module introduces the main concepts from the broad field of computer vision needed to progress through perception methods for self-driving vehicles. The main components include camera models and their calibration, monocular and stereo vision, projective geometry, and convolution operations.

What's included

6 videos4 readings1 assignment1 programming assignment2 ungraded labs

6 videosTotal 43 minutes

Lesson 1 Part 1: The Camera Sensor7 minutes
Lesson 1 Part 2: Camera Projective Geometry8 minutes
Lesson 2: Camera Calibration7 minutes
Lesson 3 Part 1: Visual Depth Perception - Stereopsis8 minutes
Lesson 3 Part 2: Visual Depth Perception - Computing the Disparity6 minutes
Lesson 4: Image Filtering7 minutes

4 readingsTotal 90 minutes

Supplementary Reading: The Camera Sensor30 minutes
Supplementary Reading: Camera Calibration15 minutes
Supplementary Reading: Visual Depth Perception30 minutes
Supplementary Reading: Image Filtering15 minutes

1 assignmentTotal 30 minutes

Module 1 Graded Quiz30 minutes

1 programming assignmentTotal 90 minutes

(Submission) Applying Stereo Depth to a Driving Scenario90 minutes

2 ungraded labsTotal 180 minutes

Practice Assignment: Applying Stereo Depth to a Driving Scenario120 minutes
(Solution) Applying Stereo Depth to a Driving Scenario60 minutes

Visual features are used to track motion through an environment and to recognize places in a map. This module describes how features can be detected and tracked through a sequence of images and fused with other sources for localization as described in Course 2. Feature extraction is also fundamental to object detection and semantic segmentation in deep networks, and this module introduces some of the feature detection methods employed in that context as well.

What's included

6 videos5 readings1 programming assignment1 ungraded lab

6 videosTotal 44 minutes

Lesson 1: Introduction to Image features and Feature Detectors7 minutes
Lesson 2: Feature Descriptors7 minutes
Lesson 3 Part 1: Feature Matching7 minutes
Lesson 3 Part 2: Feature Matching: Handling Ambiguity in Matching5 minutes
Lesson 4: Outlier Rejection8 minutes
Lesson 5: Visual Odometry10 minutes

5 readingsTotal 85 minutes

Supplementary Reading: Feature Detectors and Descriptors30 minutes
Supplementary Reading: Feature Matching15 minutes
Supplementary Reading: Feature Matching15 minutes
Supplementary Reading: Outlier Rejection15 minutes
Supplementary Reading: Visual Odometry10 minutes

1 programming assignmentTotal 150 minutes

Visual Odometry for Localization in Autonomous Driving150 minutes

1 ungraded labTotal 150 minutes

Visual Odometry for Localization in Autonomous Driving150 minutes

Deep learning is a core enabling technology for self-driving perception. This module briefly introduces the core concepts employed in modern convolutional neural networks, with an emphasis on methods that have been proven to be effective for tasks such as object detection and semantic segmentation. Basic network architectures, common components and helpful tools for constructing and training networks are described.

What's included

6 videos6 readings1 assignment

6 videosTotal 58 minutes

Lesson 1: Feed Forward Neural Networks10 minutes
Lesson 2: Output Layers and Loss Functions11 minutes
Lesson 3: Neural Network Training with Gradient Descent11 minutes
Lesson 4: Data Splits and Neural Network Performance Evaluation8 minutes
Lesson 5: Neural Network Regularization9 minutes
Lesson 6: Convolutional Neural Networks9 minutes

6 readingsTotal 80 minutes

Supplementary Reading: Feed-Forward Neural Networks15 minutes
Supplementary Reading: Output Layers and Loss Functions15 minutes
Supplementary Reading: Neural Network Training with Gradient Descent15 minutes
Supplementary Reading: Data Splits and Neural Network Performance Evaluation10 minutes
Supplementary Reading: Neural Network Regularization15 minutes
Supplementary Reading: Convolutional Neural Networks10 minutes

1 assignmentTotal 30 minutes

Feed-Forward Neural Networks30 minutes

The two most prevalent applications of deep neural networks to self-driving are object detection, including pedestrian, cyclists and vehicles, and semantic segmentation, which associates image pixels with useful labels such as sign, light, curb, road, vehicle etc. This module presents baseline techniques for object detection and the following module introduce semantic segmentation, both of which can be used to create a complete self-driving car perception pipeline.

What's included

4 videos4 readings1 assignment

4 videosTotal 52 minutes

Lesson 1: The Object Detection Problem15 minutes
Lesson 2: 2D Object detection with Convolutional Neural Networks11 minutes
Lesson 3: Training vs. Inference11 minutes
Lesson 4: Using 2D Object Detectors for Self-Driving Cars14 minutes

4 readingsTotal 120 minutes

Supplementary Reading: The Object Detection Problem15 minutes
Supplementary Reading: 2D Object detection with Convolutional Neural Networks30 minutes
Supplementary Reading: Training vs. Inference45 minutes
Supplementary Reading: Using 2D Object Detectors for Self-Driving Cars30 minutes

1 assignmentTotal 30 minutes

Object Detection For Self-Driving Cars30 minutes

The second most prevalent application of deep neural networks to self-driving is semantic segmentation, which associates image pixels with useful labels such as sign, light, curb, road, vehicle etc. The main use for segmentation is to identify the drivable surface, which aids in ground plane estimation, object detection and lane boundary assessment. Segmentation labels are also being directly integrated into object detection as pixel masks, for static objects such as signs, lights and lanes, and moving objects such cars, trucks, bicycles and pedestrians.

What's included

3 videos3 readings1 assignment

3 videosTotal 31 minutes

Lesson 1: The Semantic Segmentation Problem8 minutes
Lesson 2: ConvNets for Semantic Segmentation11 minutes
Lesson 3: Semantic Segmentation for Road Scene Understanding11 minutes

3 readingsTotal 90 minutes

Supplementary Reading: The Semantic Segmentation Problem30 minutes
Supplementary Reading: ConvNets for Semantic Segmentation30 minutes
Supplementary Reading: Semantic Segmentation for Road Scene Understanding30 minutes

1 assignmentTotal 20 minutes

Semantic Segmentation For Self-Driving Cars20 minutes

The final module of this course focuses on the implementation of a collision warning system that alerts a self-driving car about the position and category of obstacles present in their lane. The project is comprised of three major segments: 1) Estimating the drivable space in 3D, 2) Semantic Lane Estimation and 3) Filter wrong output from object detection using semantic segmentation.

What's included

4 videos1 programming assignment1 discussion prompt1 ungraded lab

4 videosTotal 24 minutes

Project Overview: Using CARLA for object detection and segmentation6 minutes
Final Project Hints6 minutes
Final Project Solution [LOCKED]9 minutes
Congratulations for completing the course!3 minutes

1 programming assignmentTotal 180 minutes

Environment Perception For Self-Driving Cars180 minutes

1 discussion promptTotal 15 minutes

Your Learning Journey15 minutes

1 ungraded labTotal 180 minutes

Environment Perception For Self-Driving Cars180 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings

(76 ratings)

Steven Waslander

University of Toronto

4 Courses182,089 learners

Jonathan Kelly

University of Toronto

4 Courses182,089 learners

Offered by

University of Toronto

Explore more from Software Development

University of Toronto
Introduction to Self-Driving Cars
Course
Status: Free Trial
University of Toronto
Motion Planning for Self-Driving Cars
Course
Status: Free Trial
Columbia University
Visual Perception
Course
Status: Free Trial
Packt
Self-Driving Car Specialization Course
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
77.55%
4 stars
16.32%
3 stars
3.91%
2 stars
0.68%
1 star
1.53%

Showing 3 of 587

Reviewed on Mar 18, 2025

it was good, but it could be more in depth. what provided in the course was just the tip of the iceberg.

Reviewed on Oct 6, 2019

Many thanks for this amazing course!!!! was very hard to me but I have learned a lot!!! Thanks!!!

Reviewed on Jul 17, 2019

Content is great but lack of instructor support makes the course hard to understand.

View more reviews

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.