Introduction to Computer Vision

Get one of our best deals with Coursera Plus for $199 (usually $399). Save now.

Introduction to Computer Vision

This course is part of Computer Vision Specialization

Instructor: Tom Yeh

8,689 already enrolled

Included with

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

36 reviews

Beginner level

Recommended experience

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

Build toward a degree

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

36 reviews

Beginner level

Recommended experience

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

Build toward a degree

Learn more

What you'll learn

Understand the fundamental principles and algorithms of classical computer vision.
Apply deep learning models to various computer vision tasks.
Evaluate and implement computer vision solutions for real-world applications.

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

23 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Computer Vision Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 4 modules in this course

Introduction to Computer Vision guides learners through the essential algorithms and methods to help computers 'see' and interpret visual data. You will first learn the core concepts and techniques that have been traditionally used to analyze images. Then, you will learn modern deep learning methods, such as neural networks and specific models designed for image recognition, and how it can be used to perform more complex tasks like object detection and image segmentation. Additionally, you will learn the creation and impact of AI-generated images and videos, exploring the ethical considerations of such technology.

This course can be taken for academic credit as part of CU Boulder’s MS in Data Science or MS in Computer Science degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more: MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder

Welcome to Introduction to Computer Vision, the first course in the Computer Vision specialization. In this first module, you'll be introduced to how this course operates "by Hand" and "in Excel." Then, you'll build a foundation in image matrices and arrays to explore different image types: binary, grayscale, and RGB. Next, you'll transition into using functions to perform basic image operations such as addition, negation, and masking. You'll then be introduced to the concept of image transformation through linear algebra. Finally, you'll perform translation, scaling, and rotation matrix operations.

What's included

34 videos9 readings8 assignments

34 videosTotal 136 minutes

Meet Your Instructor 3 minutes
Image Overview2 minutes
Image Array & Matrix2 minutes
Binary Image & Byte Array2 minutes
Double Image3 minutes
RGB Image5 minutes
LED Display4 minutes
Byte Image 32x323 minutes
Greyscale4 minutes
RGB Image 32x32x33 minutes
LED Display 32x324 minutes
2D Image Function3 minutes
Add Images3 minutes
Solid Square2 minutes
Add, Negate, and Multiply3 minutes
Flip Axes3 minutes
Linear Combination3 minutes
Masking3 minutes
Absolute Reference12 minutes
L1 & L2 Function Examples2 minutes
2D Gaussian5 minutes
Array Formula9 minutes
Pixels vs. Function vs. Points6 minutes
Translate and Scale by Linear Combination5 minutes
Matrix Multiplication10 minutes
Translate and Scale Matrix5 minutes
Multiple Transformations3 minutes
Rotation Matrix4 minutes
Matrix Multiplication Associativity3 minutes
Matrix Multiplication in Excel4 minutes
Linear Transformation 3 minutes
Scale and Translate in Excel4 minutes
Rotate and Multiple Transformations4 minutes
Pre-multiplied Transformation Matrix 3 minutes

9 readingsTotal 57 minutes

Course Updates and Accessibility Support1 minute
Earn Academic Credit for your Work!10 minutes
Course Support10 minutes
Inside the Course10 minutes
Assessment Expectations10 minutes
AI Citation and Acknowledgement10 minutes
Get the Workbook: Image2 minutes
Get the Workbook: Function2 minutes
Get the Workbook: Transform2 minutes

8 assignmentsTotal 155 minutes

AI Policy Quiz5 minutes
Image, Function, and Transform60 minutes
Image by Hand15 minutes
Image in Excel15 minutes
Function by Hand15 minutes
Function in Excel15 minutes
Transform by Hand15 minutes
Transform in Excel15 minutes

This module dives into feature extraction—quantitative measures that describe image content. Students compute features such as image mass, center, and statistical moments to describe the shape and structure of images. These are implemented both manually and in Excel. The module also explores how to compare images using distance metrics and similarity measures, offering insight into how visual data can be analyzed, categorized, and classified.

What's included

23 videos2 readings5 assignments

23 videosTotal 104 minutes

Image Mass2 minutes
Image Center5 minutes
First Moment2 minutes
Second Moment4 minutes
Image Gradients8 minutes
Image Histogram6 minutes
Image Batch, Mass, and Center8 minutes
First Moment in Excel4 minutes
Second Moment in Excel4 minutes
Parameterized Moment Calculation5 minutes
Image Gradient in Excel8 minutes
Image Histogram in Excel5 minutes
Histogram of Gradients (HOG)5 minutes
Similarity vs. Distance6 minutes
L1 and L2 Distance2 minutes
L2 Normalization3 minutes
Cosine Similarity2 minutes
Cross Entropy3 minutes
L1 and L2 Distance in Excel2 minutes
L2 Normalization in Excel2 minutes
L1 and L2 Distance Map6 minutes
Cosine Similarity and Cross Entropy in Excel4 minutes
Comparing Two Groups10 minutes

2 readingsTotal 4 minutes

Get the Workbook: Feature2 minutes
Get the Workbook: Compare2 minutes

5 assignmentsTotal 90 minutes

Feature and Compare30 minutes
Feature by Hand15 minutes
Feature in Excel15 minutes
Compare by Hand15 minutes
Compare in Excel15 minutes

Filtering techniques are central to detecting patterns in images. This module introduces learners to 1D and 2D filters, covering foundational concepts like convolution, cross-correlation, and Gaussian smoothing. Through both manual and spreadsheet-based exercises, learners apply various filters (e.g., mean, Laplacian, Sobel) and morphological operations like dilation and erosion. These filtering methods enhance image features, detect edges, and prepare data for further processing.

What's included

26 videos2 readings5 assignments

26 videosTotal 109 minutes

Overview and Scale2 minutes
Sliding Window and Cross-Correlation7 minutes
Convolution by Hand3 minutes
Lapacian Filter by Hand5 minutes
Shift Filter by Hand3 minutes
ReLU & Maxpool by Hand3 minutes
Scale and Sum Filter3 minutes
Mean, Lapacian, and Shift Filter6 minutes
Detection in Excel3 minutes
Cross-Correlation and Convolution7 minutes
Gaussian Filter2 minutes
Parameterized Gaussian Filter8 minutes
ReLU & Maxpool in Excel3 minutes
Sliding Window by Hand3 minutes
Dilate by Hand4 minutes
Erode by Hand3 minutes
Cross-Correlation for Filter 2D5 minutes
Convolution for Filter 2D4 minutes
Mean Filter for Filter 2D3 minutes
Sliding Window in Excel4 minutes
Dilate in Excel4 minutes
Erode in Excel3 minutes
Open and Close Filter 2D8 minutes
Smoothing in Excel6 minutes
Lapacian Filter in Excel4 minutes
Sobel Filter in Excel4 minutes

2 readingsTotal 4 minutes

Get the Workbook: Filter 1D2 minutes
Get the Workbook: Filter 2D2 minutes

5 assignmentsTotal 90 minutes

Filter 1D & 2D30 minutes
Filter 1D by Hand15 minutes
Filter 1D in Excel15 minutes
Filter 2D by Hand15 minutes
Filter 2D in Excel15 minutes

This module delves into key concepts of camera models and their role in computer vision and photogrammetry. You will learn about the Extrinsic Matrix, exploring how it defines the position and orientation of a camera in 3D space. Understand the Pinhole Camera Model, a simplified optical system that forms the basis for many computer vision applications, alongside the Intrinsic Matrix, which captures the internal parameters of the camera. Epipolar geometry is examined, with a focus on its significance in 3D reconstruction and stereo vision. The module covers the motivation behind epipolar geometry, breaking down its basic components, and explaining the Essential Matrix, which encapsulates the geometric relationship between camera views, as well as the Fundamental Matrix, a core component in epipolar geometry that represents the relationship between two cameras in stereo vision.

What's included

15 videos3 readings5 assignments

15 videosTotal 119 minutes

Orthographic Projection9 minutes
World to Camera11 minutes
Camera (3D) to Pixel (2D)11 minutes
Extrinsic & Intrinsic Matrix6 minutes
Motivation for Epipolar Geometry8 minutes
Basic Components of Epipolar Geometry12 minutes
Epipolar Constraints 7 minutes
Derive the Epipolar Constraint Equation9 minutes
Object in the World 3 minutes
Two Camera System 11 minutes
Pixel to World10 minutes
Epipolar Line4 minutes
Pixels to Epipolar Lines3 minutes
Epipolar Constraints (Camera)8 minutes
Essential and Fundamental Matrix7 minutes

3 readingsTotal 6 minutes

Get the Workbook: Camera2 minutes
Get the Workbook: Epipolar Part 12 minutes
Get the Workbook: Epipolar Part 2 & 32 minutes

5 assignmentsTotal 90 minutes

Camera and Epipolar30 minutes
Camera15 minutes
Epipolar Part 115 minutes
Epipolar Part 215 minutes
Epipolar Part 315 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹

Instructor

Instructor ratings

(6 ratings)

Tom Yeh

University of Colorado Boulder

4 Courses21,364 learners

Offered by

University of Colorado Boulder

Explore more from Algorithms

IBM
Introduction to Computer Vision and Image Processing
Course
Status: Free Trial
MathWorks
Introduction to Computer Vision
Course
Status: Free Trial
MathWorks
Introduction to Deep Learning for Computer Vision
Course
Status: Free Trial
University of Colorado Boulder
Deep Learning for Computer Vision
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
72.97%
4 stars
13.51%
3 stars
8.10%
2 stars
2.70%
1 star
2.70%

Showing 3 of 36

Reviewed on Feb 21, 2026

The course was nice and easy until the last module where some lectures were presented in a very confused way.

View more reviews

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.