Retrieval Optimization: Tokenization to Vector Quantization

Retrieval Optimization: Tokenization to Vector Quantization

Instructor: Kacper Łukawski

Project

Build in-demand job skills with step-by-step instructions

Beginner level

Recommended experience

1 hour

Learn at your own pace

Hands-on learning

Learn more

Project

Build in-demand job skills with step-by-step instructions

Beginner level

Recommended experience

1 hour

Learn at your own pace

Hands-on learning

Learn more

What you'll learn

Learn how tokenization works in large language and embedding models and how the tokenizer can affect the quality of your search.
Explore how different tokenization techniques including Byte-Pair Encoding, WordPiece, and Unigram are trained and work.
Understand how to measure the quality of your retrieval and how to optimize your search by adjusting HNSW parameters and vector quantizations.

Details to know

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

Learn, practice, and apply job-ready skills in less than 2 hours

Receive training from industry experts
Gain hands-on experience solving real-world job tasks

About this project

In Retrieval Optimization: Tokenization to Vector Quantization, taught by Kacper Łukawski, Developer Relations Lead of Qdrant, you’ll learn all about tokenization and also how to optimize vector search in your large-scale customer-facing RAG applications. You’ll explore the technical details of how vector search works and how to optimize it for better performance.

This course focuses on optimizing the first step in your RAG and search results. You’ll see how different tokenization techniques like Byte-Pair Encoding, WordPiece, and Unigram work and how they affect search relevancy. You’ll also learn how to address common challenges such as terminology mismatches and truncated chunks in embedding models. To optimize your search, you need to be able to measure its quality. You will learn several quality metrics for this purpose. Most vector databases use Hierarchical Navigable Small Worlds (HNSW) for approximate nearest-neighbor search. You’ll see how to balance the HNSW parameters for higher speed and maximum relevance. Finally, you would use different vector quantization techniques to enhance memory usage and search speed. What you’ll do, in detail: 1. Learn about the internal workings of the embedding model and how your text is turned into vectors. 2. Understand how several tokenizers such as Byte-Pair Encoding, WordPiece, Unigram, and SentencePiece are trained. 3. Explore common challenges with tokenizers such as unknown tokens, domain-specific identifiers, and numerical values, that can negatively affect your vector search. 4. Understand how to measure the quality of your search across several quality metrics. 5. Understand how the main parameters in HNSW algorithms affect the relevance and speed of vector search and how to optimally adjust these parameters. 6. Experiment with the three major quantization methods, product, scalar, and binary, and learn how they impact memory requirements, search quality, and speed. By the end of this course, you’ll have a solid understanding of how tokenization is done and how to optimize vector search in your RAG systems.

Instructor

Kacper Łukawski

DeepLearning.AI

1 Course127 learners

Offered by

DeepLearning.AI

How you'll learn

Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Google Cloud
Vector Search and Embeddings
Course
DeepLearning.AI
Building Applications with Vector Databases
Course
DeepLearning.AI
Vector Databases: from Embeddings to Applications
Course
DeepLearning.AI
Quantization in Depth
Course

New to Software Development? Start here.

What Is Django and How Is It Used?

October 25, 2024

Article

Transforming Medicine: The Impact of Augmented Reality in Health Care

January 7, 2025

Article

What Does a Software Engineer Do?

January 10, 2025

Article

React Developer Salary: From Entry-Level to Senior Engineer

November 29, 2023

Article

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

In Projects, you'll complete an activity or scenario by following a set of instructions in an interactive hands-on environment. Projects are completed in a real cloud environment and within real instances of various products as opposed to a simulation or demo environment.

By purchasing a Project, you'll get everything you need to complete the Project including temporary access to any product required to complete the Project.

Even though Projects are technically available on mobile devices, we highly recommend that you complete Projects on a laptop or desktop only.

Goals

Subjects

Retrieval Optimization: Tokenization to Vector Quantization

What you'll learn

Details to know

See how employees at top companies are mastering in-demand skills

Learn, practice, and apply job-ready skills in less than 2 hours

About this project

Instructor

Offered by

How you'll learn

Why people choose Coursera for their career

You might also like

Vector Search and Embeddings

Building Applications with Vector Databases

Vector Databases: from Embeddings to Applications

Quantization in Depth

New to Software Development? Start here.

What Is Django and How Is It Used?

Transforming Medicine: The Impact of Augmented Reality in Health Care

What Does a Software Engineer Do?

React Developer Salary: From Entry-Level to Senior Engineer

Open new doors with Coursera Plus

Advance your career with an online degree

Join over 3,400 global companies that choose Coursera for Business

Frequently asked questions

More questions

Retrieval Optimization: Tokenization to Vector Quantization

What you'll learn

Details to know

See how employees at top companies are mastering in-demand skills

Learn, practice, and apply job-ready skills in less than 2 hours

About this project

Instructor

Offered by

How you'll learn

Why people choose Coursera for their career

You might also like

Vector Search and Embeddings

Building Applications with Vector Databases

Vector Databases: from Embeddings to Applications

Quantization in Depth

New to Software Development? Start here.

What Is Django and How Is It Used?

Transforming Medicine: The Impact of Augmented Reality in Health Care

What Does a Software Engineer Do?

React Developer Salary: From Entry-Level to Senior Engineer

Open new doors with Coursera Plus

Advance your career with an online degree

Join over 3,400 global companies that choose Coursera for Business

Frequently asked questions

What is the learning experience like with Projects?

What will I get if I purchase a Project?

Are Projects available on desktop and mobile?

More questions