What Is Django and How Is It Used?
October 25, 2024
Article
Recommended experience
Beginner level
Anyone with basic Python knowledge who wants to learn to build effective customer-facing RAG applications!
Recommended experience
Beginner level
Anyone with basic Python knowledge who wants to learn to build effective customer-facing RAG applications!
Learn how tokenization works in large language and embedding models and how the tokenizer can affect the quality of your search.
Explore how different tokenization techniques including Byte-Pair Encoding, WordPiece, and Unigram are trained and work.
Understand how to measure the quality of your retrieval and how to optimize your search by adjusting HNSW parameters and vector quantizations.
October 2024
Only available on desktop
In Retrieval Optimization: Tokenization to Vector Quantization, taught by Kacper Łukawski, Developer Relations Lead of Qdrant, you’ll learn all about tokenization and also how to optimize vector search in your large-scale customer-facing RAG applications. You’ll explore the technical details of how vector search works and how to optimize it for better performance.
This course focuses on optimizing the first step in your RAG and search results. You’ll see how different tokenization techniques like Byte-Pair Encoding, WordPiece, and Unigram work and how they affect search relevancy. You’ll also learn how to address common challenges such as terminology mismatches and truncated chunks in embedding models. To optimize your search, you need to be able to measure its quality. You will learn several quality metrics for this purpose. Most vector databases use Hierarchical Navigable Small Worlds (HNSW) for approximate nearest-neighbor search. You’ll see how to balance the HNSW parameters for higher speed and maximum relevance. Finally, you would use different vector quantization techniques to enhance memory usage and search speed. What you’ll do, in detail: 1. Learn about the internal workings of the embedding model and how your text is turned into vectors. 2. Understand how several tokenizers such as Byte-Pair Encoding, WordPiece, Unigram, and SentencePiece are trained. 3. Explore common challenges with tokenizers such as unknown tokens, domain-specific identifiers, and numerical values, that can negatively affect your vector search. 4. Understand how to measure the quality of your search across several quality metrics. 5. Understand how the main parameters in HNSW algorithms affect the relevance and speed of vector search and how to optimally adjust these parameters. 6. Experiment with the three major quantization methods, product, scalar, and binary, and learn how they impact memory requirements, search quality, and speed. By the end of this course, you’ll have a solid understanding of how tokenization is done and how to optimize vector search in your RAG systems.
DeepLearning.AI is an education technology company that develops a global community of AI talent. DeepLearning.AI's expert-led educational experiences provide AI practitioners and non-technical professionals with the necessary tools to go all the way from foundational basics to advanced application, empowering them to build an AI-powered future.
Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
Google Cloud
Course
DeepLearning.AI
Course
DeepLearning.AI
Course
DeepLearning.AI
Course
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Earn a degree from world-class universities - 100% online
Upskill your employees to excel in the digital economy
In Projects, you'll complete an activity or scenario by following a set of instructions in an interactive hands-on environment. Projects are completed in a real cloud environment and within real instances of various products as opposed to a simulation or demo environment.
By purchasing a Project, you'll get everything you need to complete the Project including temporary access to any product required to complete the Project.
Even though Projects are technically available on mobile devices, we highly recommend that you complete Projects on a laptop or desktop only.
Yes, you can download and keep any of your created files from the Project. To do so, please make sure you save any files and work to your device before exiting the product environment.
Projects are not eligible for refunds. See our full refund policy.
Financial aid is not available for Projects.
In rare instances, Projects may be taken down for maintenance or other reasons. If you are experiencing any issues, please contact us.
Auditing is not available for Projects.
At the top of the page, you can view the experience level recommended for this Project.
Yes, everything you need to complete your Project will be available in your browser.