What Is Bayesian Statistics?
December 11, 2024
Article
(23 reviews)
Recommended experience
Intermediate level
For anyone who wants to start building multimodal applications. Basic Python knowledge, as well as familiarity with RAG is recommended.
(23 reviews)
Recommended experience
Intermediate level
For anyone who wants to start building multimodal applications. Basic Python knowledge, as well as familiarity with RAG is recommended.
Learn multimodality with contrastive learning to create modality-independent embeddings for seamless any-to-any retrieval.
Build multimodal RAG systems that retrieve multimodal context and reason over it to generate more relevant answers.
Implement industry applications of multimodal search and build multi-vector recommender systems.
Only available on desktop
Learn how to build multimodal search and RAG systems. RAG systems enhance an LLM by incorporating proprietary data into the prompt context. Typically, RAG applications use text documents, but, what if the desired context includes multimedia like images, audio, and video? This course covers the technical aspects of implementing RAG with multimodal data to accomplish this.
1. Learn how multimodal models are trained through contrastive learning and implement it on a real dataset. 2. Build any-to-any multimodal search to retrieve relevant context across different data types. 3. Learn how LLMs are trained to understand multimodal data through visual instruction tuning and use them on multiple image reasoning examples. 4. Implement an end-to-end multimodal RAG system that analyzes retrieved multimodal context to generate insightful answers. 5. Explore industry applications like visually analyzing invoices and flowcharts to output structured data. 6. Create a multi-vector recommender system that suggests relevant items by comparing their similarities across multiple modalities. As AI systems increasingly need to process and reason over multiple data modalities, learning how to build such systems is an important skill for AI developers. This course equips you with the key skills to embed, retrieve, and generate across different modalities. By gaining a strong foundation in multimodal AI, you’ll be prepared to build smarter search, RAG, and recommender systems.
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
DeepLearning.AI is an education technology company that develops a global community of AI talent. DeepLearning.AI's expert-led educational experiences provide AI practitioners and non-technical professionals with the necessary tools to go all the way from foundational basics to advanced application, empowering them to build an AI-powered future.
Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
DeepLearning.AI
Course
Coursera Instructor Network
Course
DeepLearning.AI
Course
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Earn a degree from world-class universities - 100% online
Upskill your employees to excel in the digital economy
In Projects, you'll complete an activity or scenario by following a set of instructions in an interactive hands-on environment. Projects are completed in a real cloud environment and within real instances of various products as opposed to a simulation or demo environment.
By purchasing a Project, you'll get everything you need to complete the Project including temporary access to any product required to complete the Project.
Even though Projects are technically available on mobile devices, we highly recommend that you complete Projects on a laptop or desktop only.
Yes, you can download and keep any of your created files from the Project. To do so, please make sure you save any files and work to your device before exiting the product environment.
Projects are not eligible for refunds. See our full refund policy.
Financial aid is not available for Projects.
In rare instances, Projects may be taken down for maintenance or other reasons. If you are experiencing any issues, please contact us.
Auditing is not available for Projects.
At the top of the page, you can view the experience level recommended for this Project.
Yes, everything you need to complete your Project will be available in your browser.