Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
Completed by Mohommed Arman Motiwala
January 13, 2025
1 hour (approximately)
Mohommed Arman Motiwala's account is verified. Coursera certifies their successful completion of Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
What you will learn
Extract and store metadata of documents containing both text and images, and generate embeddings for the documents.
Search the metadata with text queries to find similar text or images.
Search the metadata with image queries to find similar images.
Using a text query as input, search for contextual answers using both text and images.
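The search steps listed above can be illustrated with a minimal sketch. It is not the course's code and does not call the Vertex AI Gemini API: the metadata records, their three-dimensional embedding vectors, and the names `METADATA`, `cosine_similarity`, and `search` are all hypothetical, and cosine similarity is assumed as the ranking measure. In practice the embeddings would come from an embedding model rather than being written by hand.

```python
import math

# Hypothetical metadata store: one record per text chunk or image,
# each with a toy hand-written embedding (real ones come from a model).
METADATA = [
    {"id": "page1-text", "kind": "text", "embedding": [0.9, 0.1, 0.0]},
    {"id": "page1-chart", "kind": "image", "embedding": [0.8, 0.2, 0.1]},
    {"id": "page2-photo", "kind": "image", "embedding": [0.0, 0.1, 0.9]},
]


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def search(query_embedding, kind=None, top_k=2):
    """Rank stored records by similarity to the query embedding.

    kind=None searches text and images together; kind="image" or
    kind="text" restricts the search to one modality.
    """
    candidates = [m for m in METADATA if kind is None or m["kind"] == kind]
    ranked = sorted(
        candidates,
        key=lambda m: cosine_similarity(query_embedding, m["embedding"]),
        reverse=True,
    )
    return [m["id"] for m in ranked[:top_k]]
```

A query embedding, whether derived from text or from an image, is passed to `search`; for example, `search([0.0, 0.0, 1.0], kind="image", top_k=1)` restricts the lookup to image records. The retrieved text and images would then be supplied to the model as context for answering the query.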
Skills you will gain
- Embeddings
- Artificial Intelligence
- Multimodal Prompts
- Prompt Engineering
- Data Store
- Cloud Computing
- Gemini
- Retrieval-Augmented Generation
- Metadata Management
- Image Analysis
- Large Language Modeling
- Google Gemini

