- Data Manipulation
- Artificial Intelligence
- Query Languages
- Text Mining
- Metadata Management
- Image Analysis
- Google Gemini
- Data Capture
- Multimodal Prompts
- Data Store
- Embeddings
- Retrieval-Augmented Generation
Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
Completed by Argha Sarker
July 24, 2025
1 hours (approximately)
Argha Sarker's account is verified. Coursera certifies their successful completion of Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
What you will learn
Extract and store metadata of documents containing both text and images, and generate embeddings the documents.
Search the metadata with text queries to find similar text or images.
Search the metadata with image queries to find similar images.Using a text query as input, search for contextual answers using both text and images.
Skills you will gain

