Build Multimodal Generative AI Applications
Completed by Prosper Enwerem
March 20, 2026
7 hours (approximately)
Prosper Enwerem's account is verified. Coursera certifies their successful completion of Build Multimodal Generative AI Applications.
What you will learn
Gain the job-ready skills you need to build multimodal generative AI applications in just 3 weeks
Understand the fundamental concepts and challenges in multimodal AI, including the integration of text, speech, images, and video
Build multimodal AI applications using state-of-the-art models and frameworks such as IBM’s Granite, Meta’s Llama, OpenAI’s Whisper, DALL·E and Sora
Develop multimodal AI solutions, including chatbots and image/video generation models, using IBM watsonx.ai, Hugging Face, Flask and Gradio
Skills you will gain
- Multimodal Prompts
- LLM Application
- Prompt Engineering
- Application Development
- Flask (Web Framework)
- Web Development
- Generative Model Architectures
- Software Development
- Web Applications
- OpenAI API

