We are introducing a new course to replace the "Coding with ChatGPT" course in the Generative AI specialization. This updated course covers materials, models, and content released in 2024. New additions include material on using AI for image-to-text (vision), text-to-speech, speech-to-text, and the Assistants API. Each of these topics comes with new labs, lessons, and exercises.
Multimodal Generative AI: Vision, Speech, and Assistants
This course is part of the Getting Started with Generative AI API Specialization
Instructor: Kevin Noelsaint
What you'll learn
Learn how to use AI models for image-to-text (vision), text-to-speech, and speech-to-text tasks using the latest APIs released in 2024.
Details to know
October 2024
There are 4 modules in this course
Welcome to Week 1 of the course. This week's assignments cover vision and image-to-text capabilities. You'll learn how to analyze and interpret images using AI. The module ends with graded summative assessments.
What's included
5 readings, 3 app items
Welcome to Week 2 of the course. This week focuses on the fundamentals of text-to-speech (TTS). The assignments cover generating spoken audio in different voices. The module ends with graded summative assessments.
What's included
4 readings, 3 app items
Welcome to Week 3 of the course. You'll learn the basics of Whisper, the speech-to-text model, and work with ChatGPT to improve and optimize your use of the Whisper API. The module ends with graded summative assessments.
What's included
4 readings, 3 app items
Welcome to Week 4 of the course. This week's assignments cover the basics of the Assistants API, including its purpose, primary components, and available tools such as Code Interpreter, File Search, and Function Calling. The module ends with graded summative assessments.
What's included
4 readings, 3 app items