This course addresses evaluating Large Language Models (LLMs), starting with foundational evaluation methods, exploring advanced techniques with Vertex AI's tools like Automatic Metrics and AutoSxS, and forecasting the evolution of generative AI evaluation.
Evaluating Large Language Model Outputs: A Practical Guide
Instructor: Reza Moradinezhad
Sponsored by Coursera Learning Team
Recommended experience
What you'll learn
Identify the fundamentals of Large Language Models, including current evaluation methods and access to Vertex AI's evaluation models.
Apply hands-on knowledge of using Vertex AI's Automatic Metrics and AutoSxS for LLM evaluation.
Evaluate upcoming trends in generative AI evaluation, encompassing text, image, and audio models, and the importance of human evaluation.
Details to know
Add to your LinkedIn profile
1 assignment
See how employees at top companies are mastering in-demand skills
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
There is 1 module in this course
This course addresses evaluating Large Language Models (LLMs), starting with foundational evaluation methods, exploring advanced techniques with Vertex AI's tools like Automatic Metrics and AutoSxS, and forecasting the evolution of generative AI evaluation.
What's included
13 videos4 readings1 assignment2 discussion prompts
Instructor
Offered by
Why people choose Coursera for their career
Recommended if you're interested in Data Science
DeepLearning.AI
Duke University
Google Cloud
Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy