Large language models (LLMs) are trained on human-generated text, but additional methods are needed to align an LLM with human values and preferences.
(11 reviews)
Recommended experience
What you'll learn
Get a conceptual understanding of Reinforcement Learning from Human Feedback (RLHF), as well as the datasets needed for this technique.
Fine-tune the Llama 2 model using RLHF with the open source Google Cloud Pipeline Components Library.
Evaluate tuned model performance against the base model with evaluation methods.
Details to know
July 2024
Only available on desktop
See how employees at top companies are mastering in-demand skills
Learn, practice, and apply job-ready skills in less than 2 hours
- Receive training from industry experts
- Gain hands-on experience solving real-world job tasks
About this project
Instructor
Offered by
How you'll learn
Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
Why people choose Coursera for their career
Learner reviews
11 reviews
- 5 stars
63.63%
- 4 stars
36.36%
- 3 stars
0%
- 2 stars
0%
- 1 star
0%
Showing 3 of 11
Reviewed on Jan 11, 2025
Overall worth a shot. Not in depth but good overview
You might also like
DeepLearning.AI
University of Michigan
Coursera Project Network
Google Cloud
Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy