What Is Django and How Is It Used?
October 25, 2024
Article
(16 reviews)
Recommended experience
Intermediate level
Anyone with intermediate Python knowledge who’s interested in learning about using the Reinforcement Learning from Human Feedback technique.
(16 reviews)
Recommended experience
Intermediate level
Anyone with intermediate Python knowledge who’s interested in learning about using the Reinforcement Learning from Human Feedback technique.
Get a conceptual understanding of Reinforcement Learning from Human Feedback (RLHF), as well as the datasets needed for this technique.
Fine-tune the Llama 2 model using RLHF with the open source Google Cloud Pipeline Components Library.
Evaluate tuned model performance against the base model with evaluation methods.
Only available on desktop
Large language models (LLMs) are trained on human-generated text, but additional methods are needed to align an LLM with human values and preferences.
Reinforcement Learning from Human Feedback (RLHF) is currently the main method for aligning LLMs with human values and preferences. RLHF is also used for further tuning a base LLM to align with values and preferences that are specific to your use case. In this course, you will gain a conceptual understanding of the RLHF training process, and then practice applying RLHF to tune an LLM. You will: 1. Explore the two datasets that are used in RLHF training: the “preference” and “prompt” datasets. 2. Use the open source Google Cloud Pipeline Components Library, to fine-tune the Llama 2 model with RLHF. 3. Assess the tuned LLM against the original base model by comparing loss curves and using the “Side-by-Side (SxS)” method.
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
DeepLearning.AI is an education technology company that develops a global community of AI talent. DeepLearning.AI's expert-led educational experiences provide AI practitioners and non-technical professionals with the necessary tools to go all the way from foundational basics to advanced application, empowering them to build an AI-powered future.
Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
16 reviews
68.75%
31.25%
0%
0%
0%
Showing 3 of 16
Reviewed on Jan 11, 2025
Overall worth a shot. Not in depth but good overview
University of Alberta
Specialization
DeepLearning.AI
Course
DeepLearning.AI
Course
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Earn a degree from world-class universities - 100% online
Upskill your employees to excel in the digital economy
In Projects, you'll complete an activity or scenario by following a set of instructions in an interactive hands-on environment. Projects are completed in a real cloud environment and within real instances of various products as opposed to a simulation or demo environment.
By purchasing a Project, you'll get everything you need to complete the Project including temporary access to any product required to complete the Project.
Even though Projects are technically available on mobile devices, we highly recommend that you complete Projects on a laptop or desktop only.
Yes, you can download and keep any of your created files from the Project. To do so, please make sure you save any files and work to your device before exiting the product environment.
Projects are not eligible for refunds. See our full refund policy.
Financial aid is not available for Projects.
In rare instances, Projects may be taken down for maintenance or other reasons. If you are experiencing any issues, please contact us.
Auditing is not available for Projects.
At the top of the page, you can view the experience level recommended for this Project.
Yes, everything you need to complete your Project will be available in your browser.