Generative AI and LLMs: Architecture and Data Preparation

This course is part of multiple programs.

Instructors: Joseph Santarcangelo

What you'll learn

Differentiate between generative AI architectures and models, such as RNNs, Transformers, VAEs, GANs, and Diffusion Models.
Describe how LLMs, such as GPT, BERT, BART, and T5, are used in language processing.
Implement tokenization to preprocess raw textual data using NLP libraries such as NLTK, spaCy, BertTokenizer, and XLNetTokenizer.
Create an NLP data loader using PyTorch to perform tokenization, numericalization, and padding of text data.

Details to know

Shareable certificate

Assessments

4 assignments

Taught in English

There are 2 modules in this course

This IBM short course, part of the Generative AI Engineering Essentials with LLMs Professional Certificate, teaches the basics of using generative AI and Large Language Models (LLMs). It is suitable for existing and aspiring data scientists, machine learning engineers, deep-learning engineers, and AI engineers.

You will learn about the types of generative AI and their real-world applications. You will gain the knowledge to differentiate between generative AI architectures and models, such as Recurrent Neural Networks (RNNs), Transformers, Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Diffusion Models, and you will learn how the training approach differs for each. You will be able to explain the use of LLMs, such as Generative Pre-trained Transformers (GPT) and Bidirectional Encoder Representations from Transformers (BERT).

You will also learn about the tokenization process, tokenization methods, and the use of tokenizers for word-based, character-based, and subword-based tokenization. You will be able to explain how data loaders are used when training generative AI models and list the PyTorch libraries for preparing and handling data within data loaders.

The knowledge acquired will help you use the generative AI libraries in Hugging Face. It will also prepare you to implement tokenization and create an NLP data loader. A basic knowledge of Python and PyTorch and an awareness of machine learning and neural networks are an advantage for this course, though not strictly required.
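The three tokenization granularities named above (word-based, character-based, and subword-based) can be compared with a small sketch. This is not taken from the course labs: it uses only the Python standard library, the function names and `TOY_VOCAB` are hypothetical, and the subword step is a simplified greedy longest-match split in the style of WordPiece rather than a trained tokenizer.

```python
import re

def word_tokenize(text):
    """Word-based: lowercase, then split into words and punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text.lower())

def char_tokenize(text):
    """Character-based: every character becomes its own token."""
    return list(text)

def subword_tokenize(word, vocab, unk="[UNK]"):
    """Subword-based: greedy longest-match-first split against a fixed vocabulary.

    Pieces that continue a word carry a "##" prefix (a WordPiece-style convention).
    If some part of the word cannot be matched, the whole word maps to `unk`.
    """
    pieces, start = [], 0
    while start < len(word):
        end = len(word)
        piece = None
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            return [unk]
        pieces.append(piece)
        start = end
    return pieces

# Hypothetical toy vocabulary; real tokenizers learn theirs from a corpus.
TOY_VOCAB = {"token", "##ization", "##izer", "##s"}

word_tokens = word_tokenize("Tokenizers split text.")
char_tokens = char_tokenize("text")
subword_tokens = subword_tokenize("tokenization", TOY_VOCAB)
```

Here `word_tokens` is `['tokenizers', 'split', 'text', '.']`, `char_tokens` is `['t', 'e', 'x', 't']`, and `subword_tokens` is `['token', '##ization']`. The subword split keeps the vocabulary small while still covering words such as "tokenizers" that never appear in it as a whole.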

In this module, you will learn about the significance of generative AI models and how they are used across a wide range of fields for generating various types of content. You will learn about the architectures and models commonly used in generative AI and the differences in the training approaches of these models. You will learn how large language models (LLMs) are used to build NLP-based applications. You will build a simple chatbot using the transformers library from Hugging Face.
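A single-turn chatbot of the kind described above might look like the following sketch. The course description does not say which model the lab uses, so the checkpoint name `facebook/blenderbot-400M-distill` is an assumption, as are the helper names `load_chatbot` and `reply`. The sketch assumes the `transformers` and `torch` packages are installed; the import is done lazily so that `reply` can be exercised with any tokenizer and model pair that exposes the same methods.

```python
def load_chatbot(model_name="facebook/blenderbot-400M-distill"):
    """Load a tokenizer and a sequence-to-sequence model from Hugging Face.

    The default model name is an assumption, not something the course states.
    The first call downloads the weights, so it needs network access.
    """
    # Imported here so the module can be loaded without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    return tokenizer, model

def reply(tokenizer, model, user_text, max_new_tokens=60):
    """Generate one response to one user message (no conversation history)."""
    # Tokenize the message into model inputs (PyTorch tensors).
    inputs = tokenizer(user_text, return_tensors="pt")
    # Let the model generate a bounded number of new tokens.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Convert the first generated sequence back into text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

To try it, call `tokenizer, model = load_chatbot()` once and then `reply(tokenizer, model, "Hello, how are you?")` for each message. Keeping loading and replying separate means the weights are loaded only once per session.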

What's included

5 videos, 2 readings, 2 assignments, 1 app item, 3 plugins

5 videos (total 28 minutes)

Overview of AI Engineering with LLMs (5 minutes)
Course Introduction (3 minutes)
Significance of Generative AI (5 minutes)
Generative AI Architectures and Models (6 minutes)
Generative AI for NLP (7 minutes)

2 readings (total 13 minutes)

Course Overview (10 minutes)
Summary and Highlights (3 minutes)

2 assignments (total 25 minutes)

Graded Quiz: Generative AI Architecture (15 minutes)
Practice Quiz: Generative AI Overview and Architecture (10 minutes)

1 app item (total 60 minutes)

Lab: Exploring Generative AI Libraries (60 minutes)

3 plugins (total 32 minutes)

Helpful Tips for Course Completion (2 minutes)
Reading: Basics of AI Hallucinations (10 minutes)
Reading: Overview of Libraries and Tools (20 minutes)

In this module, you will learn to prepare data for training large language models (LLMs) by implementing tokenization. You will learn about tokenization methods and the use of tokenizers. You will also learn about the purpose of data loaders and how to use the DataLoader class in PyTorch. You will implement tokenization using libraries and tokenizers such as NLTK, spaCy, BertTokenizer, and XLNetTokenizer. You will also create a data loader with a collate function that processes batches of text.
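The collate step mentioned above (tokenize each text, map tokens to integer IDs, then pad the batch to a common length) can be sketched in plain Python. This is an illustrative sketch, not the lab's code: the names `build_vocab` and `make_collate_fn`, the whitespace tokenizer, and the `<pad>` and `<unk>` tokens are all assumptions. In PyTorch, a function like this would be passed to `torch.utils.data.DataLoader` through its `collate_fn` argument, and the padded lists would typically be wrapped with `torch.tensor`.

```python
PAD, UNK = "<pad>", "<unk>"

def tokenize(text):
    """Minimal whitespace tokenizer; a library tokenizer could be swapped in."""
    return text.lower().split()

def build_vocab(texts):
    """Numericalization table: assign each new token the next free integer ID."""
    vocab = {PAD: 0, UNK: 1}
    for text in texts:
        for token in tokenize(text):
            vocab.setdefault(token, len(vocab))
    return vocab

def make_collate_fn(vocab):
    """Return a function that turns a batch of strings into padded ID lists."""
    def collate_fn(batch):
        # Tokenize and numericalize; unseen tokens fall back to the UNK ID.
        ids = [[vocab.get(tok, vocab[UNK]) for tok in tokenize(text)]
               for text in batch]
        # Pad every sequence on the right up to the longest one in this batch.
        max_len = max(len(seq) for seq in ids)
        return [seq + [vocab[PAD]] * (max_len - len(seq)) for seq in ids]
    return collate_fn

corpus = ["the cat sat", "the dog", "a cat sat down quietly"]
vocab = build_vocab(corpus)
collate_fn = make_collate_fn(vocab)

# "bird" is not in the vocabulary, so it maps to the UNK ID (1).
padded = collate_fn(["the cat sat", "the bird"])
```

With this toy corpus, `padded` is `[[2, 3, 4], [2, 1, 0]]`: the second sentence is shorter, so it receives one pad ID. Padding per batch rather than to a global maximum keeps batches of short texts small.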

What's included

2 videos, 5 readings, 2 assignments, 2 app items, 2 plugins

2 videos (total 13 minutes)

Tokenization (6 minutes)
Overview of Data Loaders (6 minutes)

5 readings (total 13 minutes)

Data Quality and Diversity for Effective LLM Training (5 minutes)
Summary and Highlights (2 minutes)
Course Conclusion (3 minutes)
Congratulations and Next Steps (2 minutes)
Team and Acknowledgments (1 minute)

2 assignments (total 25 minutes)

Graded Quiz: Data Preparation for LLMs (15 minutes)
Practice Quiz: Preparing Data (10 minutes)

2 app items (total 120 minutes)

Lab: Implementing Tokenization (60 minutes)
Lab: Creating an NLP Data Loader (60 minutes)

2 plugins (total 9 minutes)

Cheat Sheet: Generative AI and LLMs: Architecture and Data Preparation (5 minutes)
Course Glossary: Generative AI and LLMs: Architecture and Data Preparation (4 minutes)