Beginning Llamafile for Local Large Language Models (LLMs)
Instructor: Noah Gift
Sponsored by the Coursera Learning Team
Learners will gain the skills to serve powerful language models as practical, scalable web APIs. They will learn how to use the llama.cpp example server to expose a large language model through a set of REST API endpoints for tasks such as text generation, tokenization, and embedding extraction.
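As a rough illustration of the kind of API the course covers, the sketch below queries a llama.cpp example server assumed to be running at http://localhost:8080. The endpoint paths (/completion, /tokenize, /embedding) and payload fields follow the server's documented REST interface, but exact names and defaults can vary by version, so treat this as a hedged sketch rather than a definitive reference.

# Hedged sketch: query a llama.cpp example server assumed to be
# listening on http://localhost:8080 (endpoint names may vary by version).
import requests

BASE = "http://localhost:8080"

# Text generation via the /completion endpoint.
completion = requests.post(
    f"{BASE}/completion",
    json={"prompt": "Explain what a llamafile is in one sentence.", "n_predict": 64},
    timeout=120,
).json()
print(completion.get("content"))

# Tokenization via the /tokenize endpoint.
tokens = requests.post(
    f"{BASE}/tokenize",
    json={"content": "Hello, llama.cpp!"},
    timeout=30,
).json()
print(tokens.get("tokens"))

# Embedding extraction via /embedding (the server typically must be
# started with an embedding-enabling flag for this to work).
embedding = requests.post(
    f"{BASE}/embedding",
    json={"content": "Hello, llama.cpp!"},
    timeout=30,
).json()
print(len(embedding.get("embedding", [])))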
What you'll learn
Learn how to serve large language models as production-ready web APIs using the llama.cpp framework
Understand the architecture and capabilities of the llama.cpp example server for text generation, tokenization, and embedding extraction
Gain hands-on experience configuring and customizing the server with command line options and API parameters (see the sketch after this list)
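For a sense of what configuring and customizing looks like in practice, the sketch below pairs an assumed server launch command with a completion request that overrides common sampling parameters. The flag names and parameter names are drawn from llama.cpp's server documentation, but they are assumptions here and should be checked against the build used in the course.

# Hedged sketch: customizing generation through API parameters.
# The launch command in this comment is an assumption; consult
# `./server --help` (or `./model.llamafile --help`) for your build, e.g.:
#   ./server -m models/mixtral.gguf -c 2048 --host 127.0.0.1 --port 8080
import requests

payload = {
    "prompt": "Write a haiku about local inference.",
    "n_predict": 48,      # cap the number of generated tokens
    "temperature": 0.7,   # lower values give more deterministic output
    "top_k": 40,          # sample from the 40 most likely tokens
    "top_p": 0.95,        # nucleus sampling threshold
    "stop": ["\n\n"],     # stop generation at a blank line
}

response = requests.post("http://127.0.0.1:8080/completion", json=payload, timeout=120)
response.raise_for_status()
print(response.json().get("content"))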
Details to know
4 assignments
Earn a career certificate
There is 1 module in this course
This week, you will run large language models locally to keep your data private and avoid network latency and API fees, using the Mixtral model with llamafile.
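As a hedged sketch of that workflow, the snippet below assumes a Mixtral llamafile has already been downloaded, made executable, and started locally (recent releases serve on http://localhost:8080 by default), and then sends a chat request to its OpenAI-compatible endpoint. The file name and flags are placeholders, not the course's exact instructions.

# Hedged sketch: talking to a locally running Mixtral llamafile.
# Assumed setup (names and flags are placeholders; check the llamafile docs):
#   chmod +x mixtral-8x7b-instruct.llamafile
#   ./mixtral-8x7b-instruct.llamafile --server --nobrowser
import requests

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",  # OpenAI-compatible endpoint
    json={
        "model": "mixtral",  # largely informational when a single local model is loaded
        "messages": [
            {"role": "user", "content": "Why run an LLM locally instead of calling a cloud API?"}
        ],
        "temperature": 0.7,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])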
What's included
8 videos, 17 readings, 4 assignments, 1 discussion prompt, 4 ungraded labs