EdX

Beginning Llamafile (edX)

Beginning Llamafile (edX)

Learn to serve powerful language models as practical, scalable web APIs using the llama.cpp server. Keep your data private and avoid cloud latency and fees.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

In this course, you will:

  • Gain the skills to expose large language models through REST API endpoints
  • Learn how to configure the llama.cpp server to customize model behavior
  • Understand how to efficiently handle requests and integrate language model capabilities into applications
  • Reinforce concepts through hands-on exercises and code examples using tools like curl and Python
  • Be equipped to deploy robust language model APIs for various NLP tasks

The course empowers you to harness state-of-the-art NLP models in your projects through a convenient and performant API interface, focusing on the practical aspects of serving large language models in production environments using the efficient and flexible llama.cpp framework.
This course is part of the Generative AI Fundamentals Professional Certificate.

What you'll learn

  • Installing and using the Cosmopolitan Libc toolkit
  • Running language models locally with llamafile
  • Understanding the Mixtral model license and llamafile packaging
  • Developing portable command-line interfaces with Cosmopolitan
  • Interacting with the llamafile API for NLP tasks

Syllabus

Module 1: Getting Started with Mozilla Llamafile (2 hours)

  • Video: Meet your instructor: Alfredo Deza (1 minute) [Preview module]
  • Reading: Meet your instructor: Noah Gift (1 minute)
  • Reading: Connect with your instructors (1 minute)
  • Reading: Course structure and etiquette (1 minute)
  • Reading: Key Terms (5 minutes)
  • Reading: What is Llamafile? (5 minutes)
  • Video: Llamafile overview by Mozilla (5 minutes)
  • Video: Using the Llamafile API (2 minutes)
  • Video: Creating a Llamafile (5 minutes)
  • Reading: Cosmopolitan (5 minutes)
  • Video: Building portable binaries with Cosmopolitan (4 minutes)
  • Video: Building a phrase generator with cosmopolitan (3 minutes)
  • Reading: Lesson Reflection (5 minutes)
  • Assignment: Quiz-Key Components of Llamafile (10 minutes)
  • Reading: Key Terms (1 minute)
  • Reading: Bash Phrase Generator (5 minutes)
  • Ungraded Lab: Cosmopolitan (10 minutes)
  • Reading: Lesson Reflection (5 minutes)
  • Assignment: Quiz-Portable CLI with Cosmopolitan (10 minutes)
  • Reading: Key Terms (5 minutes)
  • Reading: What are LLMs? (5 minutes)
  • Video: Getting Started with Llamafile (3 minutes)
  • Video: Llamafile local system metrics (3 minutes)
  • Ungraded Lab: Portable CLI (10 minutes)
  • Reading: Lesson Reflection (5 minutes)
  • Assignment: Quiz-Running Llamafile (10 minutes)
  • Reading: Key Terms (1 minute)
  • Reading: Llamafile server (5 minutes)
  • Ungraded Lab: Local Llamafile API (10 minutes)
  • Reading: Course Conclusion (5 minutes)
  • Reading: Next Steps (1 minute)
  • Assignment: Final Quiz-Llamafile (10 minutes)
  • Discussion Prompt: Meet and Greet (optional) (1 minute)
Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

NoSQL Databases (edX) EdX
University System of Maryland - USM,USMx,University of Maryland Baltimore County,UMBC

NoSQL Databases (edX)

Build a foundation of the four NoSQL (non-relational) databases: Document-oriented, Key-Value Pair, Column-oriented, and Graph. This NoSQL database course provides you with an introduction, overview, and history of NoSQL databases (non-relational databases). In this course you will explore in detail the four types of NoSQL databases (e.g., Document-oriented, Key-Value Pair, Column-oriented, and Graph).

Sep 19th 2022
5-12 Weeks
Quantum Networking (edX) EdX
Purdue University,PurdueX

Quantum Networking (edX)

Learn about the science and engineering of future quantum networks whose security is guaranteed by laws of quantum physics. Applying exotic quantum properties such as entanglement to every-day applications such as communication and computation reveals new dimensions of such applications. Quantum encoding and entanglement distribution provide means to establish fundamentally secure communication links for transfer of classical and quantum data.

Mar 27th 2023
5-12 Weeks
Circuits and Electronics 3: Applications (edX) EdX
MIT,MITx

Circuits and Electronics 3: Applications (edX)

Learn about cool applications, op-amps and filters in the design of microchips used in smartphones, computers, and the internet. Want to learn how your radio works? Wondering how to implement filters using resistors, inductors, and capacitors? Wondering what are some other applications of RLC and CMOS circuits? This free circuit course, taught by edX CEO and MIT Professor Anant Agarwal and MIT colleagues, is for you.

Self Paced
Self-Paced
Big Data Analytics Using Spark (edX) EdX
University of California, San Diego,UC San DiegoX

Big Data Analytics Using Spark (edX)

Learn how to analyze large datasets using Jupyter notebooks, MapReduce and Spark as a platform. In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation. The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed files systems, such as the Hadoop Distributed File System (HDFS) and corresponding computational models, such as Hadoop, MapReduce and Spark.

Dec 5th 2023
5-12 Weeks