1
Understanding LLMs and Llama
Free
The field of large language models has exploded, and Llama is a standout. With Llama 3, possibilities have soared. Explore how it was built, learn to use it with llama-cpp-python, and understand how to craft precise prompts to control the model's behavior.
2
Using Llama Locally
Language models are often useful as agents, and in this Chapter, you'll explore how you can leverage llama-cpp-python's capabilities for local text generation and creating agents with personalities. You'll also learn about decoding parameters' impact on output quality. Finally, you'll build specialized inference classes for diverse text generation tasks.
3
Finetuning Llama for Customer Service using Hugging Face & Bitext Dataset
Language models are powerful, and you can unlock their full potential with the right techniques. Learn how fine-tuning can significantly improve the performance of smaller models for specific tasks. Dive into fine-tuning smaller Llama models to enhance their task-specific capabilities. Next, discover parameter-efficient fine-tuning techniques such as LoRA, and explore quantization to load and use even larger models.
4
Creating a Customer Service Chatbot with Llama and LangChain
LLMs work best when they solve a real-world problem, such as creating a customer service chatbot using Llama and LangChain. Explore how to customize LangChain, integrate fine-tuned models, and craft templates for a real-world use case, utilizing RAG to enhance your chatbot's intelligence and accuracy. This chapter equips you with the technical skills to develop responsive and specialized chatbots.

Initializing

Make a philosophy chatbot

You are a tester at a company building AI personas, and your task is to evaluate how well the new Llama models are able to generate completions in certain voices and styles.

You will make a chatbot that thinks it's a philosopher and answers questions by pretending it is Plato. You are given a partially completed create_chat_completion call, which you will modify to make the chatbot respond to a user's question as if it was Plato himself.

Fill in the dictionary in the first index of the messages list with the instructions to make the model respond as if it is the Greek philosopher Plato and the appropriate role.
Fill in the dictionary in the second index of messages with the prompting question from the user and the appropriate role.
Ensure that both your instruction, and the user's question are correctly passed to the function call.