Karaktere göre bölme

Retrieval Augmented Generation (RAG) uygularken önemli bir süreç, belgeleri bir vektör veritabanında depolamak üzere parçalara ayırmaktır.

LangChain'de bazıları diğerlerinden daha karmaşık rutinlere sahip birkaç farklı bölme stratejisi bulunur. Bu egzersizde, belgeleri karakterlere göre bölen ve parça uzunluğunu karakter sayısıyla ölçen bir character text splitter uygulayacaksın.

Unutma, tek bir ideal bölme stratejisi yoktur; kullanım durumuna en uygununu bulmak için birkaçını denemen gerekebilir.

Bu egzersiz

LangChain ile LLM Uygulamaları Geliştirme

kursunun bir parçasıdır

Kursu Görüntüle

Egzersiz talimatları

langchain_text_splitters içinden CharacterTextSplitter sınıfını içe aktar.
separator="\n", chunk_size=24 ve chunk_overlap=10 ile bir CharacterTextSplitter örneği oluştur.
.split_text() yöntemini kullanarak quote'u böl ve parçaları ile parça uzunluklarını yazdır.

Uygulamalı interaktif egzersiz

Bu örnek kodu tamamlayarak bu egzersizi bitirin.

# Import the character splitter
from langchain_text_splitters import ____

quote = 'Words are flowing out like endless rain into a paper cup,\nthey slither while they pass,\nthey slip away across the universe.'
chunk_size = 24
chunk_overlap = 10

# Create an instance of the splitter class
splitter = CharacterTextSplitter(
    separator=____,
    chunk_size=____,
    chunk_overlap=____)

# Split the string and print the chunks
docs = splitter.____(quote)
print(docs)
print([len(doc) for doc in docs])

Kodu Düzenle ve Çalıştır

Bu egzersiz

LangChain ile LLM Uygulamaları Geliştirme

kursunun bir parçasıdır

IntermediárioNível de habilidade

4.8+

Kursa Ücretsiz Başlayın

Welcome to the LangChain framework for building applications on LLMs! You'll learn about the main components of LangChain, including models, chains, agents, prompts, and parsers. You'll create chatbots using both open-source models from Hugging Face and proprietary models from OpenAI, create prompt templates, and integrate different chatbot memory strategies to manage context and resources during conversations.

Exercise 1: The LangChain ecosystem Exercise 2: OpenAI models in LangChain!Exercise 3: Hugging Face models in LangChain!Exercise 4: Prompt templates Exercise 5: Prompt templates and chaining Exercise 6: Chat prompt templates Exercise 7: Few-shot prompting Exercise 8: Creating the few-shot example set Exercise 9: Building the few-shot prompt template Exercise 10: Implementing few-shot prompting

Time to level up your LangChain chains! You'll learn to use the LangChain Expression Language (LCEL) for defining chains with greater flexibility. You'll create sequential chains, where inputs are passed between components to create more advanced applications. You'll also begin to integrate agents, which use LLMs for decision-making.

Exercise 1: Sequential chains Exercise 2: Building prompts for sequential chains Exercise 3: Sequential chains with LCEL Exercise 4: Introduction to LangChain agents Exercise 5: What's an agent?Exercise 6: ReAct agents Exercise 7: Custom tools for agents Exercise 8: Defining a function for tool use Exercise 9: Creating custom tools Exercise 10: Integrating custom tools with agents

One limitation of LLMs is that they have a knowledge cut-off due to being trained on data up to a certain point. In this chapter, you'll learn to create applications that use Retrieval Augmented Generation (RAG) to integrate external data with LLMs. The RAG workflow contains a few different processes, including splitting data, creating and storing the embeddings using a vector database, and retrieving the most relevant information for use in the application. You'll learn to master the entire workflow!

Exercise 1: Belge yükleyicileri entegre etme Exercise 2: PDF belge yükleyi̇ci̇ler Exercise 3: CSV belge yükleyi̇ci̇ler Exercise 4: HTML belge yükleyi̇ci̇ler Exercise 5: Alım için harici verileri bölme Exercise 6: Karaktere göre bölme

Geçerli Egzersiz

Exercise 7: Karaktere göre özyinelemeli bölme Exercise 8: HTML'i bölme Exercise 9: RAG vektör veri tabanları kullanarak depolama ve geri çağırma Exercise 10: Belgeleri ve vektör veritabanını hazırlama Exercise 11: Bir alma istemi şablonu oluşturma Exercise 12: Bir RAG zinciri oluşturma Exercise 13: Toparlanın!