Getting started with text splitting
Time to start splitting! You've been provided with a statement about RAG stored in the string variable text. Your job is to split this string on occurrences of the '.' character. Take a look at the splitting results to see how this strategy performed.
This exercise is part of the course
Retrieval Augmented Generation (RAG) with LangChain
Instructions
- Define a LangChain character text splitter that will split on the '.' character with a chunk size of 75 and chunk overlap of 10.
- Split text using the text_splitter you defined.
Hands-on interactive exercise
Try this exercise by completing this sample code.
text = '''RAG (retrieval augmented generation) is an advanced NLP model that combines retrieval mechanisms with generative capabilities. RAG aims to improve the accuracy and relevance of its outputs by grounding responses in precise, contextually appropriate data.'''
# Define a text splitter that splits on the '.' character
text_splitter = ____(
____,
____,
____
)
# Split the text using text_splitter
chunks = text_splitter.____
print(chunks)
print([len(chunk) for chunk in chunks])
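
To build intuition for how this strategy behaves before filling in the blanks, here is a minimal pure-Python sketch of separator-based splitting. It is not LangChain's implementation: the helper name split_on_separator is hypothetical, chunk overlap is left out for brevity, and the merging rule (pack pieces together while they fit within chunk_size) is a simplifying assumption.

```python
text = '''RAG (retrieval augmented generation) is an advanced NLP model that combines retrieval mechanisms with generative capabilities. RAG aims to improve the accuracy and relevance of its outputs by grounding responses in precise, contextually appropriate data.'''


def split_on_separator(text, separator=".", chunk_size=75):
    """Hypothetical helper: split on one separator, then pack pieces into chunks.

    Simplified illustration only; overlap handling is deliberately omitted.
    """
    # Break the text at every separator and drop empty or whitespace-only pieces.
    pieces = [piece.strip() for piece in text.split(separator) if piece.strip()]

    chunks = []
    current = ""
    for piece in pieces:
        # Try to append the next piece to the chunk being built.
        candidate = piece if not current else current + separator + " " + piece
        if len(candidate) <= chunk_size:
            current = candidate
        else:
            # The piece does not fit: close the current chunk (if any) and start anew.
            if current:
                chunks.append(current)
            # A piece longer than chunk_size is kept whole, because the only
            # allowed split point is the separator itself.
            current = piece
    if current:
        chunks.append(current)
    return chunks


chunks = split_on_separator(text, separator=".", chunk_size=75)
print(chunks)
print([len(chunk) for chunk in chunks])
```

Both sentences in text are longer than 75 characters, and a '.' only occurs at the end of each one, so under these assumptions every chunk exceeds the requested size. That limitation of splitting on a single separator is worth looking for when you inspect the lengths printed by your own solution.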