Bir getirici (retrieval) fonksiyonu oluşturma

Retrieval Augmented Generation (RAG) iş akışında temel adımlardan biri, veritabanından veri getirmektir. Bu egzersizde, kursun final egzersizinde kritik rol oynayacak olan retrieve() adında özel bir fonksiyon tasarlayacaksın.

Bu egzersiz

Pinecone ile Vektör Veritabanları ve Embeddings

kursunun bir parçasıdır

Kursu Görüntüle

Egzersiz talimatları

Pinecone istemcisini API anahtarınla başlat (OpenAI istemcisi client olarak hazır).
Dört parametre alan retrieve fonksiyonunu tanımla: query, top_k, namespace ve emb_model.
emb_model argümanını kullanarak giriş query ifadesini embed et.
Fonksiyona argüman olarak verilen namespacei belirterek, query_emb ile en benzer top_k vektörü üstverileriyle birlikte getir.

Uygulamalı interaktif egzersiz

Bu örnek kodu tamamlayarak bu egzersizi bitirin.

# Initialize the Pinecone client
pc = Pinecone(api_key="____")
index = pc.Index('pinecone-datacamp')

# Define a retrieve function that takes four arguments: query, top_k, namespace, and emb_model
def retrieve(query, top_k, namespace, emb_model):
    # Encode the input query using OpenAI
    query_response = ____(
        input=____,
        model=____
    )
    
    query_emb = query_response.data[0].embedding
    
    # Query the index using the query_emb
    docs = index.query(vector=____, top_k=____, namespace=____, include_metadata=True)
    
    retrieved_docs = []
    sources = []
    for doc in docs['matches']:
        retrieved_docs.append(doc['metadata']['text'])
        sources.append((doc['metadata']['title'], doc['metadata']['url']))
    
    return retrieved_docs, sources

documents, sources = retrieve(
  query="How to build next-level Q&A with OpenAI",
  top_k=3,
  namespace='youtube_rag_dataset',
  emb_model="text-embedding-3-small"
)
print(documents)
print(sources)

Kodu Düzenle ve Çalıştır

Bu egzersiz

Pinecone ile Vektör Veritabanları ve Embeddings

kursunun bir parçasıdır

IntermediárioNível de habilidade

4.8+

Kursa Ücretsiz Başlayın

Explore the mechanics behind Pinecone's vector database, from pods and indexes to comparing it with other databases. Learn to differentiate pod types, acquire API keys, and initialise Pinecone connection using python. Finally, you’ll learn how to create Pinecone indexes, exploring different parameters such as dimensionality, distance metrics, pod types, and others.

Exercise 1: Introduction to Pinecone indexes Exercise 2: Creating a Pinecone client Exercise 3: Your first Pinecone index Exercise 4: Managing indexes Exercise 5: Connecting to an index Exercise 6: Deleting an index Exercise 7: The Pinecone ecosystem Exercise 8: Vector ingestion Exercise 9: Checking dimensionality Exercise 10: Ingesting vectors with metadata

Get hands-on with Pinecone in Python, where we explore the practical side of using Pinecone for managing indexes, adding vectors with metadata, searching and retrieving vectors, and making updates or deletions. Gain a solid grasp of the key functions and ideas to smoothly handle data in the Pinecone vector database.

Exercise 1: Retrieving vectors Exercise 2: Querying vs. fetching Exercise 3: Fetching vectors Exercise 4: Querying vectors Exercise 5: Returning the most similar vectors Exercise 6: Changing distance metrics Exercise 7: Metadata filtering Exercise 8: Filtering queries Exercise 9: Multiple metadata filters Exercise 10: Updating and deleting vectors Exercise 11: Updating vector values Exercise 12: Updating vector metadata Exercise 13: Deleting vectors

In this chapter, learners delve into optimizing Pinecone index performance, leveraging multi-tenant namespaces for cost reduction, building semantic search engines, and creating retrieval-augmented question answering systems using Pinecone with the OpenAI API. Through these lessons, learners gain practical skills in performance tuning, semantic search, and retrieval-augmented question answering, empowering them to apply Pinecone effectively in real-world AI applications.

Exercise 1: Toplu upsert işlemleri Exercise 2: Parçalama (chunking) için bir fonksiyon tanımlama Exercise 3: Upsert işlemlerini parçalara bölerek toplu yapmak Exercise 4: Toplu upsert işlemlerini paralel çalıştırma Exercise 5: Çoklu kiracılık ve ad alanları Exercise 6: Namespaces Exercise 7: Ad alanlarını sorgulama Exercise 8: Pinecone ile anlamsal arama Exercise 9: Bir Pinecone indeksi oluşturma ve yapılandırma Exercise 10: Anlamsal arama için vektörleri upsert etme Exercise 11: Anlamsal arama için vektör sorgulama Exercise 12: Pinecone ve OpenAI ile RAG sohbet botu Exercise 13: YouTube transkriptlerini upsert etme Exercise 14: Bir getirici (retrieval) fonksiyonu oluşturma

Geçerli Egzersiz

Exercise 15: RAG soru yanıtlama fonksiyonu Exercise 16: Tebrikler!