
Extracting prompts

You're working with the hendrydong/rlhf_helpful_eval dataset, which contains conversational text between a user (Human) and an assistant (Assistant). Your goal is to clean the text so that each prompt contains only the relevant content, with no speaker tags or other formatting patterns left in it.

Choose the option you'd use to clean the dataset to extract prompts. You can see a snippet of what the dataset looks like by running the following code in the console:

evaluation_data.iloc[range(1)]['text'][0]
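As a rough illustration of what such cleaning could look like, here is a minimal sketch. It assumes each turn in the text is introduced by a literal `Human:` or `Assistant:` tag and that the prompt is the first Human turn; check both assumptions against the snippet printed above. The name `extract_prompt` and the sample string are hypothetical and are not taken from the exercise.

```python
import re

# Assumption: every turn starts with a "Human:" or "Assistant:" tag,
# possibly surrounded by whitespace or blank lines.
SPEAKER_TAG = re.compile(r"\s*(Human|Assistant):\s*")


def extract_prompt(text: str) -> str:
    """Return the first Human turn with its tag and padding removed."""
    # With a capturing group, re.split keeps the tags in the result:
    # [prefix, tag, content, tag, content, ...]
    parts = SPEAKER_TAG.split(text)
    # Walk the (tag, content) pairs and return the first Human content.
    for tag, content in zip(parts[1::2], parts[2::2]):
        if tag == "Human":
            return content.strip()
    # No tags found: fall back to the trimmed input.
    return text.strip()


# Hypothetical sample in the assumed format (not real dataset content).
sample = "\n\nHuman: How do I bake bread?\n\nAssistant: Start with flour."
print(extract_prompt(sample))
```

If the assumptions hold, the function could be applied to the whole column with `evaluation_data['text'].apply(extract_prompt)`. A conversation where the tag strings also occur inside a message would need a stricter pattern.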

This exercise is part of the course

Reinforcement Learning from Human Feedback (RLHF)
