Extracting prompts
You're working with the hendrydong/rlhf_helpful_eval
dataset, which contains conversational text between a user (Human) and an assistant (Assistant). Your goal is to clean the text, preparing a clean prompt where only the relevant text appears with no specific patterns or tags.
Choose the option you'd use to clean the dataset to extract prompts. You can see a snippet of what the dataset looks like by running the following code in the console:
evaluation_data.iloc[range(1)]['text'][0]
This exercise is part of the course
Reinforcement Learning from Human Feedback (RLHF)
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
