Extracting prompts
You're working with the hendrydong/rlhf_helpful_eval
dataset, which contains conversational text between a user (Human) and an assistant (Assistant). Your goal is to clean the text, preparing a clean prompt where only the relevant text appears with no specific patterns or tags.
Choose the option you'd use to clean the dataset to extract prompts. You can see a snippet of what the dataset looks like by running the following code in the console:
evaluation_data.iloc[range(1)]['text'][0]
Diese Übung ist Teil des Kurses
Reinforcement Learning from Human Feedback (RLHF)
Interaktive Übung
In dieser interaktiven Übung kannst du die Theorie in die Praxis umsetzen.
