Reinforcement Learning from Human Feedback (RLHF)

Exercise

Extracting prompts

You're working with the hendrydong/rlhf_helpful_eval dataset, which contains conversational text between a user (Human) and an assistant (Assistant). Your goal is to clean this text so that each prompt contains only the relevant content, with the speaker tags and other formatting patterns removed.

Choose the option you'd use to clean the dataset to extract prompts. You can see a snippet of what the dataset looks like by running the following code in the console:

evaluation_data.iloc[range(1)]['text'][0]
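
To make the cleaning step concrete, here is a minimal sketch of one way to pull a prompt out of a conversation string. It assumes each turn is introduced by a literal `Human:` or `Assistant:` tag, which the exercise text suggests but which should be checked against the snippet printed above. The function name `extract_prompt` and the sample string are hypothetical, not part of the course material.

```python
import re

# Assumed speaker tags; the real dataset may mark turns differently.
_TURN = re.compile(r"(Human|Assistant):")


def extract_prompt(text: str) -> str:
    """Return the first Human turn with speaker tags and extra whitespace removed."""
    # Splitting on a capturing group keeps the speaker names in the result:
    # [preamble, speaker, utterance, speaker, utterance, ...]
    parts = _TURN.split(text)
    for speaker, utterance in zip(parts[1::2], parts[2::2]):
        if speaker == "Human":
            # Collapse newlines and repeated spaces into single spaces.
            return " ".join(utterance.split())
    # No Human tag found: fall back to the whitespace-normalised text.
    return " ".join(text.split())


# Hypothetical sample in the assumed format, for illustration only.
sample = "\n\nHuman: How do I bake bread?\n\nAssistant: Start with flour and yeast."
print(extract_prompt(sample))
```

If `evaluation_data` is a pandas DataFrame, as the `iloc` call above implies, the same function could be applied to the whole column with `evaluation_data['text'].apply(extract_prompt)`.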

Instructions

Possible answers