1. Learn
  2. /
  3. Courses
  4. /
  5. Reinforcement Learning from Human Feedback (RLHF)

Connected

Exercise

Unreliable data source identification

Your team is developing a model for assisting in generating accurate reporting in the automotive safety industry. You have gathered preference data from three data sources - a "GlobalDrive Safety Institute," an "AutoTech Safety Alliance," and "QuickScan Auto Review". Recently, concerns have arisen about the integrity of the data, and you have been asked to assess it for any unreliable data sources.

automotive_df is a combined DataFrame loaded using the pre-imported pandas library. It contains data from the three sources. The pre-imported majority_vote function creates a dictionary-like object with the majority (chosen, rejected) pair per 'id'.

Instructions

100 XP
  • Define the condition for counting one disagreement with the majority vote for a given data source.