Evaluating RLHF with metrics
You are training a model using reinforcement learning with human feedback (RLHF) and need to evaluate its performance using appropriate metrics.
Which of the following would be best suited for evaluating a summarization task using automated metrics?
You can visualize the data using the console to help you decide on which metrics is most suited for the task. The data you need to evaluate has been pre-loaded as a DataFrame named df_evaluation
.
This exercise is part of the course
Reinforcement Learning from Human Feedback (RLHF)
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
