Evaluating RLHF with metrics
You are training a model using reinforcement learning with human feedback (RLHF) and need to evaluate its performance using appropriate metrics.
Which of the following would be best suited for evaluating a summarization task using automated metrics?
You can visualize the data using the console to help you decide on which metrics is most suited for the task. The data you need to evaluate has been pre-loaded as a DataFrame named df_evaluation
.
Este ejercicio forma parte del curso
Reinforcement Learning from Human Feedback (RLHF)
Ejercicio interactivo práctico
Pon en práctica la teoría con uno de nuestros ejercicios interactivos
