Evaluating RLHF with metrics
You are training a model using reinforcement learning from human feedback (RLHF) and need to evaluate its performance using appropriate metrics.
Which of the following would be best suited for evaluating a summarization task using automated metrics?
You can inspect the data in the console to help you decide which metric is best suited to the task. The data you need to evaluate has been pre-loaded as a DataFrame named df_evaluation.
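As a starting point for the console, here is a minimal sketch of how a DataFrame like this might be inspected. The exercise does not show the real contents of df_evaluation, so the stand-in frame, its column names (metric_a, metric_b), its values, and the summarize helper are all hypothetical placeholders.

```python
import pandas as pd

# Hypothetical stand-in for the pre-loaded df_evaluation. The real column
# names and values are not given in the exercise and will differ.
df_evaluation = pd.DataFrame(
    {
        "metric_a": [0.41, 0.38, 0.45],
        "metric_b": [0.72, 0.69, 0.75],
    }
)


def summarize(df):
    """Return the shape, column names, and per-column mean of a DataFrame."""
    return {
        "shape": df.shape,
        "columns": list(df.columns),
        "means": df.mean(numeric_only=True).to_dict(),
    }


# Look at the first rows, then at a compact overview of the whole frame.
print(df_evaluation.head())
overview = summarize(df_evaluation)
print(overview["shape"], overview["columns"])
print(overview["means"])
```

In the actual exercise the DataFrame is already loaded, so only the inspection calls (head, columns, mean) would be needed.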
This exercise is part of the course
Reinforcement Learning from Human Feedback (RLHF)
