Aan de slagGa gratis aan de slag

Understanding comparison and rating in RLHF

Now, it's your turn. Imagine you're designing an AI assistant and need to understand user satisfaction. You're considering collecting comparison-based feedback or, alternatively, ratings. But what are the differences between the two? Each method has its own characteristics, and choosing the right one can greatly impact the success of your product.

Deze oefening maakt deel uit van de cursus

Reinforcement Learning from Human Feedback (RLHF)

Cursus bekijken

Praktische interactieve oefening

Zet theorie om in actie met een van onze interactieve oefeningen.

Begin met trainen