Classifying generated text for RLHF
You now want to categorize the generated reviews. One of the ways you can evaluate the output is by measuring the positivity of the generated reviews using the classifier lvwerra/distilbert-imdb
, which you can also instantiate using Hugging Face pipelines.
The pipeline
library has been pre-imported from transformers
. The lvwerra/distilbert-imdb
model has been pre-loaded as model
. The tokenizer has been pre-loaded as tokenizer
.
Diese Übung ist Teil des Kurses
Reinforcement Learning from Human Feedback (RLHF)
Anleitung zur Übung
- Use the
pipeline
function to create a sentiment-analysis pipeline with the model. - Classify the sentiment of the review provided.
Interaktive Übung
Vervollständige den Beispielcode, um diese Übung erfolgreich abzuschließen.
# Create a sentiment analysis pipeline
sentiment_analyzer = pipeline(____, model=____, tokenizer=____)
review_text = "Surprisingly, the film is a very good one"
# Classify the sentiment of the review
sentiment = sentiment_analyzer(____)
print(f"Sentiment Analysis Result: {sentiment}")