Prompting Vision Language Models (VLMs)
Over the next two exercises, you'll use a multi-modal model to analyze the sentiment of a news article and its corresponding headline image from the BBC News dataset on Hugging Face:

To start, you will prepare a chat template for the model that includes both the image and the news article. The dataset (dataset) and headline image (image) have been loaded.
Este ejercicio forma parte del curso
Multi-Modal Models with Hugging Face
Instrucciones del ejercicio
- Load the news article content (
content) from the datapoint at index6in thedataset. - Complete the text query to insert
contentintotext_queryusing f-strings. - Add the
imageandtext_queryto the chat template, specifying the content type oftext_queryas"text".
Ejercicio interactivo práctico
Prueba este ejercicio y completa el código de muestra.
# Load the news article content from datapoint 6
content = ____
# Complete the text query
text_query = f"Does the news article have a positive, negative, or neutral impact on championship winning chances: {____}. Provide reasoning."
# Add the text query dictionary to the chat template
chat_template = [
{
"role": "user",
"content": [
{
"type": "image",
"image": ____,
},
____
],
}
]