Self vs. multi-head attention
You are a data analyst in an AI development team. Your current project involves understanding and implementing the concepts of self-attention and multi-head attention in a language model. Consider the following phrases from a conversation dataset.
Use the interactive application to explore the difference between self-attention (relationships between specific words within text) and multi-head attention (processing multiple aspects of text simultaneously).
Which analysis tasks are examples of self-attention vs. multi-head attention?
This exercise is part of the course
Large Language Models (LLMs) Concepts
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
Start Exercise