
Quiz 4 - Question 1

Imagine that you have built a tiny language model with a vocabulary of 5 tokens. This model predicted the following probability distribution over the next token:

the: 0.05
chased: 0.01
a: 0.04
lion: 0.44
zebra: 0.46

If you apply top-k sampling with k=3, what is the modified probability distribution from which the model samples the next token?
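The question can be checked with a short sketch. It assumes the usual convention for top-k sampling: keep the k most probable tokens, set every other token's probability to zero, and rescale the kept probabilities so they sum to 1. The function name `top_k_filter` is hypothetical and is not taken from the course material.

```python
def top_k_filter(probs, k):
    """Keep the k most probable tokens, zero out the rest, and renormalize.

    Hypothetical helper for illustration; assumes no ties at the cutoff.
    """
    # Sort tokens by probability, highest first, and keep the first k.
    kept = sorted(probs, key=probs.get, reverse=True)[:k]
    # Total probability mass of the kept tokens, used for rescaling.
    total = sum(probs[token] for token in kept)
    # Tokens outside the top k get 0; kept tokens are rescaled to sum to 1.
    return {
        token: (probs[token] / total if token in kept else 0.0)
        for token in probs
    }


# Distribution from the question.
probs = {"the": 0.05, "chased": 0.01, "a": 0.04, "lion": 0.44, "zebra": 0.46}

filtered = top_k_filter(probs, k=3)
for token, p in filtered.items():
    print(f"{token}: {p:.4f}")
```

With k=3 the kept tokens are zebra, lion, and the, whose original probabilities sum to 0.95. Dividing each by 0.95 gives roughly 0.4842, 0.4632, and 0.0526, while chased and a drop to 0.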

This exercise is part of the course

Google DeepMind: Discover The Transformer Architecture
