Counting genre popularity
To find the most common genres across the catalog, reshape the genres list column so each genre lands on its own row, then compute a sorted value count to see which genres top the list.
The movies DataFrame is preloaded for you.
Questo esercizio fa parte del corso
Scaling and Optimizing Data Pipelines with Polars
Istruzioni dell'esercizio
- Explode the
genreslist column so each genre has its own row. - Compute the top 10 most common genres using a sorted value count.
esercizio interattivo pratico
Prova questo esercizio completando questo codice di esempio.
# Put each genre on its own row
genres = movies.select("genres").____("genres")["genres"]
# Rank by frequency
result = genres.drop_nulls().____(sort=True).head(10)
print(result)