Counting genre popularity
To find the most common genres across the catalog, reshape the genres list column so each genre lands on its own row, then compute a sorted value count to see which genres top the list.
The movies DataFrame is preloaded for you.
Deze oefening maakt deel uit van de cursus
Scaling and Optimizing Data Pipelines with Polars
Oefeninstructies
- Explode the
genreslist column so each genre has its own row. - Compute the top 10 most common genres using a sorted value count.
Interactieve oefening met praktijkervaring
Probeer deze oefening door deze voorbeeldcode aan te vullen.
# Put each genre on its own row
genres = movies.select("genres").____("genres")["genres"]
# Rank by frequency
result = genres.drop_nulls().____(sort=True).head(10)
print(result)