Counting genre popularity
To find the most common genres across the catalog, reshape the genres list column so each genre lands on its own row, then compute a sorted value count to see which genres top the list.
The movies DataFrame is preloaded for you.
This exercise is part of the course
Scaling and Optimizing Data Pipelines with Polars
Exercise instructions
- Explode the
genreslist column so each genre has its own row. - Compute the top 10 most common genres using a sorted value count.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Put each genre on its own row
genres = movies.select("genres").____("genres")["genres"]
# Rank by frequency
result = genres.drop_nulls().____(sort=True).head(10)
print(result)