Counting genre popularity
To find the most common genres across the catalog, reshape the genres list column so each genre lands on its own row, then compute a sorted value count to see which genres top the list.
The movies DataFrame is preloaded for you.
Diese Übung ist Teil des Kurses
<Kurs>Scaling and Optimizing Data Pipelines with Polars</Kurs>Übungsanweisungen
- Explode the
genreslist column so each genre has its own row. - Compute the top 10 most common genres using a sorted value count.
Interaktive praktische Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
# Put each genre on its own row
genres = movies.select("genres").____("genres")["genres"]
# Rank by frequency
result = genres.drop_nulls().____(sort=True).head(10)
print(result)