Get startedGet started for free

Creating an Enum column

Movies have a fixed status vocabulary like Released, Rumored, Post Production, and so on. Since the allowed values are known upfront, an Enum is a better fit than Categorical: it adds validation that catches any unknown status before it pollutes the pipeline.

movies is still available, along with a pre-defined status_enum listing every allowed value.

This exercise is part of the course

Scaling and Optimizing Data Pipelines with Polars

View Course

Exercise instructions

  • Cast the status column using the pre-defined status_enum.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

result = movies.with_columns(
    # Cast to the enum
    pl.col("____").____(____)
).select("movie_title", "status").head(8)
print(result)
Edit and Run Code