Aan de slagBegin gratis

Creating an Enum column

Movies have a fixed status vocabulary like Released, Rumored, Post Production, and so on. Since the allowed values are known upfront, an Enum is a better fit than Categorical: it adds validation that catches any unknown status before it pollutes the pipeline.

movies is still available, along with a pre-defined status_enum listing every allowed value.

Deze oefening maakt deel uit van de cursus

Scaling and Optimizing Data Pipelines with Polars

Bekijk cursus

Oefeninstructies

  • Cast the status column using the pre-defined status_enum.

Interactieve oefening met praktijkervaring

Probeer deze oefening door deze voorbeeldcode aan te vullen.

result = movies.with_columns(
    # Cast to the enum
    pl.col("____").____(____)
).select("movie_title", "status").head(8)
print(result)
Code bewerken en uitvoeren