CommencerCommencez gratuitement

Creating reusable expressions

Now that you can transform columns, you want to understand which artists have a dedicated fanbase by calculating streams per listener. Rather than rewriting this formula each time, store the calculation as a Polars expression variable that you can reuse across multiple DataFrames.

polars is loaded as pl. The DataFrame spotify is available with album streaming data.

Cet exercice fait partie du cours

<cours>Data Transformation with Polars</cours>
Voir le cours

Instructions de l’exercice

  • Create an expression that divides streams by monthly_listeners.
  • Apply the expression variable and sort by streams_per_listener in descending order.

Exercice interactif pratique

Essayez cet exercice en complétant ce code d’exemple.

# Create a reusable expression for streams per listener
streams_per_listener_expr = (
    (pl.col("____") / pl.col("____"))
    .alias("streams_per_listener")
)

# Apply the expression and sort by streams_per_listener
result = spotify.with_columns(____).sort(
    "streams_per_listener", descending=____
)

print(result.head())
Modifier et exécuter le code