Create a new column
In the last exercise, you converted the column plane_year
to an integer. This column holds the year each plane was manufactured. However, your model will use the planes' age, which is slightly different from the year it was made!
This exercise is part of the course
Foundations of PySpark
Exercise instructions
- Create the column
plane_age
using the.withColumn()
method and subtracting the year of manufacture (columnplane_year
) from the year (columnyear
) of the flight.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Create the column plane_age
model_data = model_data.withColumn("plane_age", ____.____ - ____.____)