String to integer
Now you'll use the .cast()
method you learned in the previous exercise to convert all the appropriate columns from your DataFrame model_data
to integers!
To convert the type of a column using the .cast()
method, you can write code like this:
dataframe = dataframe.withColumn("col", dataframe.col.cast("new_type"))
This exercise is part of the course
Foundations of PySpark
Exercise instructions
- Use the method
.withColumn()
to.cast()
the following columns to type"integer"
. Access the columns using thedf.col
notation:model_data.arr_delay
model_data.air_time
model_data.month
model_data.plane_year
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Cast the columns to integers
model_data = model_data.withColumn("arr_delay", ____)
model_data = model_data.withColumn("air_time", ____)
model_data = model_data.withColumn("month", ____)
model_data = model_data.withColumn("plane_year", ____)