
String to integer

Now you'll use the .cast() method you learned in the previous exercise to convert all the appropriate columns from your DataFrame model_data to integers!

To convert the type of a column using the .cast() method, you can write code like this:

dataframe = dataframe.withColumn("col", dataframe.col.cast("new_type"))
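When several columns need the same conversion, the pattern above can be wrapped in a small loop. The following is a minimal sketch, not part of the course material: the helper name cast_columns is hypothetical, and it assumes only that the object passed in behaves like a PySpark DataFrame (it supports df["col"] lookup and .withColumn()). It uses bracket notation, df["col"], which refers to the same column as df.col but works when the column name is held in a variable.

```python
def cast_columns(df, columns, new_type):
    """Return a DataFrame with each named column cast to new_type.

    Hypothetical helper: `df` is assumed to be a PySpark DataFrame.
    Only df[name], Column.cast() and DataFrame.withColumn() are used.
    """
    for name in columns:
        # withColumn() returns a new DataFrame rather than modifying df in
        # place, so the result is reassigned on every pass through the loop.
        df = df.withColumn(name, df[name].cast(new_type))
    return df
```

A call might look like result = cast_columns(some_df, ["col_a", "col_b"], "integer"), where some_df and the column names are placeholders. Because .withColumn() builds a new DataFrame each time, the DataFrame passed in is left unchanged.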

This exercise is part of the course

Foundations of PySpark


Exercise instructions

  • Use .withColumn() together with .cast() to convert the following columns to type "integer". Access the columns using the df.col notation:
    • model_data.arr_delay
    • model_data.air_time
    • model_data.month
    • model_data.plane_year

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Cast the columns to integers
model_data = model_data.withColumn("arr_delay", ____)
model_data = model_data.withColumn("air_time", ____)
model_data = model_data.withColumn("month", ____)
model_data = model_data.withColumn("plane_year", ____)