Imputing missing plane prices
Now there's just one column with missing values left!
You've removed the "Additional_Info"
column from planes
—the last step is to impute the missing data in the "Price"
column of the dataset.
As a reminder, you generated this boxplot, which suggested that imputing the median price based on the "Airline"
is a solid approach!
This exercise is part of the course
Exploratory Data Analysis in Python
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Calculate median plane ticket prices by Airline
airline_prices = planes.____("____")["____"].____()
print(airline_prices)