Print a 5-number summary
One of the quickest ways to get a feel for new data is the 5-number summary. It reports five metrics of a distribution: the minimum, 25th percentile, median, 75th percentile, and maximum. The printed output also includes the mean and standard deviation. Comparing the mean with the minimum and maximum gives a rough idea of whether outliers are present: an extreme value that lies far from the mean is a warning sign.
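As a rough illustration of the idea above, the sketch below computes the five metrics on a small made-up series and compares the mean with the median. The values in toy_prices are invented for this example and are not taken from the Airbnb data; the name toy_prices is likewise hypothetical.

```python
import pandas as pd

# Made-up nightly prices for illustration only; 950 is a deliberate outlier.
toy_prices = pd.Series([45, 60, 75, 80, 95, 110, 120, 950])

# Five-number summary: min, 25th percentile, median, 75th percentile, max.
five_num = toy_prices.quantile([0.0, 0.25, 0.5, 0.75, 1.0])
print(five_num)

# A mean far above the median hints that a few large values skew the data.
print("mean:", toy_prices.mean(), "median:", toy_prices.median())
```

Here the maximum sits far above the 75th percentile and the mean is pulled well above the median, which is the kind of gap that suggests an outlier.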
In this chapter's exercises, you will use the methods discussed in the videos to detect the most expensive (or inexpensive) US Airbnb listings by price. The dataset has been loaded as a pandas DataFrame named airbnb_df.
This exercise is part of the course
Anomaly Detection in Python
Exercise instructions
- Extract the price column into prices from the US Airbnb Listings data.
- Print the 5-number summary of the prices distribution.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Extract price
prices = ____
# Print 5-number summary
print(prices.____)
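One possible way to fill in the blanks is sketched below. It assumes the summary is meant to come from the pandas Series.describe() method, which the exercise text does not name explicitly, so the intended solution may differ. The airbnb_df built here is a stand-in with invented rows so that the snippet runs on its own; in the exercise the real DataFrame is already loaded.

```python
import pandas as pd

# Stand-in for the preloaded airbnb_df; these rows are invented, not course data.
airbnb_df = pd.DataFrame({"price": [45, 60, 75, 80, 95, 110, 120, 950]})

# Extract price
prices = airbnb_df["price"]

# Print 5-number summary (describe() also reports count, mean and std)
summary = prices.describe()
print(summary)
```

On a numeric column, describe() returns a Series indexed by count, mean, std, min, 25%, 50%, 75% and max, so the five metrics can be read directly from the printed output.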