Print a 5-number summary
One of the quickest methods for getting a feel for new data is the 5-number summary. It prints out 5 metrics about a distribution - the minimum, 25th percentile, median, 75th percentile, and the maximum along with mean and standard deviation. By looking at the 5-number summary and the difference between the mean and the minimum/maximum values, you can get a rough idea of whether outliers are present in the distribution.
In the exercises of this chapter, you will be using the methods discussed in the videos to detect the prices of the most expensive (or inexpensive) US Airbnb listings. The dataset has been loaded as airbnb_df
as a pandas DataFrame.
Diese Übung ist Teil des Kurses
Anomaly Detection in Python
Anleitung zur Übung
- Extract the
price
column intoprices
from the US Airbnb Listings data. - Print the 5-number summary of the
prices
distribution.
Interaktive Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
# Extract price
prices = ____
# Print 5-number summary
print(prices.____)