Print a 5-number summary
One of the quickest methods for getting a feel for new data is the 5-number summary. It prints out 5 metrics about a distribution - the minimum, 25th percentile, median, 75th percentile, and the maximum along with mean and standard deviation. By looking at the 5-number summary and the difference between the mean and the minimum/maximum values, you can get a rough idea of whether outliers are present in the distribution.
In the exercises of this chapter, you will be using the methods discussed in the videos to detect the prices of the most expensive (or inexpensive) US Airbnb listings. The dataset has been loaded as airbnb_df
as a pandas DataFrame.
Este exercício faz parte do curso
Anomaly Detection in Python
Instruções do exercício
- Extract the
price
column intoprices
from the US Airbnb Listings data. - Print the 5-number summary of the
prices
distribution.
Exercício interativo prático
Experimente este exercício completando este código de exemplo.
# Extract price
prices = ____
# Print 5-number summary
print(prices.____)