Analyze data distribution
Many product managers are adept at exploring the data around the product features for which they are responsible. In this scenario, you are a data savvy product manager handling the newsfeed for a social media application. You suspect that your newsfeed data collection infrastructure might have an issue with anomalous data. The average number of views per article per sample period is skewed way upwards from the typical number of views.
You start by examining the reading behavior of the top five articles in a one week reporting period to get a sense of what the reading behavior looks like for five popular but otherwise typical news articles.
The dc_news_dim
and dc_news_fact
tables are available to you.
This exercise is part of the course
Time Series Analysis in PostgreSQL
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
-- Calculate the max, average, and continuous median views
SELECT
___ AS max,
___ AS average,
___ AS median
FROM dc_news_fact;