Region premiums
In this exercise, your objective is to plot histograms that compare the yearly insurance premiums of policyholders across different regions. To achieve this, you will define a plotting chain that adjusts the units of the insurance charges from dollars to thousands of dollars, enhancing the visualization. You will plot the histograms on a two-by-two grid.
The DataFrames, StatsPlots and Chain packages have been imported, and the insurance DataFrame has been loaded.
This exercise is part of the course
Introduction to Data Visualization with Julia
Exercise instructions
- Apply the
transform()function to divide each element of the:Chargescolumn by1000and assign the resulting values back to the same column. - Set the
layoutargument to generate a two-by-two grid of histograms. - Customize the axis labels by setting the x-axis label of the bottom two histograms to
"Premium (kUSD)", and set the y-axis label of the first and third histograms to"Frequency".
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
colors = [:slategray1 :springgreen4 :deeppink :darkviolet]
@chain insurance begin
# Change charge units
transform(:____
=> ByRow(x -> ____) => :____)
# Create histograms
@df histogram(:Charges, group=:Region,
layout=____,
color=colors,
# Set axis labels
xlabel=____,
ylabel=____)
end
ylims!(0, 120)