Prepare data for the snake plot
Now you will prepare data for the snake plot. You will use the 3-cluster RFM segmentation solution you have built previously. You will transform the normalized RFM data into a long format by "melting" the metric columns into two columns - one for the name of the metric, and another for the actual numeric value.
We have loaded the normalized RFM data with the cluster labels already assigned. It is loaded as apandas DataFrame named datamart_normalized. Also, pandas is imported as pd.
Explore the datamart_normalized in the console before you begin the exercise to get a good sense of its structure!
This exercise is part of the course
Customer Segmentation in Python
Exercise instructions
- Transform the dataset into long format by applying
meltfunction on the normalized dataset with a reset index. - Assign
CustomerIDandClusteras ID variables. - Assign RFM values as value variables.
- Name the variable as
Metricand the value asValue.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Melt the normalized dataset and reset the index
datamart_melt = pd.____(
____.____(),
# Assign CustomerID and Cluster as ID variables
____=['____', '____'],
# Assign RFM values as value variables
____=['____', '____', '____'],
# Name the variable and value
____='____', ____='____'
)