1. Learn
  2. /
  3. Courses
  4. /
  5. Cluster Analysis in R

Connected

Exercise

Revisiting wholesale data: Exploration

From the previous analysis you have found that k = 2 has the highest average silhouette width. In this exercise you will continue to analyze the wholesale customer data by building and exploring a kmeans model with 2 clusters.

Instructions

100 XP
  • Build a k-means model called model_customers for the customers_spend data using the kmeans() function with centers = 2.
  • Extract the vector of cluster assignments from the model model_customers$cluster and store this in the variable clust_customers.
  • Append the cluster assignments as a column cluster to the customers_spend data frame and save the results to a new data frame called segment_customers.
  • Calculate the size of each cluster using count().