Exercise

Transactionalizing the online data frame

Back to the online retail transactions data from Chapter 1! The cleaned version of the dataset is loaded in your workspace as Online_Retail_clean. Before you can apply algorithms for your Market Basket Analysis, you need to transactionalize the data, that is, group the purchased items by transaction. This makes extracting association rules later on quicker and more efficient.

Recall that in this dataset, the product or item being purchased is described in the column Description, while the column InvoiceNo holds a unique invoice number that you can treat as a transaction ID.

Instructions 1/3
  • Create a list of transactions from Online_Retail_clean, where each transaction contains the products purchased in it.
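
To make the idea of transactionalizing concrete, here is a minimal sketch in Python using only the standard library. It is not the exercise's expected solution: the course environment and its tooling are not shown here, and the sample rows, the invoice IDs, and the helper name build_transactions are all hypothetical. Only the column names InvoiceNo and Description come from the exercise text.

```python
from collections import defaultdict

# Hypothetical toy rows standing in for Online_Retail_clean.
# Only the column names (InvoiceNo, Description) come from the exercise.
rows = [
    {"InvoiceNo": "A001", "Description": "mug"},
    {"InvoiceNo": "A001", "Description": "teapot"},
    {"InvoiceNo": "A002", "Description": "mug"},
    {"InvoiceNo": "A003", "Description": "candle"},
    {"InvoiceNo": "A003", "Description": "teapot"},
    {"InvoiceNo": "A003", "Description": "mug"},
]


def build_transactions(rows):
    """Group item descriptions by invoice number (the transaction ID)."""
    transactions = defaultdict(list)
    for row in rows:
        transactions[row["InvoiceNo"]].append(row["Description"])
    # Return a plain dict: one entry per transaction, items in row order.
    return dict(transactions)


transactions = build_transactions(rows)
for invoice, items in transactions.items():
    print(invoice, items)
```

Each key is one transaction ID and each value is the list of products bought in that transaction, which is the shape that association-rule algorithms typically expect as input.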