Checking results

In this exercise, you're going to check the results of your DBSCAN fraud detection model. In reality, you often don't have reliable labels, and this is where a fraud analyst can help you validate the results. They can check your results and see whether the cases you flagged are indeed suspicious. You can also check historically known cases of fraud and see whether your model flags them.

In this case, you'll use the fraud labels to check your model results. The predicted cluster numbers are available as pred_labels, and the original fraud labels as labels.

Create a dataframe combining the cluster numbers with the actual labels. This has been done for you.
Create a condition that flags fraud for the three smallest clusters: clusters 21, 17 and 9.
Create a crosstab from the actual fraud labels with the newly created predicted fraud labels.
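
The three steps above could be sketched as follows. This is a minimal illustration, not the exercise's official solution: the small pred_labels and labels arrays are made-up stand-ins for the real exercise data, and the column names clusternr, fraud and predicted_fraud are assumptions chosen for readability.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-ins for the exercise's pred_labels and labels;
# the real arrays come from the fitted DBSCAN model and the dataset.
pred_labels = np.array([0, 21, 17, 3, 9, 0, 21, 5])
labels = np.array([0, 1, 0, 0, 1, 0, 1, 1])

# Step 1: combine the cluster numbers with the actual fraud labels.
# The column names are assumptions, not taken from the exercise.
df = pd.DataFrame({'clusternr': pred_labels, 'fraud': labels})

# Step 2: flag an observation as fraud (1) when it falls in one of
# the three smallest clusters (21, 17 or 9), otherwise 0.
df['predicted_fraud'] = np.where(df['clusternr'].isin([21, 17, 9]), 1, 0)

# Step 3: cross-tabulate actual fraud labels against the flagged ones.
crosstab = pd.crosstab(
    df['fraud'],
    df['predicted_fraud'],
    rownames=['Actual Fraud'],
    colnames=['Flagged Fraud'],
)
print(crosstab)
```

Rows of the crosstab are the actual labels and columns are the flags. The off-diagonal cells count the misses: actual fraud that was not flagged, and flagged cases that were not fraud.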
