Exploring and pseudonymizing a dataset
In this exercise, you will explore the student performance dataset and decide which columns are more suitable for specific sampling anonymization techniques.
numpy
has already been loaded as np
and the dataset is loaded as students
.
This exercise is part of the course
Data Privacy and Anonymization in Python
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Print the summarized information of the DataFrame
print(____)
# Print the number of unique values in each column of students
print(____)