Meet the human data

Welcome to the Dimensionality reduction techniques chapter.

In this chapter we will be using the human dataset to introduce Principal Components Analysis (PCA). The data originates from the United Nations Development Programme. See their data page for more information. For a nice overview see also the calculating the human development indices pdf.

Most of the variable names have been shortened and two new variables have been computed. See the meta file for the modified data here for descriptions.

This exercise is part of the course

Helsinki Open Data Science

View Course

Exercise instructions

Read the human data into memory
Print out the (column) names of the data
Look at the structure of the data
Print out summaries of the variables in the data

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# read the human data
human <- read.table("http://s3.amazonaws.com/assets.datacamp.com/production/course_2218/datasets/human1.txt", sep  =",", header = T)

# look at the (column) names of human
names(human)

# look at the structure of human


# print out summaries of the variables

Edit and Run Code