Meet the human data
Welcome to the Dimensionality reduction techniques chapter.
In this chapter we will be using the human
dataset to introduce Principal Components Analysis (PCA). The data originates from the United Nations Development Programme. See their data page for more information. For a nice overview see also the calculating the human development indices pdf.
Most of the variable names have been shortened and two new variables have been computed. See the meta file for the modified data here for descriptions.
This exercise is part of the course
Helsinki Open Data Science
Exercise instructions
- Read the
human
data into memory - Print out the (column) names of the data
- Look at the structure of the data
- Print out summaries of the variables in the data
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# read the human data
human <- read.table("http://s3.amazonaws.com/assets.datacamp.com/production/course_2218/datasets/human1.txt", sep =",", header = T)
# look at the (column) names of human
names(human)
# look at the structure of human
# print out summaries of the variables