Standardizing Your Data
In the lecture, we saw that you can learn a lot about a dataset by creating the matrix \(A^TA\) from it. In this exercise, you'll do that with athletic data for players entering the National Football League college draft. The dataset combine
is loaded for you.
Este ejercicio forma parte del curso
Linear Algebra for Data Science in R
Instrucciones del ejercicio
- Extract only the numerical elements of the data frame by taking only the 5th through 12th columns. Call this
A
(we cannot do math on the non-numerical components in columns 1 through 4). - Turn this data frame into a matrix by using the
as.matrix()
command. - Subtract the mean of each of the columns of the matrix.
Ejercicio interactivo práctico
Prueba este ejercicio completando el código de muestra.
# Extract columns 5-12 of combine
A <- combine[, ___:___]
# Make A into a matrix
A <- ___(A)
# Subtract the mean of each column
A[, ___] <- A[, 1] - mean(A[, 1])
A[, 2] <- A[, 2] - ___(A[, 2])
A[, ___] <- A[, 3] - mean(A[, 3])
A[, ___] <- A[, ___] - mean(A[, 4])
A[, 5] <- A[, 5] - mean(A[, 5])
A[, ___] <- A[, 6] - mean(A[, ___])
A[, 7] <- A[, ___] - mean(A[, 7])
A[, ___] <- A[, 8] - mean(A[, 8])