The col_names argument
Apart from path
and sheet
, there are several other arguments you can specify in read_excel()
. One of these arguments is called col_names
.
By default it is TRUE
, denoting whether the first row in the Excel sheets contains the column names. If this is not the case, you can set col_names
to FALSE
. In this case, R will choose column names for you. You can also choose to set col_names
to a character vector with names for each column. It works exactly the same as in the readr
package.
You'll be working with the urbanpop_nonames.xlsx
(view) file. It contains the same data as urbanpop.xlsx
(view) but has no column names in the first row of the excel sheets.
This exercise is part of the course
Introduction to Importing Data in R
Exercise instructions
- Import the first Excel sheet of
"urbanpop_nonames.xlsx"
and store the result inpop_a
. Have R set the column names of the resulting data frame itself. - Import the first Excel sheet of
urbanpop_nonames.xlsx
; this time, use thecols
vector that has already been prepared for you to specify the column names. Store the resulting data frame inpop_b
. - Print out the summary of
pop_a
. - Print out the summary of
pop_b
. Can you spot the difference with the other summary?
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Import the first Excel sheet of urbanpop_nonames.xlsx (R gives names): pop_a
pop_a <- read_excel("____", col_names = ____)
# Import the first Excel sheet of urbanpop_nonames.xlsx (specify col_names): pop_b
cols <- c("country", paste0("year_", 1960:1966))
pop_b <- ___
# Print the summary of pop_a
___
# Print the summary of pop_b
___