Customizing your spreadsheet import
Here you'll parse your spreadsheets and use additional arguments to skip rows, rename columns and select only particular columns.
The spreadsheet 'battledeath.xlsx' is already loaded as
xl.
As before, you'll use the method parse(). This time, however,
you'll add the additional arguments skiprows, names and
parse_cols. These skip rows, name the columns and designate
which columns to parse, respectively. All these arguments can
be assigned to lists containing the specific row numbers, strings
and column numbers, respectively.
This exercise is part of the course
Importing Data in Python
Exercise instructions
- Parse the first sheet by index. In doing so, skip the first row of data, then name the columns
'Country'and'AAM due to War (2002)'by using the argumentnames. The arguments passed toskiprowsandnameswill all need to be of typelist. - Parse the second sheet by index. In doing so, parse only the first column, skip the first row and rename the column
'Country'. The argument passed toparse_colswill need to be of typelist.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Parse the first sheet and rename the columns: df1
df1 = xl.parse(____, skiprows=____, names=____)
# Print the head of the DataFrame df1
print(df1.head())
# Parse the first column of the second sheet and rename the column: df2
# Print the head of the DataFrame df2
print(df2.head())