Aan de slagGa gratis aan de slag

A first look

Using the full Avazu dataset, you will explore various new features by looking at the data types of columns. The new data includes categorical columns such as site_id, app_id, device_id, etc. all of which are various identifiers for a given site, app, and user respectively. To start off, you will identify and print out the numerical and categorical columns.

Sample data in DataFrame form is loaded as df. pandas as pd is also available in your workspace.

Deze oefening maakt deel uit van de cursus

Predicting CTR with Machine Learning in Python

Cursus bekijken

Oefeninstructies

  • Print the columns of df using .columns.
  • Print the corresponding data types of df using .dtypes.
  • Select the subset of df with numerical columns (by using include = ['int', 'float']) and print those columns.
  • Select the subset of df with categorical columns (by using include = ['object']) and print those columns.

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Print columns
print(df.____)

# Print data types of columns
print(df.____)

# Select and print numeric columns
numeric_df = df.____(include=['____', 'float'])
print(numeric_df.____)

# Select and print categorical columns 
categorical_df = df.____(include=['____'])
print(categorical_df.____)
Code bewerken en uitvoeren