1. Learn
  2. /
  3. Courses
  4. /
  5. Introduction to Data Pipelines

Exercise

Extracting data from parquet files

One of the most common ways to ingest data from a source system is by reading data from a file, such as a CSV file. As data has gotten bigger, the need for better file formats has brought about new column-oriented file types, such as parquet files.

In this exercise, you'll practice extracting data from a parquet file.

Instructions

100 XP
  • Read the parquet file at the path "sales_data.parquet" into a pandas DataFrame.
  • Check the data types of the DataFrame via print()ing.
  • Output the shape of the DataFrame, as well as it's head.