MulaiMulai sekarang secara gratis

Load in the data

Reading in data is the first step to using PySpark for data science! Let's leverage the new industry standard of parquet files!

Latihan ini adalah bagian dari kursus

Feature Engineering with PySpark

Lihat Kursus

Petunjuk latihan

  • Use the parquet() file reader to read in 'Real_Estate.parq' as described in the video exercise.
  • Print out the list of columns with columns.

Latihan interaktif praktis

Cobalah latihan ini dengan menyelesaikan kode contoh berikut.

# Read the file into a dataframe
df = spark.read.____(____)
# Print columns in dataframe
____(df.____)
Edit dan Jalankan Kode