1. सीखें
  2. /
  3. पाठ्यक्रम
  4. /
  5. Big Data Fundamentals with PySpark

Connected

अभ्यास

Loading CSV into DataFrame

In the previous exercise, you have seen a method for creating a DataFrame from an RDD. Generally, loading data from CSV file is the most common method of creating DataFrames. In this exercise, you'll create a PySpark DataFrame from the people.csv file that is already provided to you as a file_path and confirm the created object is a PySpark DataFrame.

Remember, you already have a SparkSession spark and a variable file_path (the path to the people.csv file) available in your workspace.

निर्देश

100 XP
  • Create a DataFrame from file_path variable which is the path to the people.csv file.
  • Confirm the output as PySpark DataFrame.