Creating a SparkSession

We've already created a SparkSession for you called spark, but what if you're not sure one already exists? Creating multiple SparkSessions and SparkContexts can cause issues, so the best practice is to call the SparkSession.builder.getOrCreate() method. It returns the existing SparkSession if there is one in the environment and creates a new one only if necessary.

This exercise is part of the course

Foundations of PySpark

Exercise instructions

  • Import SparkSession from pyspark.sql.
  • Make a new SparkSession called my_spark using SparkSession.builder.getOrCreate().
  • Print my_spark to the console to verify it's a SparkSession.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Import SparkSession from pyspark.sql
from ____ import ____

# Create my_spark
my_spark = ____

# Print my_spark
print(____)