Examining The SparkContext
In this exercise you'll get familiar with the SparkContext.
You'll probably notice that code takes longer to run than you might expect. This is because Spark is some serious software. It takes more time to start up than you might be used to. You may also find that running simpler computations might take longer than expected. That's because all the optimizations that Spark has under its hood are designed for complicated operations with big data sets. That means that for simple or small problems Spark may actually perform worse than some other solutions!
Questo esercizio fa parte del corso
Foundations of PySpark
Istruzioni dell'esercizio
Get to know the SparkContext.
- Call
print()onscto verify there's aSparkContextin your environment. print()sc.versionto see what version of Spark is running on your cluster.
Esercizio pratico interattivo
Prova a risolvere questo esercizio completando il codice di esempio.
# Verify SparkContext
print(____)
# Print Spark version
print(____)