Practice query plans
A dataframe text_df
is available. This dataframe is registered as a table called table1
.
This exercise is part of the course
Introduction to Spark SQL in Python
Exercise instructions
- Run explain on
text_df
. - Run explain on a SQL query that does a "SELECT COUNT(*) as count" on
table1
. - Run explain on a SQL query that counts the number of unique words in
table1
.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Run explain on text_df
text_df.____()
# Run explain on "SELECT COUNT(*) AS count FROM table1"
spark.sql("SELECT COUNT(*) AS count FROM table1").____()
# Run explain on "SELECT COUNT(DISTINCT word) AS words FROM table1"
spark.sql("____").____()