Create a SQL table from a dataframe
A dataframe can be used to create a temporary table, which exists only for the duration of the Spark session. The Spark documentation calls this kind of table a SQL temporary view, and describes the operation as registering the dataframe as a SQL temporary view. The method that does this is called on the dataframe itself: it creates the table if it does not already exist, and replaces it with the current data from the dataframe if it does.
This exercise is part of the course
Introduction to Spark SQL in Python
Exercise instructions
- Load csv data from the file trainsched.txt into a dataframe stored in a variable named df.
- Create a temporary table from df. Call the table "table1".
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load trainsched.txt
df = spark.____.____("trainsched.txt", header=True)
# Create temporary table called table1
df.____(____)