Filtering by company
Using that same dataset from the last exercise, you realized that you only care about the jobs that are entry level ("EN"
) in Canada ("CA"
). What does the salaries look like there?
Remember, there's already a SparkSession
called spark
in your workspace!
This exercise is part of the course
Introduction to PySpark
Exercise instructions
- Filter to subset the DataFrame to where
company_location
is"CA"
. - Calculate the average of the
salary_in_usd
column. - Show the result!
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Average salary for entry level in Canada
CA_jobs = ca_salaries_df.____(ca_salaries_df[____] == "CA").filter(ca_salaries_df['experience_level']
== "EN").groupBy().____("salary_in_usd")
# Show the result
CA_jobs.____()