Get startedGet started for free

Filtering by company

Using that same dataset from the last exercise, you realized that you only care about the jobs that are entry level ("EN") in Canada ("CA"). What does the salaries look like there? Remember, there's already a SparkSession called spark in your workspace!

This exercise is part of the course

Introduction to PySpark

View Course

Exercise instructions

  • Filter to subset the DataFrame to where company_location is "CA".
  • Calculate the average of the salary_in_usd column.
  • Show the result!

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Average salary for entry level in Canada
CA_jobs = ca_salaries_df.____(ca_salaries_df[____] == "CA").filter(ca_salaries_df['experience_level']
 == "EN").groupBy().____("salary_in_usd")

# Show the result
CA_jobs.____()
Edit and Run Code