Filtering by company
Using the same dataset from the last exercise, you realize that you only care about the jobs that are entry level ("EN") in Canada ("CA"). What do the salaries look like there?
Remember, there's already a SparkSession called spark in your workspace!
This exercise is part of the course
Introduction to PySpark
Instructions
- Filter to subset the DataFrame to where company_location is "CA".
- Calculate the average of the salary_in_usd column.
- Show the result!
Hands-on interactive exercise
Try this exercise by completing this sample code.
# Average salary for entry level in Canada
CA_jobs = (
    ca_salaries_df.____(ca_salaries_df[____] == "CA")
    .filter(ca_salaries_df['experience_level'] == "EN")
    .groupBy()
    .____("salary_in_usd")
)
# Show the result
CA_jobs.____()