Filtering by company
Using the same dataset as in the last exercise, you realize that you only care about the jobs that are entry level ("EN") in Canada ("CA"). What do the salaries look like there?
Remember, there's already a SparkSession called spark in your workspace!
This exercise is part of the course
Introduction to PySpark
Exercise instructions
- Filter to subset the DataFrame to where company_location is "CA".
- Calculate the average of the salary_in_usd column.
- Show the result!
Hands-on interactive exercise
Try this exercise by completing this sample code.
# Average salary for entry level in Canada
CA_jobs = ca_salaries_df.____(ca_salaries_df[____] == "CA").filter(ca_salaries_df['experience_level'] == "EN").groupBy().____("salary_in_usd")
# Show the result
CA_jobs.____()