Filtering by company

Using the same dataset from the last exercise, you realize that you only care about entry-level ("EN") jobs in Canada ("CA"). What do the salaries look like there? Remember, there's already a SparkSession called spark in your workspace!

This exercise is part of the course

Introduction to PySpark

Exercise instructions

  • Filter to subset the DataFrame to where company_location is "CA".
  • Calculate the average of the salary_in_usd column.
  • Show the result!

Hands-on interactive exercise

Try this exercise by completing the sample code below.

# Average salary for entry level in Canada
CA_jobs = (
    ca_salaries_df.____(ca_salaries_df[____] == "CA")
    .filter(ca_salaries_df['experience_level'] == "EN")
    .groupBy()
    .____("salary_in_usd")
)

# Show the result
CA_jobs.____()