
Pandas UDFs

This exercise covers pandas UDFs so that you can practice their syntax. As you work through it, notice the differences between the PySpark UDF from the last exercise and this type of UDF.

Remember, there's already a SparkSession called spark in your workspace!
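To make the contrast between the two UDF types concrete, here is a minimal sketch using a different operation (adding 1) than the one in the exercise. The names add_one_batch, add_one_scalar, and with_both_udfs are hypothetical and not part of the course material. The sketch assumes pandas, pyspark, and pyarrow are installed, and that the DataFrame passed in has a numeric column called "value". The Spark imports are kept inside the wrapper function so that the batch logic can be tried on a plain pandas Series first.

```python
import pandas as pd


def add_one_batch(column: pd.Series) -> pd.Series:
    """Vectorized logic: receives a whole batch of values as a pandas Series."""
    return column + 1.0


def add_one_scalar(value: float) -> float:
    """Row-at-a-time logic: receives a single Python value per call."""
    return value + 1.0


def with_both_udfs(df):
    """Wrap both functions as Spark UDFs and apply them to df's "value" column.

    Hypothetical helper: assumes pyspark and pyarrow are available and that
    df is a Spark DataFrame with a numeric "value" column.
    """
    from pyspark.sql.functions import pandas_udf, udf
    from pyspark.sql.types import DoubleType

    # pandas UDF: Spark passes batches of the column as pandas Series
    batch_udf = pandas_udf(add_one_batch, returnType=DoubleType())
    # Regular PySpark UDF: Spark calls the function once per row
    scalar_udf = udf(add_one_scalar, DoubleType())

    return (
        df.withColumn("batch_plus_one", batch_udf(df["value"]))
          .withColumn("scalar_plus_one", scalar_udf(df["value"]))
    )
```

Calling with_both_udfs(df).show() on a suitable DataFrame should display both new columns side by side. The main difference to notice is the argument each function receives: a pandas Series in the first case, a single value in the second.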

This exercise is part of the course Introduction to PySpark.

Exercise instructions

  • Define the add_ten_pandas() function as a pandas UDF.
  • Add a new column to the DataFrame called "10_plus" that applies the pandas UDF to the df column "value".
  • Show the resulting DataFrame.

Hands-on interactive exercise

Complete this sample code to finish the exercise.

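# DoubleType is the return type passed to the decorator below
from pyspark.sql.types import DoubleType

# Define a Pandas UDF that adds 10 to each element in a vectorized way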
@____(DoubleType())
def add_ten_pandas(column):
    return column + 10

# Apply the UDF and show the result
# withColumn returns a new DataFrame, so assign the result back to df
df = df.withColumn("10_plus", ____)
df.____