BaşlayınÜcretsiz Başlayın

What are we predicting?

Which of these fields (or columns) is the value we are trying to predict for?

  • TAXES
  • SALESCLOSEPRICE
  • DAYSONMARKET
  • LISTPRICE

Bu egzersiz

Feature Engineering with PySpark

kursunun bir parçasıdır
Kursu Görüntüle

Egzersiz talimatları

  • From the listed columns above, identify which one we will use as our dependent variable $Y$.
  • Using the loaded data set df, filter it down to our dependent variable with select(). Store this dataframe in the variable Y_df.
  • Display summary statistics for the dependent variable using describe() on Y_df and calling show() to display it.

Uygulamalı interaktif egzersiz

Bu örnek kodu tamamlayarak bu egzersizi bitirin.

# Select our dependent variable
Y_df = df.____([____])

# Display summary statistics
Y_df.____().____()
Kodu Düzenle ve Çalıştır