Ratios
Ratios are all around us. Whether it's miles per gallon or click through rate, they are everywhere. In this exercise, we'll create some ratios by dividing out pairs of columns.
Cet exercice fait partie du cours
Feature Engineering with PySpark
Instructions
- Create a new variable ASSESSED_TO_LISTby dividingASSESSEDVALUATIONbyLISTPRICEto help us understand if the having a high or low assessment value impacts our price.
- Create another new variable TAX_TO_LISTto help us understand the approximate tax rate by dividingTAXESbyLISTPRICE.
- Lastly create another variable BED_TO_BATHSto help us know how crowded our bathrooms might be by dividingBEDROOMSbyBATHSTOTAL.
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# ASSESSED_TO_LIST
df = ____
df[['ASSESSEDVALUATION', 'LISTPRICE', 'ASSESSED_TO_LIST']].show(5)
# TAX_TO_LIST
df = ____
df[['TAX_TO_LIST', 'TAXES', 'LISTPRICE']].show(5)
# BED_TO_BATHS
df = ____
df[['BED_TO_BATHS', 'BEDROOMS', 'BATHSTOTAL']].show(5)