Ratios
Ratios are all around us. Whether it's miles per gallon or click through rate, they are everywhere. In this exercise, we'll create some ratios by dividing out pairs of columns.
Este ejercicio forma parte del curso
Feature Engineering with PySpark
Instrucciones del ejercicio
- Create a new variable
ASSESSED_TO_LIST
by dividingASSESSEDVALUATION
byLISTPRICE
to help us understand if the having a high or low assessment value impacts our price. - Create another new variable
TAX_TO_LIST
to help us understand the approximate tax rate by dividingTAXES
byLISTPRICE
. - Lastly create another variable
BED_TO_BATHS
to help us know how crowded our bathrooms might be by dividingBEDROOMS
byBATHSTOTAL
.
Ejercicio interactivo práctico
Prueba este ejercicio y completa el código de muestra.
# ASSESSED_TO_LIST
df = ____
df[['ASSESSEDVALUATION', 'LISTPRICE', 'ASSESSED_TO_LIST']].show(5)
# TAX_TO_LIST
df = ____
df[['TAX_TO_LIST', 'TAXES', 'LISTPRICE']].show(5)
# BED_TO_BATHS
df = ____
df[['BED_TO_BATHS', 'BEDROOMS', 'BATHSTOTAL']].show(5)