Finding gaps in both DataFrames
For data quality checks, you want to see all rows from both DataFrames - listings without benchmarks and benchmarks without listings. This helps identify gaps before analysis.
polars is loaded as pl, and the DataFrames hotels and type_benchmarks are available for you.
Este exercício faz parte do curso
Data Transformation with Polars
Instruções do exercício
- Join
hotelswithtype_benchmarksontypeandbeach, keeping all rows from both DataFrames. - Use
coalesce=Trueto avoid duplicate join columns.
Exercício interativo prático
Experimente este exercício completando este código de exemplo.
# Keep all rows from both DataFrames
full_view = hotels.____(
type_benchmarks,
on=["type", "beach"],
how="____",
# Avoid duplicate columns
coalesce=____
)
print(full_view.head())