Get startedGet started for free

Introduction

1. Introduction

Hello. My name is Elliot. I will be your instructor with this case study using Databricks.

2. Overview

Databricks is a cloud-based platform for data analytics, combining big data processing with tools for querying and visualizing data. It enables teams to analyze large datasets efficiently, collaborate on data projects, and create dashboards, all in one workspace. In this case study you will be working as a consultant data analyst at a consultancy company called Data X, a firm specializing in advanced data analytics solutions. So who will be your client? Your client is Airbnb. They have tasked us with analyzing one of their most active markets, New York City. As a consultant, your goal is to help them to understand its performance and the customer behavior in this market. Why we choose New York City? Because it offers a diverse dataset with thousands of listings, which present us with a unique opportunity to answer questions like: How do prices vary across different areas? What type of properties are most popular in each borough?

3. Goal of the case study

So your key objectives will include understanding the Airbnb dataset structure, analyzing the listings and the reviews to identify trends, provide insights for pricing, customer experience and operational strategies. By analyzing the data, you will provide our clients with insights to improve pricing, customer experience, and other strategic decisions.

4. Dataset overview

Next, let's explore the dataset provided by Airbnb in New York City, which is divided into two sets. The first asset is the property listing table. This table provides details about each property, including price, location, property type, and host information, across New York's five boroughs. The second data is our customer reviews table. This captures the data on customer reviews, including number of reviews, scores, last reviewed date and additional metrics related to hosts.

5. Get started!

Now let's connect to our Databricks workspace and start our exploration. In the provided Databricks workspace, we'll start by exploring the structure of these datasets and identifying key variables for analysis.

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.