In this post we go through the process of analysing the Seattle Airbnb datasets available on Kaggle.
We are provided with 3 datasets by Kaggle.
- Listings, including full descriptions and average review score.
- Reviews, including unique id for each reviewer and detailed comments.
- Calendar, including listing id and the price and availability for that day.
We first need to thoroughly analyse each dataset first looking for fields that are of interest, check for NULLs, Unique values, Outliers and other statistically important information.
We are mainly driven by the below questions for our analysis:
I am a Data Science enthusiast