6 years of data (2019-2025) analyzed with 5 ML algorithms. 3 data sources combined. Inside Airbnb Oct 2025 data included.
Inside Airbnb data reveals dramatic market transformation
NYC regulations drastically reduced supply, prices surged
Gradient Boosting achieves $49 mean absolute error
Feature importance: Room Type + Location drive 60% of predictions
K-Means clustering reveals 5 distinct market segments
PCA visualization + Elbow method for optimal K
Isolation Forest identifies overpriced and underpriced listings
1,108 overpriced | 1,325 underpriced deals
DBSCAN clustering and demand forecasting
Density-based spatial clustering of premium listings
Booking rate patterns with regression trends
Exploratory data analysis across NYC
48,895 listings color-coded by borough and price
Manhattan: $197 | Brooklyn: $124
Most listed vs most expensive
Entire home vs private room pricing
Multi-listing professionals dominate supply
Two Kaggle sources analyzed
All insights at a glance