🏔️ Summit County Housing Analysis

A Personal Data Science Project

What started as curiosity became a full-stack data science exploration.

🏔️ Local Context

Understanding the housing market in my area.

🛠️ Data Engineering

Building a production-grade ETL pipeline from raw, messy public records.

🔮 ML Inference

Testing the limits of browser-based ML... and my own skillset.

📊 1. Data Story

Explore the data through interactive visualizations:

Distance to ski lifts and resort proximity analysis
20+ years of price trends vs. interest rates
Buyer origin patterns (local vs. out-of-state)
Seasonal purchase patterns and market cycles
Raw property data samples with 20+ attributes

🧬 2. ML Experiments

Dive into the model development process:

Tournament leaderboard comparing 10+ model runs
Gradient Boosting vs. Neural Network performance
SHAP values showing feature importance
Partial Dependence Plots for all numeric features
Model selection and version comparison tools

🔮 3. Price Predictor

Test the model with your own scenarios:

Interactive "What-If" simulator with real-time predictions
Adjust property features (size, beds, location, etc.)
Runs entirely in your browser using ONNX Runtime
Compare predictions across different model versions
No backend required—pure client-side ML inference

🛠️ How to use the Product

You can explore the dashboard immediately using the cards above or the navigation. The steps below are optional and intended for developers who want to run the data pipeline manually.

To see the full instructions, see the Public GitHub Repo →

1. Data Collection make scrape

Runs the asynchronous scraper to pull the latest property records.

2. ETL Pipeline make ingest

Resets the local SQLite warehouse and performs complex SQL feature engineering.

3. Model Training make tournament

Triggers a parameter sweep tournament. The best model is promoted.

⚡ Quick Presets

Property:

Market:

🧠 Current Model

Model Architecture

GBM NN

Model Version

🏠 House Properties

SqFt

Beds / Baths

Built

Property Type

Location

Grade

View (0-5)

Lot Size (Acres)

Garage (SqFt)

Condition

Dist. to Lift (miles)

📈 Macro-Economic Conditions

Rate (%)

CPI (Inflation)

S&P 500

County Pop. (k)

🧠 AI Explainer (Local Feature Contributions)

This chart shows how each feature pulls the predicted price away from the county average.

Sensitivity:

Summit County Housing Data Story

Exploring publicly available data to understand the housing market in Summit County, Colorado.

📍 The Landscape

Summit County, Colorado is a high-altitude destination, where skiing and mountain living are primary drivers of the economy. Click the map to identify the county hotspots and landmarks.

❄️ The "Ski Lift Effect"

Ski-in/ski-out properties are marketed to tourists and second-home owners. It feels intuitive that these properties command a premium, but is worth quantifying. This map shows every residential parcel's distance to its nearest chairlift.

📈 Market Cycles

Mortgage rates impact the cost of borrowing to purchase a home. This chart shows how prices and mortgage rates have moved over time.

County Aggregate Exclude Outliers (> 2σ)

🏠 Who is Buying?

Ownership records provide a mailing address, which I use as a proxy for the buyer's home base. This chart tracks the shift between locals, in-state buyers, and out-of-state investors.

Show as %

🌡️ Seasonal Pulse

Housing oftens sells seasonally and Summit County is no exception. This heatmap shows the number of sales by month and town.

Show as % of Town Total

🏗️ Supply & Density

There's a narrative right now that home prices are high due to supply constraints. Let's dig into that.

🔍 Deep Dive: The Data Explorer

This is a sample of the raw data that powers this analysis. Use the toggle below to view the full dataset attributes.

Show All Columns

📍 Points of Interest

🛠️ Model Feature Selection

✅ INCLUDED Structure, Quality, Location, Macro

❌ EXCLUDED Tax Values, Trans. Date, Address

🧠 Training Safeguards

Time-Based Split (No leakage)
Log-Space Target Transformation

📊 Correlation Matrix

Click any cell to drill down into the underlying relationship.

🧬 ML Experiment Registry

Training history and model convergence metrics.

Run ID	Model	MAE	R2	Status	Action

🏔️ Summit County Housing Analysis

📊 1. Data Story

🧬 2. ML Experiments

🔮 3. Price Predictor

🛠️ How to use the Product

🧠 AI Explainer (Local Feature Contributions)

Sensitivity:

Sensitivity:

Summit County Housing Data Story

📍 The Landscape

❄️ The "Ski Lift Effect"

📈 Market Cycles

🏠 Who is Buying?

🌡️ Seasonal Pulse

🏗️ Supply & Density

🔍 Deep Dive: The Data Explorer

🛠️ Model Feature Selection

🧠 Training Safeguards

📊 Correlation Matrix

Correlation Drill-down

🧬 ML Experiment Registry

Model Interpretability

SHAP Feature Importance

Partial Dependence (Marginal Effect)