Available for new engagements
Profile photo of Noah Weidig
Data Science • Analytics • GIS

Hi, I'm Noah

I turn messy data and raw geography into trusted pipelines, decision-ready analytics, and maps people actually read.

Works across
  • R & Python
  • Google Earth Engine
  • ArcGIS
  • Shiny & Quarto
  • Mapbox

Key metrics

4+

Years applied experience

25+

Pipelines & apps shipped

100%

Reproducible, documented

End‑to‑end

Ingest, model, visualize

Explore
Capabilities

What I do, end‑to‑end

Four pillars that turn noisy, scattered information into decisions your team can defend.

Data Wrangling

Reliable pipelines, schema validation, and scalable transformations that keep downstream analytics honest.

  • Tidyverse
  • GeoPandas
  • SQL

Analytics & Modeling

Reproducible analysis, hypothesis testing, and statistical models that move the KPIs leadership actually watches.

  • GLM / GAM
  • Time series
  • PCA

Geospatial & GIS

From TIGER roads to wildfire risk — I build spatial workflows in Earth Engine, ArcGIS, and Mapbox that answer where and why.

  • GEE
  • ArcGIS
  • Mapbox

Visualization

Dashboards, Shiny apps, Quarto reports, and publication-ready charts that make complex findings feel obvious.

  • Shiny
  • ggplot2
  • Quarto
In Practice

Three moves I make on every engagement

Scroll each act to watch the before / after.

01

I clean your data

Real data arrives in chaos — missing IDs, ages spelled out in words, impossible weights, inconsistent codings. I audit every column, flag each anomaly, and repair or document every issue so your downstream analysis stands on solid ground.

raw — before
idgenderageweight
NA"female""twenty-two"NA
34"M"3478.2
999"male"-7NA
29"female"2965.1
NA"male"419999
51"female"5172.4
-1"FEMALE"NA61.8
38"male"38NA
analysis-ready — after
idsexageweight
42F2268.4imputed
34M3478.2
dropped · outlier
29F2965.1
42M41dropped · outlier
51F5172.4
F4661.8imputed
38M3868.4imputed
02

I build reproducible pipelines

Brittle scripts that only run on one person's laptop are a liability. I design modular, version-controlled workflows — schema validation on ingest, pure functions, locked dependencies — so any teammate can reproduce your results from a cold start.

brittle scripts — before
01_load.R Error in read.csv
setwd("C:/Users/dan/Desktop") · no one else can run this
cleaning FINAL.R object not found
depends on df_raw from a different session · never saved
model_v4_USE THIS.R package missing
library() calls buried mid-script · order matters
make_plots.R never reached
upstream failure · output files missing
reproducible pipeline — after
R/01_ingest.R
here::here() paths · schema validated on load
R/02_clean.R
pure functions · testthat coverage · logged
R/03_model.R
set.seed() · renv.lock · targets DAG
R/04_visualize.R
ggsave() · deterministic · quarto report
03

I create publication-ready visualizations

Default chart themes export visual noise that buries your story. I strip away clutter, apply purposeful color, and use direct labels so readers see your finding immediately — no legend hunting required.

ggplot2 defaults
group_a group_b Jan May Sep Dec 0 10 20 30 40 50 value ~ month + group n=24, method=lm, geom=point+line month value
theme_custom()
Group A Group B Jan Jun Dec 0 10 20 30 Monthly value by group

and so much more.

Stack

Skills & tools I reach for daily

The technical stack, organized by domain.

Languages

  • R
  • Python
  • SQL
  • JavaScript
  • Bash
  • HTML
  • CSS
  • Markdown
  • LaTeX

Analytics & Stats

  • Regression
  • GLM / GAM
  • Mixed models
  • Time series
  • Forecasting
  • ANOVA / MANOVA
  • PCA
  • Clustering
  • Bootstrapping
  • Maximum likelihood
  • Hypothesis testing

Geospatial & GIS

  • Google Earth Engine
  • ArcGIS / ESRI
  • Mapbox
  • MapLibre
  • Leaflet
  • OSM
  • GDAL
  • GeoPandas
  • Spatial analysis
  • Geoprocessing

Visualization & Apps

  • Shiny
  • ggplot2
  • Quarto
  • Jupyter

Data Engineering

  • Tidyverse
  • NumPy / Pandas
  • Git / GitHub
  • renv / targets
  • Data wrangling
  • Pipeline design
  • Schema validation
Why Hire Me

What you actually get when you bring me on

Not a consultant who hands you a slide deck — an operator who hands you the pipeline, the repo, and the docs.

Decision-ready narratives

I don't just crunch numbers — I translate them into the two or three sentences an executive needs to make the call.

Reproducible by default

Every analysis lands as a versioned repo: locked dependencies, tested functions, and a README your team can run from a cold start.

Geospatial fluency

Planet-scale raster, national vector, urban analytics — I've shipped against all of them and published open datasets to prove it.

Autonomy, not babysitting

I scope, communicate, unblock myself, and surface tradeoffs early. You get a teammate, not another ticket in your queue.

Let's work

Got a data problem worth solving?

One email. Plain English. I'll tell you whether I'm the right fit — and if I'm not, who is.

Indicates required field